Hadoop in Flight: Migrating Live MapReduce Jobs for Power-Shifting Data Centers
Citations Over TimeTop 10% of 2016 papers
Abstract
Renewable energy sources such as wind and solar are unpredictable for power utilities, which must produce exactly as much power as is needed at any given time. To help manage the demand, some utilities have begun deploying real-time energy prices to their customers. Data centers, which often run Hadoop jobs on thousands of machines, have become some of the utilities' largest consumers. In fact, recent studies have shown that, when processing at full capacity, data centers can require as much power as a mid-sized U. S. city. By implementing a method in which data centers can offload their work to locations on different power grids, they can take advantage of the lower-priced energy and thereby minimize operational costs. To this end, we have designed and implemented a new mechanism directly within the Hadoop 2 codebase that allows users to pause, migrate, and resume a job at arbitrary points of execution. We have evaluated this scheme using popular applications and show that energy can be delayed and shifted to a different location with reasonable overheads. Our experiments justify the migration use-case, showing that it saves both energy and time over either restarting the job remotely or allowing it to complete locally.
Related Papers
- → Is Hot IT a False Economy? An Analysis of Server and Data Center Energy Efficiency as Temperatures Rise(2023)4 cited
- → Assessment of Data Center Energy Efficiency. Methods and Metrics(2018)4 cited
- → Joint Host-Network Power Scaling with Minimizing VM Migration in SDN-enabled Cloud Data Centers(2020)3 cited
- → Energy Efficiency Assessment for Data Center in Finland: Case Study(2011)2 cited
- → Using live virtual machine migration to improve resource efficiency on virtualized data centers(2012)1 cited