FRESH: Fair and Efficient Slot Configuration and Scheduling for Hadoop Clusters
Citations Over TimeTop 10% of 2014 papers
Abstract
Hadoop is an emerging framework for parallel big data processing. While becoming popular, Hadoop is too complex for regular users to fully understand all the system parameters and tune them appropriately. Especially when processing a batch of jobs, default Hadoop setting may cause inefficient resource utilization and unnecessarily prolong the execution time. This paper considers an extremely important setting of slot configuration which by default is fixed and static. We proposed an enhanced Hadoop system called FRESH which can derive the best slot setting, dynamically configure slots, and appropriately assign tasks to the available slots. The experimental results show that when serving a batch of MapReduce jobs, FRESH significantly improves the makespan as well as the fairness among jobs.
Related Papers
- → Minimizing makespan on a batch-processing machine with non-identical job sizes using genetic algorithms(2006)128 cited
- Makespan minimization on a single batch processing machine with unequal job ready times(2011)
- → Single machine batch scheduling problem to minimize makespan with controllable setup and jobs processing times(2015)1 cited
- → Parallel-batching scheduling problem with family jobs for minimizing makespan(2010)2 cited
- Scheduling of Semi-Continuous Batching Machine in Chains Structure(2010)