Abstract
MapReduce has become the standard model for supporting big data analytics. In particular, MapReduce job optimization has been widely considered to be crucial in the implementations of big data analytics. However, there is still a lack of guidelines especially for practitioners to understand how the MapReduce jobs can be optimized. This paper aims to systematic identify and taxonomically classify the existing work on job optimization. We conducted a mapping study on 47 selected papers that were published between 2004 and 2014. We classified and compared the selected papers based on a 5WH-based characterization framework. This study generates a knowledge base of current job optimization solutions and also identifies a set of research gaps and opportunities. This study concludes that job optimization is still in an early stage of maturity. More attentions need to be paid to the cross-data center, cluster or rack job optimization to improve communication efficiency.
Original language | English |
---|---|
Title of host publication | Proceedings - 2015 International Conference on Cloud Computing and Big Data, CCBD 2015 |
Publisher | IEEE - Institute of Electrical and Electronics Engineers Inc. |
Pages | 81-88 |
Number of pages | 8 |
ISBN (Electronic) | 9781467383509 |
DOIs | |
Publication status | Published - 2016 Apr 8 |
Event | International Conference on Cloud Computing and Big Data, CCBD 2015 - Shanghai, China Duration: 2015 Nov 4 → 2015 Nov 6 |
Conference
Conference | International Conference on Cloud Computing and Big Data, CCBD 2015 |
---|---|
Country/Territory | China |
City | Shanghai |
Period | 2015/11/04 → 2015/11/06 |
Subject classification (UKÄ)
- Information Systems
Free keywords
- big data
- job optimization
- mapping study
- MapReduce
- systematic literature review