TY - GEN
T1 - Breaking the MapReduce stage barrier
AU - Verma, Abhishek
AU - Zea, Nicolas
AU - Cho, Brian
AU - Gupta, Indranil
AU - Campbell, Roy H.
PY - 2010
Y1 - 2010
N2 - The MapReduce model uses a barrier between the Map and Reduce stages. This provides simplicity in both programming and implementation. However, in many situations, this barrier hurts performance because it is overly restrictive. Hence, we develop a method to break the barrier in MapReduce in a way that improves efficiency. Careful design of our barrier-less MapReduce framework results in equivalent generality and retains ease of programming. We motivate our case with, and experimentally study our barrier-less techniques in, a wide variety of MapReduce applications divided into seven classes. Our experiments show that our approach can achieve better performance times than a traditional MapReduce framework. We achieve a reduction in job completion times that is 25% on average and 87% in the best case.
AB - The MapReduce model uses a barrier between the Map and Reduce stages. This provides simplicity in both programming and implementation. However, in many situations, this barrier hurts performance because it is overly restrictive. Hence, we develop a method to break the barrier in MapReduce in a way that improves efficiency. Careful design of our barrier-less MapReduce framework results in equivalent generality and retains ease of programming. We motivate our case with, and experimentally study our barrier-less techniques in, a wide variety of MapReduce applications divided into seven classes. Our experiments show that our approach can achieve better performance times than a traditional MapReduce framework. We achieve a reduction in job completion times that is 25% on average and 87% in the best case.
UR - http://www.scopus.com/inward/record.url?scp=78649457559&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=78649457559&partnerID=8YFLogxK
U2 - 10.1109/CLUSTER.2010.29
DO - 10.1109/CLUSTER.2010.29
M3 - Conference contribution
AN - SCOPUS:78649457559
SN - 9780769542201
T3 - Proceedings - IEEE International Conference on Cluster Computing, ICCC
SP - 235
EP - 244
BT - Proceedings - 2010 IEEE International Conference on Cluster Computing, Cluster 2010
ER -