TY - JOUR
T1 - When huge is routine
T2 - Scaling genetic algorithms and estimation of distribution algorithms via data-intensive computing
AU - Llorà, Xavier
AU - Verma, Abhishek
AU - Campbell, Roy H.
AU - Goldberg, David E.
PY - 2010
Y1 - 2010
N2 - Data-intensive computing has emerged as a key player for processing large volumes of data exploiting massive parallelism. Data-intensive computing frameworks have shown that terabytes and petabytes of data can be routinely processed. However, there has been little effort to explore how data-intensive computing can help scale evolutionary computation. In this book chapter we explore how evolutionary computation algorithms can be modeled using two different data-intensive frameworks-Yahoo!'s Hadoop and NCSA's Meandre. We present a detailed step-by-step description of how three different evolutionary computation algorithms, having different execution profiles, can be translated into the data-intensive computing paradigms. Results show that (1) Hadoop is an excellent choice to push evolutionary computation boundaries on very large problems, and (2) that transparent Meandre linear speedups are possible without changing the underlying data-intensive flow thanks to its inherent parallel processing.
AB - Data-intensive computing has emerged as a key player for processing large volumes of data exploiting massive parallelism. Data-intensive computing frameworks have shown that terabytes and petabytes of data can be routinely processed. However, there has been little effort to explore how data-intensive computing can help scale evolutionary computation. In this book chapter we explore how evolutionary computation algorithms can be modeled using two different data-intensive frameworks-Yahoo!'s Hadoop and NCSA's Meandre. We present a detailed step-by-step description of how three different evolutionary computation algorithms, having different execution profiles, can be translated into the data-intensive computing paradigms. Results show that (1) Hadoop is an excellent choice to push evolutionary computation boundaries on very large problems, and (2) that transparent Meandre linear speedups are possible without changing the underlying data-intensive flow thanks to its inherent parallel processing.
UR - http://www.scopus.com/inward/record.url?scp=77149146500&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=77149146500&partnerID=8YFLogxK
U2 - 10.1007/978-3-642-10675-0_2
DO - 10.1007/978-3-642-10675-0_2
M3 - Article
AN - SCOPUS:77149146500
SN - 1860-949X
VL - 269
SP - 11
EP - 41
JO - Studies in Computational Intelligence
JF - Studies in Computational Intelligence
ER -