TY - GEN
T1 - Debugging large scale applications in a virtualized environment
AU - Gioachin, Filippo
AU - Zheng, Gengbin
AU - Kalé, Laxmikant V.
PY - 2011
Y1 - 2011
N2 - With the advent of petascale machines with hundreds of thousands of processors, debugging parallel applications is becoming an increasing challenge. Aside from the complicated debugging techniques required to debug applications at such scale, it is often difficult to gain access to these machines for a sufficient period of time, if at all. Some existing parallel debuggers are capable of handling these machines, but they still require the whole machine to be allocated. In this paper, we present an innovative approach to address debugging on such extreme scales. By leveraging the concept of object-based processor virtualization, our technique enables debugging of even a million processor execution under a simulated environment using only a relatively small cluster. We describe the obstacles we overcame to achieve this goal within two message passing programming models: Charm++ and MPI. We demonstrate the results using real world applications such as Molecular Dynamics and Cosmological simulation programs.
AB - With the advent of petascale machines with hundreds of thousands of processors, debugging parallel applications is becoming an increasing challenge. Aside from the complicated debugging techniques required to debug applications at such scale, it is often difficult to gain access to these machines for a sufficient period of time, if at all. Some existing parallel debuggers are capable of handling these machines, but they still require the whole machine to be allocated. In this paper, we present an innovative approach to address debugging on such extreme scales. By leveraging the concept of object-based processor virtualization, our technique enables debugging of even a million processor execution under a simulated environment using only a relatively small cluster. We describe the obstacles we overcame to achieve this goal within two message passing programming models: Charm++ and MPI. We demonstrate the results using real world applications such as Molecular Dynamics and Cosmological simulation programs.
UR - http://www.scopus.com/inward/record.url?scp=79952613842&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=79952613842&partnerID=8YFLogxK
U2 - 10.1007/978-3-642-19595-2_14
DO - 10.1007/978-3-642-19595-2_14
M3 - Conference contribution
AN - SCOPUS:79952613842
SN - 9783642195945
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 199
EP - 214
BT - Languages and Compilers for Parallel Computing - 23rd International Workshop, LCPC 2010, Revised Selected Papers
T2 - 23rd International Workshop on Languages and Compilers for Parallel Computing, LCPC 2010
Y2 - 7 October 2010 through 9 October 2010
ER -