TY - GEN
T1 - A scalable process-management environment for parallel programs
AU - Butler, Ralph
AU - Gropp, William
AU - Lusk, Ewing
N1 - Publisher Copyright:
© Springer-Verlag Berlin Heidelberg 2000.
PY - 2000
Y1 - 2000
N2 - We present a process management system for parallel programs such as those written using MPI.A primary goal of the system, which we call MPD (for multipurpose daemon), is to be scalable. By this we mean that startup of interactive parallel jobs comprising a thousand processes is quick, that signals can be quickly delivered to processes, and that stdin, stdout, and stderr are managed intuitively. Our primary target is parallel machines made up of clusters of SMPs, but the system is also useful in more tightly integrated environments. We describe how MPD enables much faster startup and better runtime management of MPICH jobs. We show how close control of stdio can support the easy implementation of a number of convenient system utilities, even a parallel debugger. MPD is implemented and freely distributed with MPICH.
AB - We present a process management system for parallel programs such as those written using MPI.A primary goal of the system, which we call MPD (for multipurpose daemon), is to be scalable. By this we mean that startup of interactive parallel jobs comprising a thousand processes is quick, that signals can be quickly delivered to processes, and that stdin, stdout, and stderr are managed intuitively. Our primary target is parallel machines made up of clusters of SMPs, but the system is also useful in more tightly integrated environments. We describe how MPD enables much faster startup and better runtime management of MPICH jobs. We show how close control of stdio can support the easy implementation of a number of convenient system utilities, even a parallel debugger. MPD is implemented and freely distributed with MPICH.
UR - http://www.scopus.com/inward/record.url?scp=84957017252&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84957017252&partnerID=8YFLogxK
U2 - 10.1007/3-540-45255-9_25
DO - 10.1007/3-540-45255-9_25
M3 - Conference contribution
AN - SCOPUS:84957017252
SN - 3540410104
SN - 9783540410102
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 168
EP - 175
BT - Recent Advances in Parallel Virtual Machine and Message Passing Interface - 7th European PVM/MPI Users’ Group Meeting, Proceedings
A2 - Dongarra, Jack
A2 - Kacsuk, Peter
A2 - Podhorszki, Norbert
PB - Springer
T2 - 7th European Parallel Virtual Machine and Message Passing Interface Users’ Group Meeting, PVM/MPI 2000
Y2 - 10 September 2000 through 13 September 2000
ER -