TY - GEN
T1 - A malleable-job system for timeshared parallel machines
AU - Kale, Laxmikant V.
AU - Kumar, Sameer
AU - Desouza, Jayant
PY - 2002
Y1 - 2002
N2 - Malleable jobs are parallel programs that can change the number of processors on which they are executing at run time in response to an external command. One of the advantages of such jobs is that a job scheduler for malleable jobs can provide improved system utilization and average response time over a scheduler for traditional jobs. In this paper, we present a programming system for creating malleable jobs that is more general than other current malleable systems. In particular, it is not limited to the master-worker paradigm or the Fortran SPMD programming model, but can also support general-purpose parallel programs, including those written in MPI and Charm++, and has built-in migration and load balancing, among other features.
AB - Malleable jobs are parallel programs that can change the number of processors on which they are executing at run time in response to an external command. One of the advantages of such jobs is that a job scheduler for malleable jobs can provide improved system utilization and average response time over a scheduler for traditional jobs. In this paper, we present a programming system for creating malleable jobs that is more general than other current malleable systems. In particular, it is not limited to the master-worker paradigm or the Fortran SPMD programming model, but can also support general-purpose parallel programs, including those written in MPI and Charm++, and has built-in migration and load balancing, among other features.
UR - http://www.scopus.com/inward/record.url?scp=10044276198&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=10044276198&partnerID=8YFLogxK
U2 - 10.1109/CCGRID.2002.1017131
DO - 10.1109/CCGRID.2002.1017131
M3 - Conference contribution
AN - SCOPUS:10044276198
SN - 0769515827
SN - 9780769515823
T3 - 2nd IEEE/ACM International Symposium on Cluster Computing and the Grid, CCGrid 2002
SP - 230
EP - 237
BT - 2nd IEEE/ACM International Symposium on Cluster Computing and the Grid, CCGrid 2002
PB - IEEE Computer Society
T2 - 2nd IEEE/ACM International Symposium on Cluster Computing and the Grid, CCGrid 2002
Y2 - 21 May 2002 through 24 May 2002
ER -