TY - GEN
T1 - Integrating computing resources on multiple grid-enabled job scheduling systems through a grid RPC system
AU - Nakajima, Yoshihiro
AU - Sato, Mitsuhisa
AU - Aida, Yoshiaki
AU - Boku, Taisuke
AU - Cappello, Franck
PY - 2006
Y1 - 2006
N2 - We present a framework for a parallel programming model by remote procedure calls bridging between large-scale computing resource pools managed by multiple grid-enabled job scheduling systems. With this system, the user can exploit not only each remote servers and clusters, but also computing resources provided with grid-enabled job scheduling systems located on different sites. This framework requires a Grid RFC system to decouple the computation in a remote node from the Grid RFC mechanism and uses document-based communication rather than connection-based communication. We implemented the proposed framework as an extension of the OmniRPC system, which is a Grid RFC system for parallel programming in a grid environment. We designed a general interface to adapt the OmniRPC system to various grid-enabled job scheduling systems easily and applied the proposed system to several grid-enabled job scheduling systems, including XtremWeb, CyberGRIP, Condor and Grid Engine, we show the preliminary performance of these implementations using a phylogenetic application. We found that the proposed system can achieve approximately the same performance as using OmniRPC and can handle interruptions in worker programs on remote nodes.
AB - We present a framework for a parallel programming model by remote procedure calls bridging between large-scale computing resource pools managed by multiple grid-enabled job scheduling systems. With this system, the user can exploit not only each remote servers and clusters, but also computing resources provided with grid-enabled job scheduling systems located on different sites. This framework requires a Grid RFC system to decouple the computation in a remote node from the Grid RFC mechanism and uses document-based communication rather than connection-based communication. We implemented the proposed framework as an extension of the OmniRPC system, which is a Grid RFC system for parallel programming in a grid environment. We designed a general interface to adapt the OmniRPC system to various grid-enabled job scheduling systems easily and applied the proposed system to several grid-enabled job scheduling systems, including XtremWeb, CyberGRIP, Condor and Grid Engine, we show the preliminary performance of these implementations using a phylogenetic application. We found that the proposed system can achieve approximately the same performance as using OmniRPC and can handle interruptions in worker programs on remote nodes.
UR - http://www.scopus.com/inward/record.url?scp=33751098031&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=33751098031&partnerID=8YFLogxK
U2 - 10.1109/CCGRID.2006.60
DO - 10.1109/CCGRID.2006.60
M3 - Conference contribution
AN - SCOPUS:33751098031
SN - 0769525857
SN - 9780769525853
T3 - Sixth IEEE International Symposium on Cluster Computing and the Grid, 2006. CCGRID 06
SP - 296
EP - 300
BT - Sixth IEEE International Symposium on Cluster Computing and the Grid
T2 - 6th IEEE International Symposium on Cluster Computing and the Grid, 2006. CCGRID 06
Y2 - 16 May 2006 through 19 May 2006
ER -