TY - GEN
T1 - Performance of the NAS benchmarks on a cluster of SMP PCs using a parallelization of the mpi programs with openMP
AU - Cappello, Franck
AU - Richard, Olivier
AU - Etiemble, Daniel
N1 - Publisher Copyright:
© Springer-Verlag Berlin Heidelberg 1999.
PY - 1999
Y1 - 1999
N2 - The availability of multiprocessors and high performance networks offer the opportunity to build CLUMPs (Cluster of Multiprocessors) and use them as parallel computing platforms. The main distinctive feature of the CLUMP architecture over the usual parallel computers is its hybrid memory model (message passing between the nodes and shared memory inside the nodes). To be largely used, the CLUMPs must be able to execute the existing programs with few modifications. We investigate the performance of a programming approach based on the MPI for inter-multiprocessor communications and OpenMP standards for intra-multiprocessor exchanges. The approach consists inthe intra-node parallelization of the MPI programs with an OpenMP directive based parallel compiler. The paper details the approach in the context of the biprocessor PC CLUMPs and presents a performance evaluation for the NAS parallel benchmarks.
AB - The availability of multiprocessors and high performance networks offer the opportunity to build CLUMPs (Cluster of Multiprocessors) and use them as parallel computing platforms. The main distinctive feature of the CLUMP architecture over the usual parallel computers is its hybrid memory model (message passing between the nodes and shared memory inside the nodes). To be largely used, the CLUMPs must be able to execute the existing programs with few modifications. We investigate the performance of a programming approach based on the MPI for inter-multiprocessor communications and OpenMP standards for intra-multiprocessor exchanges. The approach consists inthe intra-node parallelization of the MPI programs with an OpenMP directive based parallel compiler. The paper details the approach in the context of the biprocessor PC CLUMPs and presents a performance evaluation for the NAS parallel benchmarks.
UR - http://www.scopus.com/inward/record.url?scp=82555191534&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=82555191534&partnerID=8YFLogxK
U2 - 10.1007/3-540-48387-X_36
DO - 10.1007/3-540-48387-X_36
M3 - Conference contribution
AN - SCOPUS:82555191534
SN - 3540663630
SN - 9783540663638
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 339
EP - 350
BT - Parallel Computing Technologies - 5th International Conference, PaCT 1999, Proceedings
A2 - Malyshkin, Victor
PB - Springer
T2 - 5th International Conference on Parallel Computing Technologies, PaCT 1999
Y2 - 6 September 1999 through 10 September 1999
ER -