ACM SRC Poster: Optimizing all-to-all algorithm for percs network using simulation

Ehsan Totoni, Laxmikant V. Kale

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Communication algorithms play a crucial role in the performance of large-scale parallel systems. They are implemented in runtime systems and used in most parallel applications as a critical component. As vendors are willing to design new custom networks with significantly different performance properties for their new supercomputers, designing new efficient communication algorithms is an inevitable challenge. This task is desirable to be done before the machine comes online since inefficient use of the system before the new algorithm's availability is a huge waste of a possibly hundreds of millions of dollars resource. Here, we demonstrate the usability of our simulation framework, BigSim, in meeting this challenge. Using BigSim, we observe that the commonly used Pairwise-Exchange algorithm for all-to-all communication pattern is suboptimal for a supernode of the PERCS network (two-level directly connected similar to Dragony topology). We designed a new all-to-all algorithm for it and predict a five-fold performance improvement for large message sizes using this algorithm.

Original languageEnglish (US)
Title of host publicationSC'11 - Proceedings of the 2011 High Performance Computing Networking, Storage and Analysis Companion, Co-located with SC'11
Pages123-124
Number of pages2
DOIs
StatePublished - Dec 1 2011
Event2011 High Performance Computing Networking, Storage and Analysis, SC'11, Co-located with SC'11 - Seattle, WA, United States
Duration: Nov 12 2011Nov 18 2011

Publication series

NameSC'11 - Proceedings of the 2011 High Performance Computing Networking, Storage and Analysis Companion, Co-located with SC'11

Other

Other2011 High Performance Computing Networking, Storage and Analysis, SC'11, Co-located with SC'11
CountryUnited States
CitySeattle, WA
Period11/12/1111/18/11

Keywords

  • Design
  • General Terms Algorithms
  • Measurement
  • Performance

ASJC Scopus subject areas

  • Computer Networks and Communications

Fingerprint Dive into the research topics of 'ACM SRC Poster: Optimizing all-to-all algorithm for percs network using simulation'. Together they form a unique fingerprint.

Cite this