Dist-gem5: Distributed simulation of computer clusters

Mohammad Alian, Umur Darbaz, Gabor Dozsa, Stephan Diestelhorst, Daehoon Kim, Nam Sung Kim

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

Abstract

When analyzing a distributed computer system, we often observe that the complex interplay among processor, node, and network sub-systems can profoundly affect the system's performance and power efficiency. Therefore, to effectively cross-optimize the hardware and software components of such a system, we need a full-system simulation infrastructure that can precisely capture this interplay. In response to this need, we present dist-gem5, a flexible, detailed, and open-source full-system simulation infrastructure that can model and simulate a distributed computer system using multiple simulation hosts. We validate dist-gem5 against a physical cluster and show that the latency and bandwidth of the simulated network sub-system are within 18% of those of the physical one. Compared with the single-threaded and parallel versions of gem5, dist-gem5 speeds up the simulation of a 63-node computer cluster by 83.1× and 12.8×, respectively.

Original language: English (US)
Title of host publication: ISPASS 2017 - IEEE International Symposium on Performance Analysis of Systems and Software
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 153-162
Number of pages: 10
ISBN (Electronic): 9781538638897
State: Published - Jul 11 2017
Event: 2017 IEEE International Symposium on Performance Analysis of Systems and Software, ISPASS 2017 - Santa Rosa, United States
Duration: Apr 24 2017 to Apr 25 2017

Publication series

Name: ISPASS 2017 - IEEE International Symposium on Performance Analysis of Systems and Software

Other

Other: 2017 IEEE International Symposium on Performance Analysis of Systems and Software, ISPASS 2017
Country: United States
City: Santa Rosa
Period: 4/24/17 to 4/25/17

ASJC Scopus subject areas

  • Computer Science (miscellaneous)
  • Hardware and Architecture
  • Software
