Dynamic Tuning of Core Counts to Maximize Performance in Object-Based Runtime Systems

Kavitha Chandrasekar, Laxmikant V. Kale

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Relatively recent developments in supercomputer nodes, such as higher physical and virtual core counts per node, aim to speed up HPC application execution time. However, not all applications benefit from increased thread level parallelism and may exhibit performance degradation with increased concurrency. Additionally, the best performing thread count may not be known apriori as it can vary with application or with input size for a given application. This motivates the need for dynamically tuning the number of threads or cores used by an application, at run-time. However, such tuning of core counts in popular object-based or task-based runtime system is non-trivial since objects or tasks are anchored to processing elements (PEs) for locality. In this work, we identify the steps for adaptive tuning of core count to the most performant configuration, at run-time, for an object-based runtime system,  Charm++. We show performance benefit of dynamic profiling and adaptively selecting core (physical or virtual) count for a variety of applications including compute, memory and cache-intensive applications. Specifically, we show that our mechanism can improve performance by almost 40% in presence of cache and memory contention, by over 20% with SMT in Skylake nodes and by about 35% in KNL nodes. We also show energy savings, and in some cases power savings alongside performance improvement.

Original languageEnglish (US)
Title of host publicationAsynchronous Many-Task Systems and Applications - 2nd International Workshop, WAMTA 2024, Proceedings
EditorsPatrick Diehl, Joseph Schuchart, Pedro Valero-Lara, George Bosilca
PublisherSpringer
Pages92-104
Number of pages13
ISBN (Print)9783031617621
DOIs
StatePublished - 2024
Event2nd International Workshop on Asynchronous Many-Task Systems and Applications, WAMTA 2024 - Knoxville, United States
Duration: Feb 14 2024Feb 16 2024

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume14626 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference2nd International Workshop on Asynchronous Many-Task Systems and Applications, WAMTA 2024
Country/TerritoryUnited States
CityKnoxville
Period2/14/242/16/24

ASJC Scopus subject areas

  • Theoretical Computer Science
  • General Computer Science

Fingerprint

Dive into the research topics of 'Dynamic Tuning of Core Counts to Maximize Performance in Object-Based Runtime Systems'. Together they form a unique fingerprint.

Cite this