Enabling Support for Zero Copy Semantics in an Asynchronous Task-Based Programming Model

Nitin Bhat, Sam White, Laxmikant V. Kale

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Communication is critical to the scalable and efficient performance of scientific simulations on extreme scale computing systems. Part of the promise of task-based programming models is that they can naturally overlap communication with computation and exploit locality between tasks. Copy-based semantics using eager communication protocols easily enable such asynchrony by alleviating the responsibility of buffer management from the user, both on the sender and the receiver. However, these semantics increase memory allocations and copies and in turn affect application memory footprint and performance, especially with large message buffers. In this work we describe how the so-called “zero copy” messaging semantics can be supported in Converse, the message-driven parallel programming framework that is used by Charm++, by implementing support for user-owned buffer transfers in its lower level runtime system, LRTS. These semantics work on user-provided buffers and do not semantically require copies by either the user or the runtime system. We motivate our work by reviewing the existing messaging model in Converse/Charm++, identify its semantic shortcomings, and define new LRTS and Converse APIs to support zero copy communication based on RDMA capabilities. We demonstrate the utility of our new communication interfaces with benchmarks written in Converse. The result is up to 91% of message latency improvement and improved memory usage. These advances will enable future work on user-facing APIs in Charm++.

Original languageEnglish (US)
Title of host publicationEuro-Par 2021
Subtitle of host publicationParallel Processing Workshops - Euro-Par 2021 International Workshops, 2021, Revised Selected Papers
EditorsRicardo Chaves, Dora B. Heras, Aleksandar Ilic, Didem Unat, Rosa M. Badia, Andrea Bracciali, Patrick Diehl, Anshu Dubey, Oh Sangyoon, Stephen L. Scott, Laura Ricci
PublisherSpringer
Pages496-505
Number of pages10
ISBN (Print)9783031061554
DOIs
StatePublished - 2022
Event27th International Conference on Parallel and Distributed Computing, Euro-Par 2021 - Virtual, Online
Duration: Aug 30 2021Aug 31 2021

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume13098 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference27th International Conference on Parallel and Distributed Computing, Euro-Par 2021
CityVirtual, Online
Period8/30/218/31/21

Keywords

  • Asynchronous tasking
  • Charm++
  • Communication optimizations
  • Converse
  • Parallel programming
  • RDMA

ASJC Scopus subject areas

  • Theoretical Computer Science
  • General Computer Science

Fingerprint

Dive into the research topics of 'Enabling Support for Zero Copy Semantics in an Asynchronous Task-Based Programming Model'. Together they form a unique fingerprint.

Cite this