Abstract

High-performance computing relies on accelerators (such as GPGPUs) to achieve fast execution of scientific applications. Traditionally, these accelerators have been programmed with specialized languages, such as CUDA or OpenCL. In recent years, OpenMP emerged as a promising alternative for supporting accelerators, providing advantages such as maintaining a single code base for the host and different accelerator types and providing a simple way to extend support for accelerators to existing application codes. Efficiently using this support requires solving several challenges, related to performance, work partitioning, and concurrent execution on multiple device types. In this article, we discuss our experiences with using OpenMP for accelerators and present performance guidelines. We also introduce a library, Hydra, that addresses several of the challenges of using OpenMP for such devices. We apply Hydra to a scientific application, PlasCom2, that has not previously been able to use accelerators. Experiments on three architectures show that Hydra results in performance gains of up to 10× compared with CPU-only execution. Concurrent execution on the host and GPU resulted in additional gains of up to 20% compared to running on the GPU only.

Original languageEnglish (US)
Article numbere5728
JournalConcurrency and Computation: Practice and Experience
Volume32
Issue number20
DOIs
StatePublished - Oct 25 2020

Keywords

  • GPGPU
  • Nvidia
  • OpenMP
  • accelerators
  • heterogeneous computing

ASJC Scopus subject areas

  • Software
  • Theoretical Computer Science
  • Computer Networks and Communications
  • Computer Science Applications
  • Computational Theory and Mathematics

Fingerprint

Dive into the research topics of 'Heterogeneous computing with OpenMP and Hydra'. Together they form a unique fingerprint.

Cite this