OpenACC acceleration for the PN–PN-2 algorithm in Nek5000

Evelyn Otero, Jing Gong, Misun Min, Paul Fischer, Philipp Schlatter, Erwin Laure

Research output: Contribution to journalArticle

Abstract

Due to its high performance and throughput capabilities, GPU-accelerated computing is becoming a popular technology in scientific computing, in particular using programming models such as CUDA and OpenACC. The main advantage with OpenACC is that it enables to simply port codes in their “original” form to GPU systems through compiler directives, thus allowing an incremental approach. An OpenACC implementation is applied to the CFD code Nek5000 for simulation of incompressible flows, based on the spectral-element method. The work follows up previous implementations and focuses now on the PN−PN−2 method for the spatial discretization of the Navier–Stokes equations. Performance results of the ported code show a speed-up of up to 3.1 on multi-GPU for a polynomial order N>11.

Original languageEnglish (US)
Pages (from-to)69-78
Number of pages10
JournalJournal of Parallel and Distributed Computing
Volume132
DOIs
StatePublished - Oct 2019
Externally publishedYes

Keywords

  • GPU programming
  • High performance computing
  • Nek5000
  • OpenACC
  • Spectral element method

ASJC Scopus subject areas

  • Software
  • Theoretical Computer Science
  • Hardware and Architecture
  • Computer Networks and Communications
  • Artificial Intelligence

Fingerprint Dive into the research topics of 'OpenACC acceleration for the P<sub>N</sub>–P<sub>N-2</sub> algorithm in Nek5000'. Together they form a unique fingerprint.

  • Cite this