GPU accelerated finite-element computation for electromagnetic analysis

Huan Ting Meng, Bao Lin Nie, Steven Wong, Charles Macon, Jian Ming Jin

Research output: Contribution to journalArticlepeer-review


General-purpose computing on graphics processing units (GPGPU), with programming models such as the Compute Unified Device Architecture (CUDA) by NVIDIA, offers the capability for accelerating the solution process of computational electromagnetics analysis. However, due to the communication-intensive nature of the finite-element algorithm, both the assembly and the solution phases cannot be implemented via fine-grained many-core GPU processors in a straightforward manner. In this paper, we identify the bottlenecks in the GPU parallelization of the Finite-Element Method for electromagnetic analysis, and propose potential solutions to alleviate the bottlenecks. We first discuss efficient parallelization strategies for the finite-element matrix assembly on a single GPU and on multiple GPUs. We then explore parallelization strategies for the finite-element matrix solution, in conjunction with parallelizable preconditioners to reduce the total solution time. We show that with a proper parallelization and implementation, GPUs are able to achieve significant speedups over OpenMP-enabled multi-core CPUs.

Original languageEnglish (US)
Article number6837065
Pages (from-to)39-62
Number of pages24
JournalIEEE Antennas and Propagation Magazine
Issue number2
StatePublished - Apr 2014


  • Computational electromagnetics
  • finite element analysis
  • frequency-domain analysis
  • graphics processing units
  • high performance computing
  • parallel programming

ASJC Scopus subject areas

  • Condensed Matter Physics
  • Electrical and Electronic Engineering


Dive into the research topics of 'GPU accelerated finite-element computation for electromagnetic analysis'. Together they form a unique fingerprint.

Cite this