TY - JOUR
T1 - Convergence of artificial intelligence and high performance computing on NSF-supported cyberinfrastructure
AU - Huerta, E. A.
AU - Khan, Asad
AU - Davis, Edward
AU - Bushell, Colleen
AU - Gropp, William D.
AU - Katz, Daniel S.
AU - Kindratenko, Volodymyr
AU - Koric, Seid
AU - Kramer, William T.C.
AU - McGinty, Brendan
AU - McHenry, Kenton
AU - Saxton, Aaron
N1 - Funding Information:
EAH, AK, DSK, and VK gratefully acknowledge National Science Foundation (NSF) award OAC-1931561. EAH and VK also acknowledge NSF award OAC-1934757. This work utilized XSEDE resources through the NSF award TG-PHY160053, and the NSF’s Major Research Instrumentation program, award OAC-1725729, as well as the University of Illinois at Urbana-Champaign. This research used resources of the Oak Ridge Leadership Computing Facility, which is a DOE Office of Science User Facility supported under Contract DE-AC05-00OR22725. Acknowledgements
Funding Information:
We thank Nicholas A. Nystrom, Paola Buitrago and Julian Uran for their support using Bridges-AI; and Arjun Shankar, Tom Gibbs, Junqi Yin, and Jeff Larking for their support and guidance using the Summit supercomputer. We also thank Ben Blaiszik, Ryan Chard and Logan Ward for their support deploying our neural network models and testing datasets at the Data and Learning Hub for Science hosted by Argonne National Lab.
Publisher Copyright:
© 2020, The Author(s).
PY - 2020/12/1
Y1 - 2020/12/1
N2 - Significant investments to upgrade and construct large-scale scientific facilities demand commensurate investments in R&D to design algorithms and computing approaches to enable scientific and engineering breakthroughs in the big data era. Innovative Artificial Intelligence (AI) applications have powered transformational solutions for big data challenges in industry and technology that now drive a multi-billion dollar industry, and which play an ever increasing role shaping human social patterns. As AI continues to evolve into a computing paradigm endowed with statistical and mathematical rigor, it has become apparent that single-GPU solutions for training, validation, and testing are no longer sufficient for computational grand challenges brought about by scientific facilities that produce data at a rate and volume that outstrip the computing capabilities of available cyberinfrastructure platforms. This realization has been driving the confluence of AI and high performance computing (HPC) to reduce time-to-insight, and to enable a systematic study of domain-inspired AI architectures and optimization schemes to enable data-driven discovery. In this article we present a summary of recent developments in this field, and describe specific advances that authors in this article are spearheading to accelerate and streamline the use of HPC platforms to design and apply accelerated AI algorithms in academia and industry.
AB - Significant investments to upgrade and construct large-scale scientific facilities demand commensurate investments in R&D to design algorithms and computing approaches to enable scientific and engineering breakthroughs in the big data era. Innovative Artificial Intelligence (AI) applications have powered transformational solutions for big data challenges in industry and technology that now drive a multi-billion dollar industry, and which play an ever increasing role shaping human social patterns. As AI continues to evolve into a computing paradigm endowed with statistical and mathematical rigor, it has become apparent that single-GPU solutions for training, validation, and testing are no longer sufficient for computational grand challenges brought about by scientific facilities that produce data at a rate and volume that outstrip the computing capabilities of available cyberinfrastructure platforms. This realization has been driving the confluence of AI and high performance computing (HPC) to reduce time-to-insight, and to enable a systematic study of domain-inspired AI architectures and optimization schemes to enable data-driven discovery. In this article we present a summary of recent developments in this field, and describe specific advances that authors in this article are spearheading to accelerate and streamline the use of HPC platforms to design and apply accelerated AI algorithms in academia and industry.
KW - Artificial intelligence
KW - High performance computing
UR - http://www.scopus.com/inward/record.url?scp=85092643334&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85092643334&partnerID=8YFLogxK
U2 - 10.1186/s40537-020-00361-2
DO - 10.1186/s40537-020-00361-2
M3 - Article
AN - SCOPUS:85092643334
SN - 2196-1115
VL - 7
JO - Journal of Big Data
JF - Journal of Big Data
IS - 1
M1 - 88
ER -