TY - JOUR
T1 - Scientific discovery in the age of artificial intelligence
AU - Wang, Hanchen
AU - Fu, Tianfan
AU - Du, Yuanqi
AU - Gao, Wenhao
AU - Huang, Kexin
AU - Liu, Ziming
AU - Chandak, Payal
AU - Liu, Shengchao
AU - Van Katwyk, Peter
AU - Deac, Andreea
AU - Anandkumar, Anima
AU - Bergen, Karianne
AU - Gomes, Carla P.
AU - Ho, Shirley
AU - Kohli, Pushmeet
AU - Lasenby, Joan
AU - Leskovec, Jure
AU - Liu, Tie Yan
AU - Manrai, Arjun
AU - Marks, Debora
AU - Ramsundar, Bharath
AU - Song, Le
AU - Sun, Jimeng
AU - Tang, Jian
AU - Veličković, Petar
AU - Welling, Max
AU - Zhang, Linfeng
AU - Coley, Connor W.
AU - Bengio, Yoshua
AU - Zitnik, Marinka
N1 - M.Z. gratefully acknowledges the support of the National Institutes of Health under R01HD108794, U.S. Air Force under FA8702-15-D-0001, awards from Harvard Data Science Initiative, Amazon Faculty Research, Google Research Scholar Program, Bayer Early Excellence in Science, AstraZeneca Research, Roche Alliance with Distinguished Scientists, and Kempner Institute for the Study of Natural and Artificial Intelligence. C.P.G. and Y.D. acknowledge the support from the U.S. Air Force Office of Scientific Research under Multidisciplinary University Research Initiatives Program (MURI) FA9550-18-1-0136, Defense University Research Instrumentation Program (DURIP) FA9550-21-1-0316, and awards from Scientific Autonomous Reasoning Agent (SARA), and AI for Discovery Assistant (AIDA). Any opinions, findings, conclusions or recommendations expressed in this material are those of the authors and do not necessarily reflect the views of the funders. We thank D. Hassabis, A. Davies, S. Mohamed, Z. Li, K. Ma, Z. Qiao, E. Weinstein, A. V. Weller, Y. Zhong and A. M. Brandt for discussions on the paper.
PY - 2023/8/3
Y1 - 2023/8/3
N2 - Artificial intelligence (AI) is being increasingly integrated into scientific discovery to augment and accelerate research, helping scientists to generate hypotheses, design experiments, collect and interpret large datasets, and gain insights that might not have been possible using traditional scientific methods alone. Here we examine breakthroughs over the past decade that include self-supervised learning, which allows models to be trained on vast amounts of unlabelled data, and geometric deep learning, which leverages knowledge about the structure of scientific data to enhance model accuracy and efficiency. Generative AI methods can create designs, such as small-molecule drugs and proteins, by analysing diverse data modalities, including images and sequences. We discuss how these methods can help scientists throughout the scientific process and the central issues that remain despite such advances. Both developers and users of AI toolsneed a better understanding of when such approaches need improvement, and challenges posed by poor data quality and stewardship remain. These issues cut across scientific disciplines and require developing foundational algorithmic approaches that can contribute to scientific understanding or acquire it autonomously, making them critical areas of focus for AI innovation.
AB - Artificial intelligence (AI) is being increasingly integrated into scientific discovery to augment and accelerate research, helping scientists to generate hypotheses, design experiments, collect and interpret large datasets, and gain insights that might not have been possible using traditional scientific methods alone. Here we examine breakthroughs over the past decade that include self-supervised learning, which allows models to be trained on vast amounts of unlabelled data, and geometric deep learning, which leverages knowledge about the structure of scientific data to enhance model accuracy and efficiency. Generative AI methods can create designs, such as small-molecule drugs and proteins, by analysing diverse data modalities, including images and sequences. We discuss how these methods can help scientists throughout the scientific process and the central issues that remain despite such advances. Both developers and users of AI toolsneed a better understanding of when such approaches need improvement, and challenges posed by poor data quality and stewardship remain. These issues cut across scientific disciplines and require developing foundational algorithmic approaches that can contribute to scientific understanding or acquire it autonomously, making them critical areas of focus for AI innovation.
UR - https://www.scopus.com/pages/publications/85166404512
UR - https://www.scopus.com/pages/publications/85166404512#tab=citedBy
U2 - 10.1038/s41586-023-06221-2
DO - 10.1038/s41586-023-06221-2
M3 - Review article
C2 - 37532811
AN - SCOPUS:85166404512
SN - 0028-0836
VL - 620
SP - 47
EP - 60
JO - Nature
JF - Nature
IS - 7972
ER -