TY - JOUR
T1 - Distilling coarse-grained representations of molecular electronic structure with continuously gated message passing
AU - Maier, J. Charlie
AU - Wang, Chun I.
AU - Jackson, Nicholas E.
N1 - This material is based upon work supported by the National Science Foundation Chemical Theory, Models, and Computation division under Award No. CHE-2154916. We acknowledge support from the Dreyfus Program for Machine Learning in the Chemical Sciences and Engineering during this project. We acknowledge the American Chemical Society Petroleum Research Fund Doctoral New Investigator award for additional support of this research.
PY - 2024/1/14
Y1 - 2024/1/14
N2 - Bottom-up methods for coarse-grained (CG) molecular modeling are critically needed to establish rigorous links between atomistic reference data and reduced molecular representations. For a target molecule, the ideal reduced CG representation is a function of both the conformational ensemble of the system and the target physical observable(s) to be reproduced at the CG resolution. However, there is an absence of algorithms for selecting CG representations of molecules from which complex properties, including molecular electronic structure, can be accurately modeled. We introduce continuously gated message passing (CGMP), a graph neural network (GNN) method for atomically decomposing molecular electronic structure sampled over conformational ensembles. CGMP integrates 3D-invariant GNNs and a novel gated message passing system to continuously reduce the atomic degrees of freedom accessible for electronic predictions, resulting in a one-shot importance ranking of atoms contributing to a target molecular property. Moreover, CGMP provides the first approach by which to quantify the degeneracy of “good” CG representations conditioned on specific prediction targets, facilitating the development of more transferable CG representations. We further show how CGMP can be used to highlight multiatom correlations, illuminating a path to developing CG electronic Hamiltonians in terms of interpretable collective variables for arbitrarily complex molecules.
AB - Bottom-up methods for coarse-grained (CG) molecular modeling are critically needed to establish rigorous links between atomistic reference data and reduced molecular representations. For a target molecule, the ideal reduced CG representation is a function of both the conformational ensemble of the system and the target physical observable(s) to be reproduced at the CG resolution. However, there is an absence of algorithms for selecting CG representations of molecules from which complex properties, including molecular electronic structure, can be accurately modeled. We introduce continuously gated message passing (CGMP), a graph neural network (GNN) method for atomically decomposing molecular electronic structure sampled over conformational ensembles. CGMP integrates 3D-invariant GNNs and a novel gated message passing system to continuously reduce the atomic degrees of freedom accessible for electronic predictions, resulting in a one-shot importance ranking of atoms contributing to a target molecular property. Moreover, CGMP provides the first approach by which to quantify the degeneracy of “good” CG representations conditioned on specific prediction targets, facilitating the development of more transferable CG representations. We further show how CGMP can be used to highlight multiatom correlations, illuminating a path to developing CG electronic Hamiltonians in terms of interpretable collective variables for arbitrarily complex molecules.
UR - http://www.scopus.com/inward/record.url?scp=85181996743&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85181996743&partnerID=8YFLogxK
U2 - 10.1063/5.0179253
DO - 10.1063/5.0179253
M3 - Article
C2 - 38193551
AN - SCOPUS:85181996743
SN - 0021-9606
VL - 160
JO - Journal of Chemical Physics
JF - Journal of Chemical Physics
IS - 2
M1 - 024109
ER -