TY - JOUR
T1 - Unsupervised Learning of Sequence-specific Aggregation Behavior for a Model Copolymer
AU - Statt, Antonia
AU - Kleeblatt, Devon C.
AU - Reinhart, Wesley F.
N1 - Funding Information:
This work was supported in part by the Institute for Computational and Data Sciences and the Materials Research Institute at the Pennsylvania State University.
Publisher Copyright:
© The Royal Society of Chemistry.
PY - 2021/9/7
Y1 - 2021/9/7
N2 - We apply a recently developed unsupervised machine learning scheme for local environments [Reinhart, Comput. Mater. Sci., 2021, 196, 110511] to characterize large-scale, disordered aggregates formed by sequence-defined macromolecules. This method provides new insight into the structure of these disordered, dilute aggregates, which has proven difficult to understand using collective variables manually derived from expert knowledge [Statt et al., J. Chem. Phys., 2020, 152, 075101]. In contrast to such conventional order parameters, we are able to classify the global aggregate structure directly using descriptions of the local environments. The resulting characterization provides a deeper understanding of the range of possible self-assembled structures and their relationships to each other. We also provide detailed analysis of the effects of finite system size, stochasticity, and kinetics of these aggregates based on the learned collective variables. Interestingly, we find that the spatiotemporal evolution of systems in the learned latent space is smooth and continuous, despite being derived from only a single snapshot from each of about 1000 monomer sequences. These results demonstrate the insight which can be gained by applying unsupervised machine learning to soft matter systems, especially when suitable order parameters are not known.
AB - We apply a recently developed unsupervised machine learning scheme for local environments [Reinhart, Comput. Mater. Sci., 2021, 196, 110511] to characterize large-scale, disordered aggregates formed by sequence-defined macromolecules. This method provides new insight into the structure of these disordered, dilute aggregates, which has proven difficult to understand using collective variables manually derived from expert knowledge [Statt et al., J. Chem. Phys., 2020, 152, 075101]. In contrast to such conventional order parameters, we are able to classify the global aggregate structure directly using descriptions of the local environments. The resulting characterization provides a deeper understanding of the range of possible self-assembled structures and their relationships to each other. We also provide detailed analysis of the effects of finite system size, stochasticity, and kinetics of these aggregates based on the learned collective variables. Interestingly, we find that the spatiotemporal evolution of systems in the learned latent space is smooth and continuous, despite being derived from only a single snapshot from each of about 1000 monomer sequences. These results demonstrate the insight which can be gained by applying unsupervised machine learning to soft matter systems, especially when suitable order parameters are not known.
UR - http://www.scopus.com/inward/record.url?scp=85113735732&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85113735732&partnerID=8YFLogxK
U2 - 10.1039/d1sm01012c
DO - 10.1039/d1sm01012c
M3 - Article
C2 - 34350929
AN - SCOPUS:85113735732
SN - 1744-683X
VL - 17
SP - 7697
EP - 7707
JO - Soft Matter
JF - Soft Matter
IS - 33
ER -