TY - JOUR
T1 - Machine learning and data science in soft materials engineering
AU - Ferguson, Andrew L.
N1 - Publisher Copyright:
© 2017 IOP Publishing Ltd.
PY - 2018/1/31
Y1 - 2018/1/31
N2 - In many branches of materials science it is now routine to generate data sets of such large size and dimensionality that conventional methods of analysis fail. Paradigms and tools from data science and machine learning can provide scalable approaches to identify and extract trends and patterns within voluminous data sets, perform guided traversals of high-dimensional phase spaces, and furnish data-driven strategies for inverse materials design. This topical review provides an accessible introduction to machine learning tools in the context of soft and biological materials by 'de-jargonizing' data science terminology, presenting a taxonomy of machine learning techniques, and surveying the mathematical underpinnings and software implementations of popular tools, including principal component analysis, independent component analysis, diffusion maps, support vector machines, and relative entropy. We present illustrative examples of machine learning applications in soft matter, including inverse design of self-assembling materials, nonlinear learning of protein folding landscapes, high-throughput antimicrobial peptide design, and data-driven materials design engines. We close with an outlook on the challenges and opportunities for the field.
AB - In many branches of materials science it is now routine to generate data sets of such large size and dimensionality that conventional methods of analysis fail. Paradigms and tools from data science and machine learning can provide scalable approaches to identify and extract trends and patterns within voluminous data sets, perform guided traversals of high-dimensional phase spaces, and furnish data-driven strategies for inverse materials design. This topical review provides an accessible introduction to machine learning tools in the context of soft and biological materials by 'de-jargonizing' data science terminology, presenting a taxonomy of machine learning techniques, and surveying the mathematical underpinnings and software implementations of popular tools, including principal component analysis, independent component analysis, diffusion maps, support vector machines, and relative entropy. We present illustrative examples of machine learning applications in soft matter, including inverse design of self-assembling materials, nonlinear learning of protein folding landscapes, high-throughput antimicrobial peptide design, and data-driven materials design engines. We close with an outlook on the challenges and opportunities for the field.
KW - biological materials
KW - data science
KW - data-driven design
KW - inverse design
KW - machine learning
KW - soft materials
UR - http://www.scopus.com/inward/record.url?scp=85040866057&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85040866057&partnerID=8YFLogxK
U2 - 10.1088/1361-648X/aa98bd
DO - 10.1088/1361-648X/aa98bd
M3 - Review article
C2 - 29111979
AN - SCOPUS:85040866057
SN - 0953-8984
VL - 30
JO - Journal of Physics Condensed Matter
JF - Journal of Physics Condensed Matter
IS - 4
M1 - 043002
ER -