TY - JOUR
T1 - Discovering melting temperature prediction models of inorganic solids by combining supervised and unsupervised learning
AU - Gharakhanyan, Vahe
AU - Wirth, Luke J.
AU - Garrido Torres, Jose A.
AU - Eisenberg, Ethan
AU - Wang, Ting
AU - Trinkle, Dallas R.
AU - Chatterjee, Snigdhansu
AU - Urban, Alexander
N1 - This work was supported by the National Science Foundation (NSF) under Grant No. DMR1940290 (Harnessing the Data Revolution, HDR). We acknowledge computing resources from Columbia University\u2019s Shared Research Computing Facility project, which is supported by NIH Research Facility Improvement under Grant No. 1G20RR030893-01, and associated funds from the New York State Empire State Development, Division of Science Technology and Innovation (NYSTAR), under Contract No. C090171, both awarded April 15, 2010. This work also made use of the Illinois Campus Cluster, a computing resource that is operated by the Illinois Campus Cluster Program (ICCP) in conjunction with the National Center for Supercomputing Applications (NCSA) and which is supported by funds from the University of Illinois Urbana-Champaign. The research of SC was partially supported by the National Science Foundation (NSF) under Grant Nos. 1939916 and 1939956.
PY - 2024/5/28
Y1 - 2024/5/28
N2 - The melting temperature is important for materials design because of its relationship with thermal stability, synthesis, and processing conditions. Current empirical and computational melting point estimation techniques are limited in scope, computational feasibility, or interpretability. We report the development of a machine learning methodology for predicting melting temperatures of binary ionic solid materials. We evaluated different machine-learning models trained on a dataset of the melting points of 476 non-metallic crystalline binary compounds using materials embeddings constructed from elemental properties and density-functional theory calculations as model inputs. A direct supervised-learning approach yields a mean absolute error of around 180 K but suffers from low interpretability. We find that the fidelity of predictions can further be improved by introducing an additional unsupervised-learning step that first classifies the materials before the melting-point regression. Not only does this two-step model exhibit improved accuracy, but the approach also provides a level of interpretability with insights into feature importance and different types of melting that depend on the specific atomic bonding inside a material. Motivated by this finding, we used a symbolic learning approach to find interpretable physical models for the melting temperature, which recovered the best-performing features from both prior models and provided additional interpretability.
AB - The melting temperature is important for materials design because of its relationship with thermal stability, synthesis, and processing conditions. Current empirical and computational melting point estimation techniques are limited in scope, computational feasibility, or interpretability. We report the development of a machine learning methodology for predicting melting temperatures of binary ionic solid materials. We evaluated different machine-learning models trained on a dataset of the melting points of 476 non-metallic crystalline binary compounds using materials embeddings constructed from elemental properties and density-functional theory calculations as model inputs. A direct supervised-learning approach yields a mean absolute error of around 180 K but suffers from low interpretability. We find that the fidelity of predictions can further be improved by introducing an additional unsupervised-learning step that first classifies the materials before the melting-point regression. Not only does this two-step model exhibit improved accuracy, but the approach also provides a level of interpretability with insights into feature importance and different types of melting that depend on the specific atomic bonding inside a material. Motivated by this finding, we used a symbolic learning approach to find interpretable physical models for the melting temperature, which recovered the best-performing features from both prior models and provided additional interpretability.
UR - http://www.scopus.com/inward/record.url?scp=85194936134&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85194936134&partnerID=8YFLogxK
U2 - 10.1063/5.0207033
DO - 10.1063/5.0207033
M3 - Article
C2 - 38804486
AN - SCOPUS:85194936134
SN - 0021-9606
VL - 160
JO - Journal of Chemical Physics
JF - Journal of Chemical Physics
IS - 20
M1 - 204112
ER -