TY - JOUR
T1 - Discovering melting temperature prediction models of inorganic solids by combining supervised and unsupervised learning
AU - Gharakhanyan, Vahe
AU - Wirth, Luke J.
AU - Garrido Torres, Jose A.
AU - Eisenberg, Ethan
AU - Wang, Ting
AU - Trinkle, Dallas R.
AU - Chatterjee, Snigdhansu
AU - Urban, Alexander
N1 - Publisher Copyright:
© 2024 Author(s).
PY - 2024/5/28
Y1 - 2024/5/28
N2 - The melting temperature is important for materials design because of its relationship with thermal stability, synthesis, and processing conditions. Current empirical and computational melting point estimation techniques are limited in scope, computational feasibility, or interpretability. We report the development of a machine learning methodology for predicting melting temperatures of binary ionic solid materials. We evaluated different machine-learning models trained on a dataset of the melting points of 476 non-metallic crystalline binary compounds using materials embeddings constructed from elemental properties and density-functional theory calculations as model inputs. A direct supervised-learning approach yields a mean absolute error of around 180 K but suffers from low interpretability. We find that the fidelity of predictions can further be improved by introducing an additional unsupervised-learning step that first classifies the materials before the melting-point regression. Not only does this two-step model exhibit improved accuracy, but the approach also provides a level of interpretability with insights into feature importance and different types of melting that depend on the specific atomic bonding inside a material. Motivated by this finding, we used a symbolic learning approach to find interpretable physical models for the melting temperature, which recovered the best-performing features from both prior models and provided additional interpretability.
AB - The melting temperature is important for materials design because of its relationship with thermal stability, synthesis, and processing conditions. Current empirical and computational melting point estimation techniques are limited in scope, computational feasibility, or interpretability. We report the development of a machine learning methodology for predicting melting temperatures of binary ionic solid materials. We evaluated different machine-learning models trained on a dataset of the melting points of 476 non-metallic crystalline binary compounds using materials embeddings constructed from elemental properties and density-functional theory calculations as model inputs. A direct supervised-learning approach yields a mean absolute error of around 180 K but suffers from low interpretability. We find that the fidelity of predictions can further be improved by introducing an additional unsupervised-learning step that first classifies the materials before the melting-point regression. Not only does this two-step model exhibit improved accuracy, but the approach also provides a level of interpretability with insights into feature importance and different types of melting that depend on the specific atomic bonding inside a material. Motivated by this finding, we used a symbolic learning approach to find interpretable physical models for the melting temperature, which recovered the best-performing features from both prior models and provided additional interpretability.
UR - http://www.scopus.com/inward/record.url?scp=85194936134&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85194936134&partnerID=8YFLogxK
U2 - 10.1063/5.0207033
DO - 10.1063/5.0207033
M3 - Article
C2 - 38804486
AN - SCOPUS:85194936134
SN - 0021-9606
VL - 160
JO - Journal of Chemical Physics
JF - Journal of Chemical Physics
IS - 20
M1 - 204112
ER -