TY - JOUR
T1 - Learning to predict sustainable aviation fuel properties
T2 - A deep uncertainty quantification viewpoint
AU - Oh, Ji Hun
AU - Oldani, Anna
AU - Solecki, Alex
AU - Lee, Tonghun
N1 - This work was funded by the US Federal Aviation Administration (FAA) Office of Environment and Energy as a part of ASCENT Project 33 under FAA Award Number: 13-C-AJFE-UI Amendment 33. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the authors and do not necessarily reflect the views of the FAA or other ASCENT sponsors.
PY - 2024/1/15
Y1 - 2024/1/15
N2 - Machine/deep learning (DL) predictions of sustainable aviation fuel's (SAF) physiochemical properties from chemical data offers a rapid way to prescreen the potential viability of new SAF candidates but is limited by uncertainties. In this article, the uncertainties arising from insufficient training data (epistemic) and finite-resolution chemical features (heteroscedastic) are addressed by conducting a deep uncertainty quantification (UQ) study using a Bayesian neural network ensemble (BNNE) to model and analyze such uncertainties. In particular, flash point is predicted from two-dimensional gas chromatography (GC×GC) features in various scenarios where differences in epistemicity and heteroscedasticity exist. Several insights are obtained: (1) Overparameterization of the network provides buffer against epistemicity and should be advocated in the absence of sufficient data. (2) Reducing the epistemic uncertainty via GC×GC localization does not always improve accuracy, highlighting the necessity of a probabilistic formulation to prevent overconfident but erroneous predictions. (3) Heteroscedastic uncertainty is larger and irreducible for lower resolution features, e.g., GC separated by chemical family but not molecular formulae. These findings aim not only to facilitate trustworthy DL practices in SAF modeling but also to emphasize the importance of establishing a big data pipeline and the design of finer features (e.g., isomer differentiation via vacuum ultraviolet spectroscopy) to mitigate these uncertainties.
AB - Machine/deep learning (DL) predictions of sustainable aviation fuel's (SAF) physiochemical properties from chemical data offers a rapid way to prescreen the potential viability of new SAF candidates but is limited by uncertainties. In this article, the uncertainties arising from insufficient training data (epistemic) and finite-resolution chemical features (heteroscedastic) are addressed by conducting a deep uncertainty quantification (UQ) study using a Bayesian neural network ensemble (BNNE) to model and analyze such uncertainties. In particular, flash point is predicted from two-dimensional gas chromatography (GC×GC) features in various scenarios where differences in epistemicity and heteroscedasticity exist. Several insights are obtained: (1) Overparameterization of the network provides buffer against epistemicity and should be advocated in the absence of sufficient data. (2) Reducing the epistemic uncertainty via GC×GC localization does not always improve accuracy, highlighting the necessity of a probabilistic formulation to prevent overconfident but erroneous predictions. (3) Heteroscedastic uncertainty is larger and irreducible for lower resolution features, e.g., GC separated by chemical family but not molecular formulae. These findings aim not only to facilitate trustworthy DL practices in SAF modeling but also to emphasize the importance of establishing a big data pipeline and the design of finer features (e.g., isomer differentiation via vacuum ultraviolet spectroscopy) to mitigate these uncertainties.
KW - Bayesian neural networks
KW - Composition-property relationships
KW - Deep learning
KW - Sustainable aviation fuels
KW - Uncertainty quantification
UR - http://www.scopus.com/inward/record.url?scp=85169807715&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85169807715&partnerID=8YFLogxK
U2 - 10.1016/j.fuel.2023.129508
DO - 10.1016/j.fuel.2023.129508
M3 - Article
AN - SCOPUS:85169807715
SN - 0016-2361
VL - 356
JO - Fuel
JF - Fuel
M1 - 129508
ER -