TY - JOUR
T1 - Sequential Deep Operator Networks (S-DeepONet) for predicting full-field solutions under time-dependent loads
AU - He, Junyan
AU - Kushwaha, Shashank
AU - Park, Jaewan
AU - Koric, Seid
AU - Abueidda, Diab
AU - Jasiuk, Iwona
N1 - The authors would like to thank the National Center for Supercomputing Applications (NCSA) at the University of Illinois, and particularly its Research Consulting Directorate, the Industry Program, and the Center for Artificial Intelligence Innovation (CAII) for their support and hardware resources. This research is a part of the Delta research computing project, USA , which is supported by the National Science Foundation (award OCI 2005572 ) and the State of Illinois, USA , as well as the Illinois Computes program, USA supported by the University of Illinois Urbana-Champaign and the University of Illinois System. Finally, the authors would like to thank Professors George Karniadakis, Lu Lu, and the Crunch team at Brown, whose original work with DeepONets inspired this research.
PY - 2024/1
Y1 - 2024/1
N2 - Deep Operator Network (DeepONet), a recently introduced deep learning operator network, approximates linear and nonlinear solution operators by taking parametric functions (infinite-dimensional objects) as inputs and mapping them to solution functions in contrast to classical neural networks that need re-training for every new set of parametric inputs. In this work, we have extended the classical formulation of DeepONets by introducing sequential learning models like the gated recurrent unit (GRU) and long short-term memory (LSTM) in the branch network to allow for accurate predictions of the solution contour plots under parametric and time-dependent loading histories, known as Sequential DeepONets (S-DeepONets). Two example problems, one on transient heat transfer and the other on path-dependent plastic loading, were shown to demonstrate the capabilities of the new architectures compared to the benchmark DeepONet model with a feed-forward neural network (FNN) in the branch. Despite being more computationally expensive, the GRU- and LSTM-DeepONets lowered the prediction error by half (0.06% vs. 0.12%) compared to FNN-DeepONet in the heat transfer problem, and by 2.5 times (0.85% vs. 3%) in the plasticity problem. In all cases, the proposed DeepONets achieved a prediction R2 value of above 0.995, indicating superior accuracy. Results show that once trained, the proposed S-DeepONets can accurately predict the final full-field solution over the entire domain and are at least two orders of magnitude faster than direct finite element simulations, rendering it an accurate and robust surrogate model for rapid preliminary evaluations.
AB - Deep Operator Network (DeepONet), a recently introduced deep learning operator network, approximates linear and nonlinear solution operators by taking parametric functions (infinite-dimensional objects) as inputs and mapping them to solution functions in contrast to classical neural networks that need re-training for every new set of parametric inputs. In this work, we have extended the classical formulation of DeepONets by introducing sequential learning models like the gated recurrent unit (GRU) and long short-term memory (LSTM) in the branch network to allow for accurate predictions of the solution contour plots under parametric and time-dependent loading histories, known as Sequential DeepONets (S-DeepONets). Two example problems, one on transient heat transfer and the other on path-dependent plastic loading, were shown to demonstrate the capabilities of the new architectures compared to the benchmark DeepONet model with a feed-forward neural network (FNN) in the branch. Despite being more computationally expensive, the GRU- and LSTM-DeepONets lowered the prediction error by half (0.06% vs. 0.12%) compared to FNN-DeepONet in the heat transfer problem, and by 2.5 times (0.85% vs. 3%) in the plasticity problem. In all cases, the proposed DeepONets achieved a prediction R2 value of above 0.995, indicating superior accuracy. Results show that once trained, the proposed S-DeepONets can accurately predict the final full-field solution over the entire domain and are at least two orders of magnitude faster than direct finite element simulations, rendering it an accurate and robust surrogate model for rapid preliminary evaluations.
KW - Deep Operator Network (DeepONet)
KW - Gated recurrent unit (GRU)
KW - Long short-term memory (LSTM)
KW - Machine/deep learning
KW - Plastic deformation
KW - Transient heat transfer
UR - http://www.scopus.com/inward/record.url?scp=85173629609&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85173629609&partnerID=8YFLogxK
U2 - 10.1016/j.engappai.2023.107258
DO - 10.1016/j.engappai.2023.107258
M3 - Article
AN - SCOPUS:85173629609
SN - 0952-1976
VL - 127
JO - Engineering Applications of Artificial Intelligence
JF - Engineering Applications of Artificial Intelligence
M1 - 107258
ER -