TY - JOUR
T1 - Identification of flow regimes in boiling flow with clustering algorithms
T2 - An interpretable machine-learning perspective
AU - Zhu, Longxiang
AU - Jhia Ooi, Zhiee
AU - Zhang, Taiyang
AU - Brooks, Caleb S.
AU - Pan, Liangming
N1 - This work is funded by the National Natural Science Foundation of China (grant no: 12205031) and the China Postdoctoral Science Foundation (grant no: 2022M720564).
PY - 2023/6/25
Y1 - 2023/6/25
N2 - The flow regime is the prerequisite to accurately modeling two-phase flow. Unsupervised machine learning techniques enable the identification of flow regimes objectively. Previous machine learning models are used as a “black box” tool without knowing the physical phenomena in the flow regime. Consequently, the cause of the identification error tends to be poorly understood and the model cannot be fundamentally improved. The paper develops an approach to better understand the identification result by creating a mapping relation between bubble distribution and the components in machine learning algorithms. The intrinsic interpretation generates the clustering principle to guide the feed-in feature extraction and clustering algorithm selection processes. Four features extracted from the bubble-size raw data recorded using conductivity probes are examined. Among them, the Cumulative Distribution Function of the chord length in seven dimensions is demonstrated to be the appropriate feed-in feature. Three major kinds of clustering algorithms are investigated, including partition-based, hierarchy-based, and model-based methods. After assigning physical meanings to the nodes in the algorithm and inspecting the clustering outcomes, the K-means, K-medoids, and Self-Organizing Maps are shown to succeed in the flow-regime identification problem. In addition, the local and the global flow regimes are generated by the well-designed machine learning model to assist the understanding of the boiling flow structure in a multi-dimensional way and in an area-averaged sense. The overall accuracy of the machine learning model for the three global flow regimes is 86%, which suggests the chosen algorithm with the selected feed-in feature is capable to capture the flow regime in the boiling flow. The flow regime map for the boiling dataset is compared with the existing flow regime criteria developed in the air–water flow, the result of which highlights the necessity of a new criterion to capture the transition from bubbly to slug for boiling flow. For the range of flow conditions considered in this work, the transition criterion between bubbly and slug flows is proposed to be 0.14 for upward boiling flow in an annular channel.
AB - The flow regime is the prerequisite to accurately modeling two-phase flow. Unsupervised machine learning techniques enable the identification of flow regimes objectively. Previous machine learning models are used as a “black box” tool without knowing the physical phenomena in the flow regime. Consequently, the cause of the identification error tends to be poorly understood and the model cannot be fundamentally improved. The paper develops an approach to better understand the identification result by creating a mapping relation between bubble distribution and the components in machine learning algorithms. The intrinsic interpretation generates the clustering principle to guide the feed-in feature extraction and clustering algorithm selection processes. Four features extracted from the bubble-size raw data recorded using conductivity probes are examined. Among them, the Cumulative Distribution Function of the chord length in seven dimensions is demonstrated to be the appropriate feed-in feature. Three major kinds of clustering algorithms are investigated, including partition-based, hierarchy-based, and model-based methods. After assigning physical meanings to the nodes in the algorithm and inspecting the clustering outcomes, the K-means, K-medoids, and Self-Organizing Maps are shown to succeed in the flow-regime identification problem. In addition, the local and the global flow regimes are generated by the well-designed machine learning model to assist the understanding of the boiling flow structure in a multi-dimensional way and in an area-averaged sense. The overall accuracy of the machine learning model for the three global flow regimes is 86%, which suggests the chosen algorithm with the selected feed-in feature is capable to capture the flow regime in the boiling flow. The flow regime map for the boiling dataset is compared with the existing flow regime criteria developed in the air–water flow, the result of which highlights the necessity of a new criterion to capture the transition from bubbly to slug for boiling flow. For the range of flow conditions considered in this work, the transition criterion between bubbly and slug flows is proposed to be 0.14 for upward boiling flow in an annular channel.
KW - Boiling flow
KW - Clustering algorithm
KW - Flow regime
KW - Interpretable machine learning
KW - Two-phase flow
UR - http://www.scopus.com/inward/record.url?scp=85151250958&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85151250958&partnerID=8YFLogxK
U2 - 10.1016/j.applthermaleng.2023.120493
DO - 10.1016/j.applthermaleng.2023.120493
M3 - Article
AN - SCOPUS:85151250958
SN - 1359-4311
VL - 228
JO - Applied Thermal Engineering
JF - Applied Thermal Engineering
M1 - 120493
ER -