TY - JOUR
T1 - Transformer language model for mapping construction schedule activities to uniformat categories
AU - Jung, Yoonhwa
AU - Hockenmaier, Julia
AU - Golparvar-Fard, Mani
N1 - This material is in part based upon works supported by the National Science Foundation, United States [ 1446765 , 2020227 ]. The support and help of Juan D. Núñez-Morales, Reconstruct Inc, and construction companies in collecting schedule data and validating the system prototype are greatly appreciated. The opinions, findings, and conclusions or recommendations expressed are those of the authors and do not reflect the views of the NSF, or the companies mentioned above.
PY - 2024/1
Y1 - 2024/1
N2 - Creating consistency among project schedule data, BIM, and payment applications requires activities in a construction schedule to be mapped with the most relevant ASTM Uniformat classifications. To do so, we introduce UNIFORMATBRIDGE, a new transformer-based natural language processing model, that automatically labels activities in a project schedule with Uniformat classification. Our model introduces construction sequencing tokens that capture logistically-constrained predecessor and successor activities into BERT architecture. We also introduce a dataset of real-world construction project schedules with their ground-truth Uniformat classifications for validation. Experimental results using this dataset achieve F1-scores of 0.93 and 0.87 when matching unstructured schedule data to Uniformat Level 2 and 3 classifications, respectively. We share how our method unlocks development of new techniques to (1) automatically create 4D BIM, and (2) computer-vision progress monitoring to tie semantic segmentation of reality capture data based on Uniformat classes against schedule or payment application data structures, with/without BIM.
AB - Creating consistency among project schedule data, BIM, and payment applications requires activities in a construction schedule to be mapped with the most relevant ASTM Uniformat classifications. To do so, we introduce UNIFORMATBRIDGE, a new transformer-based natural language processing model, that automatically labels activities in a project schedule with Uniformat classification. Our model introduces construction sequencing tokens that capture logistically-constrained predecessor and successor activities into BERT architecture. We also introduce a dataset of real-world construction project schedules with their ground-truth Uniformat classifications for validation. Experimental results using this dataset achieve F1-scores of 0.93 and 0.87 when matching unstructured schedule data to Uniformat Level 2 and 3 classifications, respectively. We share how our method unlocks development of new techniques to (1) automatically create 4D BIM, and (2) computer-vision progress monitoring to tie semantic segmentation of reality capture data based on Uniformat classes against schedule or payment application data structures, with/without BIM.
KW - Artificial intelligence
KW - Building Information Modeling
KW - Construction planning
KW - Machine learning
KW - Natural language processing
KW - Project controls
UR - http://www.scopus.com/inward/record.url?scp=85176110020&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85176110020&partnerID=8YFLogxK
U2 - 10.1016/j.autcon.2023.105183
DO - 10.1016/j.autcon.2023.105183
M3 - Article
AN - SCOPUS:85176110020
SN - 0926-5805
VL - 157
JO - Automation in Construction
JF - Automation in Construction
M1 - 105183
ER -