TY - JOUR
T1 - A deep neural network-based method for deep information extraction using transfer learning strategies to support automated compliance checking
AU - Zhang, Ruichuan
AU - El-Gohary, Nora
N1 - Funding Information:
The authors would like to thank the National Science Foundation (NSF) . This material is based on work supported by the NSF under Grant No. 1827733 . Any opinions, findings, and conclusions or recommendations expressed in this material are those of the authors and do not necessarily reflect the views of the NSF.
Publisher Copyright:
© 2021
PY - 2021/12
Y1 - 2021/12
N2 - Existing automated compliance checking (ACC) systems require the extraction of requirements from regulatory documents into computer-processable representations. These information extraction (IE) processes are either fully manual, semi-automated, or automated. Semi-automated and manual approaches typically use manual annotations or predefined IE rules, which lack sufficient flexibility and scalability; the annotations and rules typically need adaptation if the characteristics of the regulatory document change. There is, thus, a need for a fully automated IE approach that can achieve high and consistent performance across different types of regulatory documents for supporting ACC. To address this need, this paper proposes a deep neural network-based method for deep IE – extracting semantic and syntactic information elements – from regulatory documents in the architectural, engineering, and construction (AEC) domain. The proposed method was evaluated in extracting information from multiple regulatory documents in the AEC domain. It achieved average precision and recall of 93.1% and 92.9%, respectively.
AB - Existing automated compliance checking (ACC) systems require the extraction of requirements from regulatory documents into computer-processable representations. These information extraction (IE) processes are either fully manual, semi-automated, or automated. Semi-automated and manual approaches typically use manual annotations or predefined IE rules, which lack sufficient flexibility and scalability; the annotations and rules typically need adaptation if the characteristics of the regulatory document change. There is, thus, a need for a fully automated IE approach that can achieve high and consistent performance across different types of regulatory documents for supporting ACC. To address this need, this paper proposes a deep neural network-based method for deep IE – extracting semantic and syntactic information elements – from regulatory documents in the architectural, engineering, and construction (AEC) domain. The proposed method was evaluated in extracting information from multiple regulatory documents in the AEC domain. It achieved average precision and recall of 93.1% and 92.9%, respectively.
KW - Code checking
KW - Deep learning
KW - Information extraction
KW - Transfer learning
UR - http://www.scopus.com/inward/record.url?scp=85115055339&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85115055339&partnerID=8YFLogxK
U2 - 10.1016/j.autcon.2021.103834
DO - 10.1016/j.autcon.2021.103834
M3 - Article
AN - SCOPUS:85115055339
SN - 0926-5805
VL - 132
JO - Automation in Construction
JF - Automation in Construction
M1 - 103834
ER -