TY - GEN
T1 - Extending building information models semi-automatically using semantic natural language processing techniques
AU - Zhang, Jiansong
AU - El-Gohary, Nora M.
N1 - Publisher Copyright:
© ASCE 2014.
PY - 2014
Y1 - 2014
N2 - Automated compliance checking (ACC) of building designs requires automated extraction of information from building information models (BIMs). However, current Industry Foundation Classes (IFC)-based BIM models provide limited support for ACC because they lack the necessary information that is needed to perform compliance checking (CC). In this paper, we are proposing a new approach for extending the IFC schema to incorporate CC-related information in a semi-automated and objective manner. Our method utilizes semantic natural language processing (NLP) techniques to extract concepts and relations from documents that are related to CC (e.g. building codes). We utilize pattern-matching-based rules for the extraction. We use three types of features in the matching patterns: part-of-speech tags, dependency relations, and term sequence numbers in a sentence. The selected concepts and relations are then automatically encoded into the EXPRESS-language-represented IFC schema. The automated encoding in EXPRESS is enabled using a set of mapping rules. To evaluate our proposed approach, we compared the concepts and relations that we automatically extracted from the International Building Code 2006 to extend the IFC schema with a manually-developed gold-standard, and evaluated the results in terms of precision and recall. We achieved higher than 90% precision and recall, which shows that our approach is promising.
AB - Automated compliance checking (ACC) of building designs requires automated extraction of information from building information models (BIMs). However, current Industry Foundation Classes (IFC)-based BIM models provide limited support for ACC because they lack the necessary information that is needed to perform compliance checking (CC). In this paper, we are proposing a new approach for extending the IFC schema to incorporate CC-related information in a semi-automated and objective manner. Our method utilizes semantic natural language processing (NLP) techniques to extract concepts and relations from documents that are related to CC (e.g. building codes). We utilize pattern-matching-based rules for the extraction. We use three types of features in the matching patterns: part-of-speech tags, dependency relations, and term sequence numbers in a sentence. The selected concepts and relations are then automatically encoded into the EXPRESS-language-represented IFC schema. The automated encoding in EXPRESS is enabled using a set of mapping rules. To evaluate our proposed approach, we compared the concepts and relations that we automatically extracted from the International Building Code 2006 to extend the IFC schema with a manually-developed gold-standard, and evaluated the results in terms of precision and recall. We achieved higher than 90% precision and recall, which shows that our approach is promising.
UR - http://www.scopus.com/inward/record.url?scp=84934279798&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84934279798&partnerID=8YFLogxK
U2 - 10.1061/9780784413616.279
DO - 10.1061/9780784413616.279
M3 - Conference contribution
AN - SCOPUS:84934279798
T3 - Computing in Civil and Building Engineering - Proceedings of the 2014 International Conference on Computing in Civil and Building Engineering
SP - 2246
EP - 2253
BT - Computing in Civil and Building Engineering - Proceedings of the 2014 International Conference on Computing in Civil and Building Engineering
A2 - Issa, R. Raymond
A2 - Flood, Ian
PB - American Society of Civil Engineers
T2 - 2014 International Conference on Computing in Civil and Building Engineering
Y2 - 23 June 2014 through 25 June 2014
ER -