Automated regulatory information extraction from building codes leveraging syntactic and semantic information

Jiansong Zhang, Nora EL-Gohary

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Manual regulatory compliance checking of construction projects is usually timeconsuming and error-prone. There have been efforts both in academia and industry to automate this process. However, none of them achieved full automation. Specifically, the extraction of rules from regulatory text (e.g. building code) and its representation in a computer-processable format is still conducted manually or semi-automatically. Natural language processing (NLP) aims at enabling computers to process natural language text in a human-like manner. It provides basic concepts and methods for text processing and analysis, such as part of speech (POS) tagging, tokenization, sentence splitting, named entity recognition, and semantic role labeling, etc. This paper is intended to explore the effectiveness of utilizing syntactic (i.e. grammatical) and semantic (i.e. meaning descriptive) features of the text (using NLP tools and techniques) to automatically extract regulatory information from building codes. An automated information extraction (IE) approach - involving the use of IE rules - is proposed. Chapter 12 of the 2006 International Building Code was used to develop the IE rules, while Chapter 12 of the 2009 International Fire Code was used to test the approach. An overall F-measure of 0.94 shows the potential of the proposed approach. Based on the experimental results and their analysis, we conclude the paper by pinpointing possible ways for improving the proposed approach.

Original languageEnglish (US)
Title of host publicationConstruction Research Congress 2012
Subtitle of host publicationConstruction Challenges in a Flat World, Proceedings of the 2012 Construction Research Congress
Pages622-632
Number of pages11
DOIs
StatePublished - 2012
EventConstruction Research Congress 2012: Construction Challenges in a Flat World - West Lafayette, IN, United States
Duration: May 21 2012May 23 2012

Publication series

NameConstruction Research Congress 2012: Construction Challenges in a Flat World, Proceedings of the 2012 Construction Research Congress

Other

OtherConstruction Research Congress 2012: Construction Challenges in a Flat World
CountryUnited States
CityWest Lafayette, IN
Period5/21/125/23/12

ASJC Scopus subject areas

  • Civil and Structural Engineering
  • Building and Construction

Fingerprint Dive into the research topics of 'Automated regulatory information extraction from building codes leveraging syntactic and semantic information'. Together they form a unique fingerprint.

Cite this