Extraction of construction regulatory requirements from textual documents using natural language processing techniques

J. Zhang, Nora El-Gohary

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Automated regulatory compliance checking requires automated information extraction (IE) from regulatory textual documents (e.g. building codes). Automated IE is a challenging task that requires complex processing of text. Natural Language Processing (NLP) aims at enabling computers to process natural language text in a human-like manner using a variety of text processing techniques, such as phrase-structure parsing, dependency parsing, etc. This paper proposes a hybrid syntactic (syntax/grammar-related) and semantic (meaning/context-related) NLP approach for automated IE from construction regulatory documents, and explores the use of two techniques (phrase-structure grammar and dependency grammar) for extracting information from complex sentences. IE rules were developed based on Chapter 12 of the 2006 International Building Code; and the approach was tested on Chapter 12 of the 2009 International Fire Code. Initial experimental results are presented, empirically evaluated in terms of precision and recall, and discussed.

Original languageEnglish (US)
Title of host publicationComputing in Civil Engineering - Proceedings of the 2012 ASCE International Conference on Computing in Civil Engineering
Pages453-460
Number of pages8
DOIs
StatePublished - Dec 1 2012
Event2012 ASCE International Conference on Computing in Civil Engineering - Clearwater Beach, FL, United States
Duration: Jun 17 2012Jun 20 2012

Publication series

NameCongress on Computing in Civil Engineering, Proceedings

Other

Other2012 ASCE International Conference on Computing in Civil Engineering
Country/TerritoryUnited States
CityClearwater Beach, FL
Period6/17/126/20/12

ASJC Scopus subject areas

  • Civil and Structural Engineering
  • Computer Science Applications

Fingerprint

Dive into the research topics of 'Extraction of construction regulatory requirements from textual documents using natural language processing techniques'. Together they form a unique fingerprint.

Cite this