TY - JOUR
T1 - Text Analytics for Supporting Stakeholder Opinion Mining for Large-scale Highway Projects
AU - Lv, Xuan
AU - El-Gohary, Nora
N1 - Funding Information:
This paper is based upon work supported by the Strategic Research Initiatives (SRI) Program by the College of Engineering at the University of Illinois at Urbana-Champaign.
Publisher Copyright:
© 2016 The Authors.
PY - 2016
Y1 - 2016
N2 - For large-scale highway projects, late identification of stakeholder concerns often leads to design changes and duplication of effort, which may cause major project delays. This paper proposes a stakeholder opinion mining approach for helping transportation practitioners better identify the types of concerns in the early project stage. The proposed approach includes two major components: (1) stakeholder concern extraction, and (2) stakeholder concern classification. This paper focuses on presenting the proposed methodology and experimental results for stakeholder concern extraction, which extracts the words and phrases that describe stakeholder concerns from stakeholder comments on large-scale highway projects. In developing the proposed stakeholder concern extraction methodology, several supervised machine learning (ML) algorithms were tested and evaluated, and the effect of using a predefined name list as feature was also investigated. All the algorithms were tested on a testing data set of 200 comment sentences, which were selected from a comment collection including 1,849 stakeholder comments on five large-scale highway projects.
AB - For large-scale highway projects, late identification of stakeholder concerns often leads to design changes and duplication of effort, which may cause major project delays. This paper proposes a stakeholder opinion mining approach for helping transportation practitioners better identify the types of concerns in the early project stage. The proposed approach includes two major components: (1) stakeholder concern extraction, and (2) stakeholder concern classification. This paper focuses on presenting the proposed methodology and experimental results for stakeholder concern extraction, which extracts the words and phrases that describe stakeholder concerns from stakeholder comments on large-scale highway projects. In developing the proposed stakeholder concern extraction methodology, several supervised machine learning (ML) algorithms were tested and evaluated, and the effect of using a predefined name list as feature was also investigated. All the algorithms were tested on a testing data set of 200 comment sentences, which were selected from a comment collection including 1,849 stakeholder comments on five large-scale highway projects.
KW - Data analytics
KW - Information extraction
KW - Machine learning
KW - Natural language processing
KW - Opinion mining
UR - http://www.scopus.com/inward/record.url?scp=84999836129&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84999836129&partnerID=8YFLogxK
U2 - 10.1016/j.proeng.2016.04.039
DO - 10.1016/j.proeng.2016.04.039
M3 - Conference article
AN - SCOPUS:84999836129
SN - 1877-7058
VL - 145
SP - 518
EP - 524
JO - Procedia Engineering
JF - Procedia Engineering
T2 - International Conference on Sustainable Design, Engineering and Construction, ICSDEC 2016
Y2 - 18 May 2016 through 20 May 2016
ER -