TY - GEN
T1 - Semantic image retrieval and clustering for supporting domain-specific bridge component and defect classification
AU - Liu, Peter Cheng Yang
AU - El-Gohary, Nora
N1 - Funding Information: The authors would like to thank the National Science Foundation (NSF). This material is based on work supported by the NSF under Grant No. 1937115.
N1 - Publisher Copyright: © 2020 American Society of Civil Engineers.
PY - 2020
Y1 - 2020
AB - Automatic defect detection and classification from images is becoming increasingly important for bridge deterioration prediction and maintenance decision making. Most existing defect detection efforts have developed their own datasets for training a machine-learning algorithm for detection/classification. However, most of these datasets suffer from two main limitations. First, they are relatively small, which is insufficient for training an accurate image classifier. Second, they lack the needed variety in scenes, angles, and backgrounds, so they do not adapt well to different application contexts and environments. To address these limitations, this paper proposes a semantic image retrieval and clustering method that collects a large number of relevant images with various scenes, angles, and backgrounds from the Web and clusters these images to support domain-specific bridge component and defect detection. The proposed method includes three primary steps: query formation with image search and retrieval, image representation, and image clustering. First, a set of domain-specific words was extracted from bridge inspection documents and used as queries for retrieving a large number of images from the Web. Second, a transfer learning technique was used to transfer knowledge from a model pre-trained for general image classification to the bridge component and defect-related image clustering task. A deep convolutional neural network (CNN) with pre-trained weights was used to extract the visual features of the images for image representation. Third, a clustering technique was used to cluster the images based on the extracted features. The performance of the proposed method was evaluated using the silhouette coefficient. The evaluation results show that the proposed method is promising.
UR - http://www.scopus.com/inward/record.url?scp=85096919805&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85096919805&partnerID=8YFLogxK
U2 - 10.1061/9780784482858.087
DO - 10.1061/9780784482858.087
M3 - Conference contribution
AN - SCOPUS:85096919805
T3 - Construction Research Congress 2020: Infrastructure Systems and Sustainability - Selected Papers from the Construction Research Congress 2020
SP - 809
EP - 818
BT - Construction Research Congress 2020
A2 - El Asmar, Mounir
A2 - Tang, Pingbo
A2 - Grau, David
PB - American Society of Civil Engineers
T2 - Construction Research Congress 2020: Infrastructure Systems and Sustainability
Y2 - 8 March 2020 through 10 March 2020
ER -