TY - JOUR
T1 - Weakly Supervised Segmentation of Buildings in Digital Elevation Models
AU - Soliman, Aiman
AU - Chen, Yifan
AU - Luo, Shirui
AU - Makharov, Rauf
AU - Kindratenko, Volodymyr
N1 - Publisher Copyright:
© 2004-2012 IEEE.
PY - 2022
Y1 - 2022
N2 - The lack of quality label data is considered one of the main bottlenecks for training machine and deep learning (DL) models. Weakly supervised learning using incomplete, coarse, or inaccurate data is an alternative strategy to overcome the scarcity of training data. We trained a U-Net model for segmenting buildings' footprints from a high-resolution digital elevation model (DEM), using the existing label data from the open-Access Microsoft building footprints (MS-BF) dataset. Comparison using an independent, manually labeled benchmark indicated the success of weak supervision learning as the quality of model prediction [intersection over union (IoU): 0.876] surpassed that of the original Microsoft data quality (IoU: 0.672) by approximately 20%. Moreover, adding extra channels such as elevation derivatives, slope, aspect, and profile curvatures did not enhance the weak learning process as the model learned directly from the original elevation data. Our results demonstrate the value of using existing data for training DL models even if they are noisy and incomplete.
AB - The lack of quality label data is considered one of the main bottlenecks for training machine and deep learning (DL) models. Weakly supervised learning using incomplete, coarse, or inaccurate data is an alternative strategy to overcome the scarcity of training data. We trained a U-Net model for segmenting buildings' footprints from a high-resolution digital elevation model (DEM), using the existing label data from the open-Access Microsoft building footprints (MS-BF) dataset. Comparison using an independent, manually labeled benchmark indicated the success of weak supervision learning as the quality of model prediction [intersection over union (IoU): 0.876] surpassed that of the original Microsoft data quality (IoU: 0.672) by approximately 20%. Moreover, adding extra channels such as elevation derivatives, slope, aspect, and profile curvatures did not enhance the weak learning process as the model learned directly from the original elevation data. Our results demonstrate the value of using existing data for training DL models even if they are noisy and incomplete.
KW - Building segmentation
KW - Deep learning (DL)
KW - Digital elevation models (DEMs)
KW - Noisy labels
KW - U-Net
KW - Weak supervision
UR - http://www.scopus.com/inward/record.url?scp=85130788885&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85130788885&partnerID=8YFLogxK
U2 - 10.1109/LGRS.2022.3177160
DO - 10.1109/LGRS.2022.3177160
M3 - Article
AN - SCOPUS:85130788885
SN - 1545-598X
VL - 19
JO - IEEE Geoscience and Remote Sensing Letters
JF - IEEE Geoscience and Remote Sensing Letters
ER -