TY - JOUR
T1 - Weakly Supervised Spatial Deep Learning for Earth Image Segmentation Based on Imperfect Polyline Labels
AU - Jiang, Zhe
AU - He, Wenchong
AU - Kirby, Marcus Stephen
AU - Sainju, Arpan Man
AU - Wang, Shaowen
AU - Stanislawski, Lawrence V.
AU - Shavers, Ethan J.
AU - Usery, E. Lynn
N1 - Funding Information:
This material is based upon work supported by the National Science Foundation (NSF) under Grant No. IIS-1850546, IIS-2008973, CNS-1951974, OAC-2152085, and the National Oceanic and Atmospheric Administration (NOAA), Microsoft AI for Earth Grant and the Extreme Science and Engineering Discovery Environment (XSEDE). Authors’ addresses: Z. Jiang (corresponding author) and W. He, Department of Computer & Information Science & Engineering, The University of Florida, P.O. Box 116120, Gainesville, FL, 32611; emails: {zhe.jiang, whe2}@ufl.edu; M. S. Kirby, Department of Computer Science, The University of Alabama, Box 870290, Tuscaloosa, AL 35487; A. M. Sainju, Department of Computer Science, Middle Tennessee State University, PO Box 48, Murfreesboro, TN 37132; email: asainju@mtsu.edu; S. Wang, Department of Geography and Geographic Information Science, The University of Illinois at Urbana-Champaign, 1301 W. Green St., Urbana, IL 61801; email: shaowen@illinois.edu; L. V. Stanislawski, E. J. Shavers, and E. L. Usery, U.S. Geological Survey, Center of Excellence for Geospatial Information Science, 1400 Independence, Rolla, MO 65401; emails: {lstan, eshavers, usery}@usgs.gov. ACM acknowledges that this contribution was authored or co-authored by an employee, contractor, or affiliate of the United States government. As such, the United States government retains a nonexclusive, royalty-free right to publish or reproduce this article, or to allow others to do so, for government purposes only. © 2022 Association for Computing Machinery. 2157-6904/2022/01-ART25 $15.00 https://doi.org/10.1145/3480970
Publisher Copyright:
© 2022 Association for Computing Machinery.
PY - 2022/4
Y1 - 2022/4
N2 - In recent years, deep learning has achieved tremendous success in image segmentation for computer vision applications. The performance of these models heavily relies on the availability of large-scale high-quality training labels (e.g., PASCAL VOC 2012). Unfortunately, such large-scale high-quality training data are often unavailable in many real-world spatial or spatiotemporal problems in earth science and remote sensing (e.g., mapping the nationwide river streams for water resource management). Although extensive efforts have been made to reduce the reliance on labeled data (e.g., semi-supervised or unsupervised learning, few-shot learning), the complex nature of geographic data such as spatial heterogeneity still requires sufficient training labels when transferring a pre-trained model from one region to another. On the other hand, it is often much easier to collect lower-quality training labels with imperfect alignment with earth imagery pixels (e.g., through interpreting coarse imagery by non-expert volunteers). However, directly training a deep neural network on imperfect labels with geometric annotation errors could significantly impact model performance. Existing research that overcomes imperfect training labels either focuses on errors in label class semantics or characterizes label location errors at the pixel level. These methods do not fully incorporate the geometric properties of label location errors in the vector representation. To fill the gap, this article proposes a weakly supervised learning framework to simultaneously update deep learning model parameters and infer hidden true vector label locations. Specifically, we model label location errors in the vector representation to partially reserve geometric properties (e.g., spatial contiguity within line segments). Evaluations on real-world datasets in the National Hydrography Dataset (NHD) refinement application illustrate that the proposed framework outperforms baseline methods in classification accuracy.
AB - In recent years, deep learning has achieved tremendous success in image segmentation for computer vision applications. The performance of these models heavily relies on the availability of large-scale high-quality training labels (e.g., PASCAL VOC 2012). Unfortunately, such large-scale high-quality training data are often unavailable in many real-world spatial or spatiotemporal problems in earth science and remote sensing (e.g., mapping the nationwide river streams for water resource management). Although extensive efforts have been made to reduce the reliance on labeled data (e.g., semi-supervised or unsupervised learning, few-shot learning), the complex nature of geographic data such as spatial heterogeneity still requires sufficient training labels when transferring a pre-trained model from one region to another. On the other hand, it is often much easier to collect lower-quality training labels with imperfect alignment with earth imagery pixels (e.g., through interpreting coarse imagery by non-expert volunteers). However, directly training a deep neural network on imperfect labels with geometric annotation errors could significantly impact model performance. Existing research that overcomes imperfect training labels either focuses on errors in label class semantics or characterizes label location errors at the pixel level. These methods do not fully incorporate the geometric properties of label location errors in the vector representation. To fill the gap, this article proposes a weakly supervised learning framework to simultaneously update deep learning model parameters and infer hidden true vector label locations. Specifically, we model label location errors in the vector representation to partially reserve geometric properties (e.g., spatial contiguity within line segments). Evaluations on real-world datasets in the National Hydrography Dataset (NHD) refinement application illustrate that the proposed framework outperforms baseline methods in classification accuracy.
KW - Deep learning
KW - earth imagery segmentation
KW - imperfect labels
KW - weakly supervised learning
UR - http://www.scopus.com/inward/record.url?scp=85129473019&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85129473019&partnerID=8YFLogxK
U2 - 10.1145/3480970
DO - 10.1145/3480970
M3 - Article
AN - SCOPUS:85129473019
SN - 2157-6904
VL - 13
JO - ACM Transactions on Intelligent Systems and Technology
JF - ACM Transactions on Intelligent Systems and Technology
IS - 2
M1 - 25
ER -