TY - JOUR
T1 - Recovering occlusion boundaries from an image
AU - Hoiem, Derek
AU - Efros, Alexei A.
AU - Hebert, Martial
N1 - Funding Information:
Acknowledgements This material is based upon work supported by the NSF under award IIS-0904209 (D.H.) and CAREER award IIS-0546547 (A.E.), as well as a Microsoft Graduate Fellowship (D.H.). We are grateful to Jenna Hebert for her immensely valuable efforts in ground truth labeling. We thank the Berkeley group for making code available.
PY - 2011/2
Y1 - 2011/2
N2 - Occlusion reasoning is a fundamental problem in computer vision. In this paper, we propose an algorithm to recover the occlusion boundaries and depth ordering of free-standing structures in the scene. Rather than viewing the problem as one of pure image processing, our approach employs cues from an estimated surface layout and applies Gestalt grouping principles using a conditional random field (CRF) model. We propose a hierarchical segmentation process, based on agglomerative merging, that re-estimates boundary strength as the segmentation progresses. Our experiments on the Geometric Context dataset validate our choices for features, our iterative refinement of classifiers, and our CRF model. In experiments on the Berkeley Segmentation Dataset, PASCAL VOC 2008, and LabelMe, we also show that the trained algorithm generalizes to other datasets and can be used as an object boundary predictor with figure/ground labels.
AB - Occlusion reasoning is a fundamental problem in computer vision. In this paper, we propose an algorithm to recover the occlusion boundaries and depth ordering of free-standing structures in the scene. Rather than viewing the problem as one of pure image processing, our approach employs cues from an estimated surface layout and applies Gestalt grouping principles using a conditional random field (CRF) model. We propose a hierarchical segmentation process, based on agglomerative merging, that re-estimates boundary strength as the segmentation progresses. Our experiments on the Geometric Context dataset validate our choices for features, our iterative refinement of classifiers, and our CRF model. In experiments on the Berkeley Segmentation Dataset, PASCAL VOC 2008, and LabelMe, we also show that the trained algorithm generalizes to other datasets and can be used as an object boundary predictor with figure/ground labels.
KW - 3D reconstruction
KW - Depth from image
KW - Edge detection
KW - Figure/ground labeling
KW - Image interpretation
KW - Image segmentation
KW - Occlusion boundaries
KW - Scene understanding
UR - http://www.scopus.com/inward/record.url?scp=79851509434&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=79851509434&partnerID=8YFLogxK
U2 - 10.1007/s11263-010-0400-4
DO - 10.1007/s11263-010-0400-4
M3 - Article
AN - SCOPUS:79851509434
SN - 0920-5691
VL - 91
SP - 328
EP - 346
JO - International Journal of Computer Vision
JF - International Journal of Computer Vision
IS - 3
ER -