TY - GEN
T1 - Putting objects in perspective
AU - Hoiem, Derek
AU - Efros, Alexei A.
AU - Hebert, Martial
PY - 2006
Y1 - 2006
N2 - Image understanding requires not only individually estimating elements of the visual world but also capturing the interplay among them. In this paper, we provide a framework for placing local object detection in the context of the overall 3D scene by modeling the interdependence of objects, surface orientations, and camera viewpoint. Most object detection methods consider all scales and locations in the image as equally likely. We show that with probabilistic estimates of 3D geometry, both in terms of surfaces and world coordinates, we can put objects into perspective and model the scale and location variance in the image. Our approach reflects the cyclical nature of the problem by allowing probabilistic object hypotheses to refine geometry and vice-versa. Our framework allows painless substitution of almost any object detector and is easily extended to include other aspects of image understanding. Our results confirm the benefits of our integrated approach.
AB - Image understanding requires not only individually estimating elements of the visual world but also capturing the interplay among them. In this paper, we provide a framework for placing local object detection in the context of the overall 3D scene by modeling the interdependence of objects, surface orientations, and camera viewpoint. Most object detection methods consider all scales and locations in the image as equally likely. We show that with probabilistic estimates of 3D geometry, both in terms of surfaces and world coordinates, we can put objects into perspective and model the scale and location variance in the image. Our approach reflects the cyclical nature of the problem by allowing probabilistic object hypotheses to refine geometry and vice-versa. Our framework allows painless substitution of almost any object detector and is easily extended to include other aspects of image understanding. Our results confirm the benefits of our integrated approach.
UR - http://www.scopus.com/inward/record.url?scp=33845563923&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=33845563923&partnerID=8YFLogxK
U2 - 10.1109/CVPR.2006.232
DO - 10.1109/CVPR.2006.232
M3 - Conference contribution
AN - SCOPUS:33845563923
SN - 0769525970
SN - 9780769525976
T3 - Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition
SP - 2137
EP - 2144
BT - Proceedings - 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR 2006
T2 - 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR 2006
Y2 - 17 June 2006 through 22 June 2006
ER -