Box in the box: Joint 3D layout and object reasoning from single images

Alexander G. Schwing, Sanja Fidler, Marc Pollefeys, Raquel Urtasun

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

In this paper we propose an approach to jointly infer the room layout as well as the objects present in the scene. Towards this goal, we propose a branch and bound algorithm which is guaranteed to retrieve the global optimum of the joint problem. The main difficulty resides in taking into account occlusion in order to not over-count the evidence. We introduce a new decomposition method, which generalizes integral geometry to triangular shapes, and allows us to bound the different terms in constant time. We exploit both geometric cues and object detectors as image features and show large improvements in 2D and 3D object detection over state-of-the-art deformable part-based models.

Original languageEnglish (US)
Title of host publicationProceedings - 2013 IEEE International Conference on Computer Vision, ICCV 2013
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages353-360
Number of pages8
ISBN (Print)9781479928392
DOIs
StatePublished - 2013
Externally publishedYes
Event2013 14th IEEE International Conference on Computer Vision, ICCV 2013 - Sydney, NSW, Australia
Duration: Dec 1 2013Dec 8 2013

Publication series

NameProceedings of the IEEE International Conference on Computer Vision

Other

Other2013 14th IEEE International Conference on Computer Vision, ICCV 2013
Country/TerritoryAustralia
CitySydney, NSW
Period12/1/1312/8/13

ASJC Scopus subject areas

  • Software
  • Computer Vision and Pattern Recognition

Fingerprint

Dive into the research topics of 'Box in the box: Joint 3D layout and object reasoning from single images'. Together they form a unique fingerprint.

Cite this