Abstract
This paper presents a fully automatic method for creating a 3D model from a single photograph. The model is made up of several texture-mapped planar billboards and has the complexity of a typical children's pop-up book illustration. Our main insight is that instead of attempting to recover precise geometry, we statistically model geometric classes defined by their orientations in the scene. Our algorithm labels regions of the input image into coarse categories: "ground", "sky", and "vertical". These labels are then used to "cut and fold" the image into a pop-up model using a set of simple assumptions. Because of the inherent ambiguity of the problem and the statistical nature of the approach, the algorithm is not expected to work on every image. However, it performs surprisingly well for a wide range of scenes taken from a typical person's photo album.
Original language | English (US) |
---|---|
Pages (from-to) | 577-584 |
Number of pages | 8 |
Journal | ACM Transactions on Graphics |
Volume | 24 |
Issue number | 3 |
DOIs | |
State | Published - Jul 2005 |
Externally published | Yes |
Event | ACM SIGGRAPH 2005 - Los Angeles, CA, United States Duration: Jul 31 2005 → Aug 4 2005 |
Keywords
- Image segmentation
- Image-based rendering
- Machine learning
- Single-view reconstruction
ASJC Scopus subject areas
- Computer Graphics and Computer-Aided Design