Abstract
Most image annotation systems consider a single photo at a time and label photos individually. In this work, we focus on collections of personal photos and explore the associated GPS and time information for semantic annotation. First, we employ a constrained clustering method to partition a photo collection into event-based sub-collections, considering that the GPS records may be partly missing (a practical issue). We then use conditional random field (CRF) models to exploit the correlation between photos based on (1) time-location constraints and (2) the relationship between collection-level annotation (i.e., events) and image-level annotation (i.e., scenes). With the introduction of such a multi-level annotation hierarchy, our system addresses the problem of annotating consumer photo collections that requires a more hierarchical description of the customers' activities than do the simpler image annotation tasks. The efficacy of the proposed system is validated using a geotagged customer photo collection database, which consists of over 100 folders and is labeled for 12 events and 12 scenes.
Original language | English (US) |
---|---|
Title of host publication | 26th IEEE Conference on Computer Vision and Pattern Recognition, CVPR |
DOIs | |
State | Published - 2008 |
Event | 26th IEEE Conference on Computer Vision and Pattern Recognition, CVPR - Anchorage, AK, United States Duration: Jun 23 2008 → Jun 28 2008 |
Other
Other | 26th IEEE Conference on Computer Vision and Pattern Recognition, CVPR |
---|---|
Country/Territory | United States |
City | Anchorage, AK |
Period | 6/23/08 → 6/28/08 |
ASJC Scopus subject areas
- Computer Vision and Pattern Recognition
- Control and Systems Engineering