Abstract
This paper presents a novel representation for dynamic scenes composed of multiple rigid objects that may undergo different motions and are observed by a moving camera. Multiview constraints associated with groups of affine-covariant scene patches and a normalized description of their appearance are used to segment a scene into its rigid components, construct three-dimensional models of these components, and match instances of models recovered from different image sequences. The proposed approach has been applied to the detection and matching of moving objects in video sequences and to shot matching, i.e., the identification of shots that depict the same scene in a video clip.
Original language | English (US) |
---|---|
Pages (from-to) | 477-491 |
Number of pages | 15 |
Journal | IEEE transactions on pattern analysis and machine intelligence |
Volume | 29 |
Issue number | 3 |
DOIs | |
State | Published - Mar 2007 |
Keywords
- Affine-covariant patches
- Motion segmentation
- Shot matching
- Structure from motion
- Video retrieval
ASJC Scopus subject areas
- Software
- Computer Vision and Pattern Recognition
- Computational Theory and Mathematics
- Artificial Intelligence
- Applied Mathematics