Segmenting, modeling, and matching video clips containing multiple moving objects

Fred Rothganger, Svetlana Lazebnik, Cordelia Schmid, Jean Ponce

Research output: Contribution to journal (article, peer-reviewed)


This paper presents a novel representation for dynamic scenes composed of multiple rigid objects that may undergo different motions and are observed by a moving camera. Multiview constraints associated with groups of affine-covariant scene patches and a normalized description of their appearance are used to segment a scene into its rigid components, construct three-dimensional models of these components, and match instances of models recovered from different image sequences. The proposed approach has been applied to the detection and matching of moving objects in video sequences and to shot matching, i.e., the identification of shots that depict the same scene in a video clip.

Original language: English (US)
Pages (from-to): 477-491
Number of pages: 15
Journal: IEEE Transactions on Pattern Analysis and Machine Intelligence
Issue number: 3
State: Published - Mar 2007


Keywords

  • Affine-covariant patches
  • Motion segmentation
  • Shot matching
  • Structure from motion
  • Video retrieval

ASJC Scopus subject areas

  • Software
  • Computer Vision and Pattern Recognition
  • Computational Theory and Mathematics
  • Artificial Intelligence
  • Applied Mathematics

