Learning the taxonomy and models of categories present in arbitrary images

Narendra Ahuja, Sinisa Todorovic

Research output: Contribution to conferencePaper

Abstract

This paper proposes, and presents a solution to, the problem of simultaneous learning of multiple visual categories present in an arbitrary image set and their intercategory relationships. These relationships, also called their taxonomy, allow categories to be defined recursively, as spatial configurations of (simpler) subcategories each of which may be shared by many categories. Each image is represented by a segmentation tree, whose structure captures recursive embedding of image regions in a multiscale segmentation, and whose nodes contain the associated region properties. The presence of any occurring categories is reflected in the occurrence of associated, similar subtrees within the image trees. Similar subtrees across the entire image set are clustered. Each cluster corresponds to a discovered category, represented by the cluster properties. A (subcategory) cluster of small matching subtrees may occur within multiple clusters (categories) of larger matching subtrees, in different spatial relationships with subtrees from other small clusters. Such recursive embedding, grouping and intersection of clusters is captured in a directed acyclic graph (DAG) which represents the discovered taxonomy. Detection, recognition and segmentation of any of the learned categories present in a new image are simultaneously conducted by matching the segmentation tree of the new image with the learned DAG. This matching also yields a semantic explanation of the recognized category, in terms of the presence of its subcategories. Experiments with a newly compiled dataset of four-legged animals demonstrate good cross-category resolvability.

Original languageEnglish (US)
DOIs
StatePublished - Dec 1 2007
Event2007 IEEE 11th International Conference on Computer Vision, ICCV - Rio de Janeiro, Brazil
Duration: Oct 14 2007Oct 21 2007

Other

Other2007 IEEE 11th International Conference on Computer Vision, ICCV
CountryBrazil
CityRio de Janeiro
Period10/14/0710/21/07

ASJC Scopus subject areas

  • Software
  • Computer Vision and Pattern Recognition

Fingerprint Dive into the research topics of 'Learning the taxonomy and models of categories present in arbitrary images'. Together they form a unique fingerprint.

  • Cite this

    Ahuja, N., & Todorovic, S. (2007). Learning the taxonomy and models of categories present in arbitrary images. Paper presented at 2007 IEEE 11th International Conference on Computer Vision, ICCV, Rio de Janeiro, Brazil. https://doi.org/10.1109/ICCV.2007.4409039