Finding things: Image parsing with regions and per-exemplar detectors

Joseph Tighe, Svetlana Lazebnik

Research output: Contribution to journalConference articlepeer-review

Abstract

This paper presents a system for image parsing, or labeling each pixel in an image with its semantic category, aimed at achieving broad coverage across hundreds of object categories, many of them sparsely sampled. The system combines region-level features with per-exemplar sliding window detectors. Per-exemplar detectors are better suited for our parsing task than traditional bounding box detectors: they perform well on classes with little training data and high intra-class variation, and they allow object masks to be transferred into the test image for pixel-level segmentation. The proposed system achieves state-of-the-art accuracy on three challenging datasets, the largest of which contains 45,676 images and 232 labels.

Original languageEnglish (US)
Article number6619230
Pages (from-to)3001-3008
Number of pages8
JournalProceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition
DOIs
StatePublished - 2013
Externally publishedYes
Event26th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2013 - Portland, OR, United States
Duration: Jun 23 2013Jun 28 2013

Keywords

  • computer vision
  • image parsing
  • parsing
  • recognition
  • semantic segmentation

ASJC Scopus subject areas

  • Software
  • Computer Vision and Pattern Recognition

Fingerprint

Dive into the research topics of 'Finding things: Image parsing with regions and per-exemplar detectors'. Together they form a unique fingerprint.

Cite this