Visual scenes are categorized by function

Michelle R. Greene, Christopher Baldassano, Andre Esteva, Diane M. Beck, Li Fei-Fei

Research output: Contribution to journalArticlepeer-review

Abstract

How do we know that a kitchen is a kitchen by looking? Traditional models posit that scene categorization is achieved through recognizing necessary and sufficient features and objects, yet there is little consensus about what these may be. However, scene categories should reflect how we use visual information. Therefore, we test the hypothesis that scene categories reflect functions, or the possibilities for actions within a scene. Our approach is to compare human categorization patterns with predictions made by both functions and alternative models. We collected a large-scale scene category distance matrix (5 million trials) by asking observers to simply decide whether 2 images were from the same or different categories. Using the actions from the American Time Use Survey, we mapped actions onto each scene (1.4 million trials). We found a strong relationship between ranked category distance and functional distance (r =.50, or 66% of the maximum possible correlation). The function model outperformed alternative models of object-based distance (r =.33), visual features from a convolutional neural network (r =.39), lexical distance (r =.27), and models of visual features. Using hierarchical linear regression, we found that functions captured 85.5% of overall explained variance, with nearly half of the explained variance captured only by functions, implying that the predictive power of alternative models was because of their shared variance with the function-based model. These results challenge the dominant school of thought that visual features and objects are sufficient for scene categorization, suggesting instead that a scene's category may be determined by the scene's function.

Original languageEnglish (US)
Pages (from-to)82-94
Number of pages13
JournalJournal of Experimental Psychology: General
Volume145
Issue number1
DOIs
StatePublished - Jan 1 2016

Keywords

  • Categorization
  • Scene understanding
  • Similarity

ASJC Scopus subject areas

  • Experimental and Cognitive Psychology
  • General Psychology
  • Developmental Neuroscience

Fingerprint

Dive into the research topics of 'Visual scenes are categorized by function'. Together they form a unique fingerprint.

Cite this