Planning to Be Incremental: Scene Descriptions Reveal Meaningful Clustering in Language Production

Karina Tachihara, Madison Barker, Beverly Cotter, Taylor Hayes, John Henderson, Fernanda Ferreira

Research output: Working paperPreprint

Abstract

How do speakers plan complex descriptions and then execute those plans? In this work, we attempt to answer this question by asking subjects to describe complex visual scenes. We posit that speakers begin planning by organizing the scene into meaningful clusters or groupings of objects. Speakers describe the scene cluster by cluster, allowing for some planning time between each cluster. To test these ideas, in a preregistered study 30 participants described 30 indoor and outdoor scenes while their speech was recorded. Physical distance was calculated by identifying the centroid point of each object and then computing the Euclidean distance between centroid points for every object pair. Semantic distance was calculated using ConceptNet Numberbatch to obtain the semantic similarity between object labels. A clustering algorithm was then applied to establish the appropriate number of clusters per scene and to assign objects to each cluster. We observed that, consistent with our hypothesis, objects separated by shorter physical distances and objects that are semantically more similar were discussed in closer temporal proximity in the verbal descriptions. In addition, word productions that involved jumping from one cluster to another took longer to initiate than those associated with the same cluster. We conclude that speakers address the linearization problem by establishing clusters of objects and using them to facilitate incremental planning. This approach treats multi-utterance language production as a type of foraging behavior, where people balance exploration and exploitation.
Original languageEnglish (US)
DOIs
StateSubmitted - 2024

Publication series

NameCOGNIT-D-24-00854

Keywords

  • Linearization
  • multiutterance production
  • clustering
  • scene description

Fingerprint

Dive into the research topics of 'Planning to Be Incremental: Scene Descriptions Reveal Meaningful Clustering in Language Production'. Together they form a unique fingerprint.

Cite this