Skilled Bandits: Learning to Choose in a Reactive World

Jared M. Hotaling, Danielle J. Navarro, Ben R. Newell

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

In uncertain environments we must balance our need to gather information with our desire to exploit current knowledge. This is further complicated in reactive environments where actions produce long-lasting change. In three experiments, we investigate how people learn to make effective decisions from experience in a dynamic four-armed bandit task. In contrast to the diminishing rewards found in most previous studies, options were framed as skills that developed greater rewards when chosen. We find that most individuals learn effective strategies for coping with reactive environments. We present a psychological model positing that decision makers move through three distinct processing phases, and show that it accounts for key behavioral patterns across experiments.
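
As a rough illustration of the reactive structure described in the abstract, the following minimal sketch models a four-armed bandit in which an arm's payoff grows each time it is chosen. The class name `SkillBandit`, the deterministic linear growth rule, and every numeric value are assumptions made for illustration only; they are not the task parameters or the model reported in the paper.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class SkillBandit:
    """Toy reactive bandit: an arm's payoff rises each time it is chosen.

    Hypothetical illustration only; not the paper's task or model.
    """
    base: List[float]    # assumed starting payoff of each arm
    growth: List[float]  # assumed payoff gain per previous choice of each arm
    practice: List[int] = field(init=False, default_factory=list)

    def __post_init__(self) -> None:
        if len(self.base) != len(self.growth):
            raise ValueError("base and growth must have the same length")
        # Number of times each arm has been chosen so far.
        self.practice = [0] * len(self.base)

    def pull(self, arm: int) -> float:
        # Payoff depends on how often this arm was chosen before.
        reward = self.base[arm] + self.growth[arm] * self.practice[arm]
        self.practice[arm] += 1
        return reward


def total_reward(env: SkillBandit, choices: List[int]) -> float:
    """Sum of payoffs for a fixed sequence of arm choices."""
    return sum(env.pull(arm) for arm in choices)


def make_env() -> SkillBandit:
    # Four arms with made-up values: arm 0 pays best at first but never
    # improves; arm 3 pays worst at first but improves fastest.
    return SkillBandit(base=[5.0, 4.0, 3.0, 1.0], growth=[0.0, 0.1, 0.2, 0.5])


TRIALS = 20
myopic = total_reward(make_env(), [0] * TRIALS)  # always the best initial payoff
invest = total_reward(make_env(), [3] * TRIALS)  # commit to the fastest-growing arm

print(f"myopic: {myopic}, invest: {invest}")
```

Under these assumed values, committing to the initially worst arm overtakes the myopic choice within 20 trials, which is the kind of tension between immediate payoff and long-lasting change that a reactive environment creates.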

Original language: English (US)
Title of host publication: Proceedings of the 40th Annual Meeting of the Cognitive Science Society, CogSci 2018
Publisher: The Cognitive Science Society
Pages: 1827-1832
Number of pages: 6
ISBN (Electronic): 9780991196784
State: Published - 2018
Externally published: Yes
Event: 40th Annual Meeting of the Cognitive Science Society: Changing Minds, CogSci 2018 - Madison, United States
Duration: Jul 25 2018 to Jul 28 2018

Publication series

Name: Proceedings of the 40th Annual Meeting of the Cognitive Science Society, CogSci 2018

Conference

Conference: 40th Annual Meeting of the Cognitive Science Society: Changing Minds, CogSci 2018
Country/Territory: United States
City: Madison
Period: 7/25/18 to 7/28/18

Keywords

  • decision making
  • decisions from experience
  • dynamic environments
  • explore-exploit dilemma

ASJC Scopus subject areas

  • Artificial Intelligence
  • Computer Science Applications
  • Human-Computer Interaction
  • Cognitive Neuroscience
