On Generating Explanations for Reinforcement Learning Policies: An Empirical Study

Research output: Contribution to journalArticlepeer-review

Abstract

Explaining reinforcement learning policies is important for deploying them in real-world scenarios. We introduce a set of linear temporal logic formulae designed to provide such explanations, and an algorithm for searching through those formulae for the one that best explains a given policy. Our key idea is to compare action distributions from the target policy with those from policies optimized for candidate explanations. This comparison provides more insight into the target policy than existing methods and avoids inference of "catch-all" explanations. We demonstrate our method in a simulated game of capture-the-flag, a car-parking environment, and a robot navigation task.

Original languageEnglish (US)
Pages (from-to)3027-3032
Number of pages6
JournalIEEE Control Systems Letters
Volume8
DOIs
StatePublished - 2024

Keywords

  • Autonomous system
  • intelligent systems
  • machine learning
  • robotics

ASJC Scopus subject areas

  • Control and Systems Engineering
  • Control and Optimization

Fingerprint

Dive into the research topics of 'On Generating Explanations for Reinforcement Learning Policies: An Empirical Study'. Together they form a unique fingerprint.

Cite this