Pointwise-in-Time Diagnostics for Reinforcement Learning During Training and Runtime

Noel Brindise, Andres Felipe Posada-Moreno, Cedric Langbort, Sebastian Trimpe

Research output: Contribution to journalConference articlepeer-review

Abstract

Explainable AI Planning (XAIP), a subfield of xAI, offers a variety of methods to interpret the behavior of autonomous systems. A recent “pointwise-in-time” explanation method, called Rule Status Assessment (RSA), characterizes an agent’s behavior at individual time steps in a trajectory using linear temporal logic (LTL) rules. In this work, RSA is applied for the first time in a reinforcement learning (RL) context. We first demonstrate RSA diagnostics as a substantial supplement to the basic RL reward curve, tracking whether and when specified subtasks are accomplished. We then introduce a novel “Interactive RSA” which provides the user with detailed diagnostic information automatically at any desired point in a trajectory. We apply RSA to an advanced agent at runtime and show that RSA and its novel interactive variant constitute a promising step towards explainable RL.

Original languageEnglish (US)
Pages (from-to)694-706
Number of pages13
JournalProceedings of Machine Learning Research
Volume242
StatePublished - 2024
Externally publishedYes
Event6th Annual Learning for Dynamics and Control Conference, L4DC 2024 - Oxford, United Kingdom
Duration: Jul 15 2024Jul 17 2024

Keywords

  • Explainable AI Planning
  • Explainable Reinforcement Learning
  • Linear Temporal Logic
  • Markov Decision Processes

ASJC Scopus subject areas

  • Artificial Intelligence
  • Software
  • Control and Systems Engineering
  • Statistics and Probability

Fingerprint

Dive into the research topics of 'Pointwise-in-Time Diagnostics for Reinforcement Learning During Training and Runtime'. Together they form a unique fingerprint.

Cite this