Abstract
Explainable AI Planning (XAIP), a subfield of xAI, offers a variety of methods to interpret the behavior of autonomous systems. A recent “pointwise-in-time” explanation method, called Rule Status Assessment (RSA), characterizes an agent’s behavior at individual time steps in a trajectory using linear temporal logic (LTL) rules. In this work, RSA is applied for the first time in a reinforcement learning (RL) context. We first demonstrate RSA diagnostics as a substantial supplement to the basic RL reward curve, tracking whether and when specified subtasks are accomplished. We then introduce a novel “Interactive RSA” which provides the user with detailed diagnostic information automatically at any desired point in a trajectory. We apply RSA to an advanced agent at runtime and show that RSA and its novel interactive variant constitute a promising step towards explainable RL.
| Original language | English (US) |
|---|---|
| Pages (from-to) | 694-706 |
| Number of pages | 13 |
| Journal | Proceedings of Machine Learning Research |
| Volume | 242 |
| State | Published - 2024 |
| Event | 6th Annual Learning for Dynamics and Control Conference, L4DC 2024 - Oxford, United Kingdom Duration: Jul 15 2024 → Jul 17 2024 |
Keywords
- Explainable AI Planning
- Explainable Reinforcement Learning
- Linear Temporal Logic
- Markov Decision Processes
ASJC Scopus subject areas
- Artificial Intelligence
- Software
- Control and Systems Engineering
- Statistics and Probability
Fingerprint
Dive into the research topics of 'Pointwise-in-Time Diagnostics for Reinforcement Learning During Training and Runtime'. Together they form a unique fingerprint.Cite this
- APA
- Standard
- Harvard
- Vancouver
- Author
- BIBTEX
- RIS