TY - JOUR
T1 - Causal explanation for reinforcement learning
T2 - quantifying state and temporal importance
AU - Wang, Xiaoxiao
AU - Meng, Fanyu
AU - Liu, Xin
AU - Kong, Zhaodan
AU - Chen, Xin
N1 - Publisher Copyright:
© 2023, The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature.
PY - 2023/10
Y1 - 2023/10
N2 - Explainability plays an increasingly important role in machine learning. Because reinforcement learning (RL) involves interactions between states and actions over time, it’s more challenging to explain an RL policy than supervised learning. Furthermore, humans view the world through a causal lens and thus prefer causal explanations over associational ones. Therefore, in this paper, we develop a causal explanation mechanism that quantifies the causal importance of states on actions and such importance over time. We also demonstrate the advantages of our mechanism over state-of-the-art associational methods in terms of RL policy explanation through a series of simulation studies, including crop irrigation, Blackjack, collision avoidance, and lunar lander.
AB - Explainability plays an increasingly important role in machine learning. Because reinforcement learning (RL) involves interactions between states and actions over time, it’s more challenging to explain an RL policy than supervised learning. Furthermore, humans view the world through a causal lens and thus prefer causal explanations over associational ones. Therefore, in this paper, we develop a causal explanation mechanism that quantifies the causal importance of states on actions and such importance over time. We also demonstrate the advantages of our mechanism over state-of-the-art associational methods in terms of RL policy explanation through a series of simulation studies, including crop irrigation, Blackjack, collision avoidance, and lunar lander.
KW - Causal
KW - Explainability
KW - Reinforcement learning
KW - Temporal importance
UR - http://www.scopus.com/inward/record.url?scp=85163783097&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85163783097&partnerID=8YFLogxK
U2 - 10.1007/s10489-023-04649-7
DO - 10.1007/s10489-023-04649-7
M3 - Article
AN - SCOPUS:85163783097
SN - 0924-669X
VL - 53
SP - 22546
EP - 22564
JO - Applied Intelligence
JF - Applied Intelligence
IS - 19
ER -