XRL-Bench: A Benchmark for Evaluating and Comparing Explainable Reinforcement Learning Techniques

Xiong, Yu; Hu, Zhipeng; Huang, Ye; Wu, Runze; Guan, Kai; Fang, Xingchen; Jiang, Ji; Zhou, Tianze; Hu, Yujing; Liu, Haoyu; Lyu, Tangjie; Fan, Changjie

Computer Science > Artificial Intelligence

arXiv:2402.12685 (cs)

[Submitted on 20 Feb 2024]

Title:XRL-Bench: A Benchmark for Evaluating and Comparing Explainable Reinforcement Learning Techniques

Authors:Yu Xiong, Zhipeng Hu, Ye Huang, Runze Wu, Kai Guan, Xingchen Fang, Ji Jiang, Tianze Zhou, Yujing Hu, Haoyu Liu, Tangjie Lyu, Changjie Fan

View PDF HTML (experimental)

Abstract:Reinforcement Learning (RL) has demonstrated substantial potential across diverse fields, yet understanding its decision-making process, especially in real-world scenarios where rationality and safety are paramount, is an ongoing challenge. This paper delves in to Explainable RL (XRL), a subfield of Explainable AI (XAI) aimed at unravelling the complexities of RL models. Our focus rests on state-explaining techniques, a crucial subset within XRL methods, as they reveal the underlying factors influencing an agent's actions at any given time. Despite their significant role, the lack of a unified evaluation framework hinders assessment of their accuracy and effectiveness. To address this, we introduce XRL-Bench, a unified standardized benchmark tailored for the evaluation and comparison of XRL methods, encompassing three main modules: standard RL environments, explainers based on state importance, and standard evaluators. XRL-Bench supports both tabular and image data for state explanation. We also propose TabularSHAP, an innovative and competitive XRL method. We demonstrate the practical utility of TabularSHAP in real-world online gaming services and offer an open-source benchmark platform for the straightforward implementation and evaluation of XRL methods. Our contributions facilitate the continued progression of XRL technology.

Comments:	10 pages, 5 figures
Subjects:	Artificial Intelligence (cs.AI)
Cite as:	arXiv:2402.12685 [cs.AI]
	(or arXiv:2402.12685v1 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2402.12685

Submission history

From: Yu Xiong [view email]
[v1] Tue, 20 Feb 2024 03:20:37 UTC (673 KB)

Computer Science > Artificial Intelligence

Title:XRL-Bench: A Benchmark for Evaluating and Comparing Explainable Reinforcement Learning Techniques

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Artificial Intelligence

Title:XRL-Bench: A Benchmark for Evaluating and Comparing Explainable Reinforcement Learning Techniques

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators