๐ฎ
๐ฎ
The Ethereal
Utilizing Explainability Techniques for Reinforcement Learning Model Assurance
November 27, 2023 ยท Entered Twilight ยท ๐ arXiv.org
Repo contents: .flake8, .github, .gitignore, .gitlab-ci.yml, .pre-commit-config.yaml, .python-version, LICENSE, README.md, arlin, assets, config.yaml, docs, examples, poetry.lock, pyproject.toml, tests
Authors
Alexander Tapley, Kyle Gatesman, Luis Robaina, Brett Bissey, Joseph Weissman
arXiv ID
2311.15838
Category
cs.LG: Machine Learning
Cross-listed
cs.AI
Citations
3
Venue
arXiv.org
Repository
https://github.com/mitre/arlin
โญ 11
Last Checked
3 months ago
Abstract
Explainable Reinforcement Learning (XRL) can provide transparency into the decision-making process of a Deep Reinforcement Learning (DRL) model and increase user trust and adoption in real-world use cases. By utilizing XRL techniques, researchers can identify potential vulnerabilities within a trained DRL model prior to deployment, therefore limiting the potential for mission failure or mistakes by the system. This paper introduces the ARLIN (Assured RL Model Interrogation) Toolkit, an open-source Python library that identifies potential vulnerabilities and critical points within trained DRL models through detailed, human-interpretable explainability outputs. To illustrate ARLIN's effectiveness, we provide explainability visualizations and vulnerability analysis for a publicly available DRL model. The open-source code repository is available for download at https://github.com/mitre/arlin.
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
๐ Similar Papers
In the same crypt โ Machine Learning
๐ฎ
๐ฎ
The Ethereal
Continuous control with deep reinforcement learning
๐
๐
Old Age
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
๐
๐
Old Age
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
๐
๐
Old Age
SGDR: Stochastic Gradient Descent with Warm Restarts
๐ฎ
๐ฎ
The Ethereal