RMBench: Benchmarking Deep Reinforcement Learning for Robotic Manipulator Control

October 20, 2022 ยท Entered Twilight ยท ๐Ÿ› IEEE/RJS International Conference on Intelligent RObots and Systems

๐Ÿ’ค TWILIGHT: Eternal Rest
Repo abandoned since publication

Repo contents: .gitignore, 00_DrQv2, 01_VPG, 02_TRPO, 03_PPO, 04_DDPG, 05_SAC, 06_TD3, README.md, assets, dmc.py, logger.py, plot_curve.py, requirements.txt, utils.py, video.py

Authors Yanfei Xiang, Xin Wang, Shu Hu, Bin Zhu, Xiaomeng Huang, Xi Wu, Siwei Lyu arXiv ID 2210.11262 Category cs.RO: Robotics Cross-listed cs.AI Citations 10 Venue IEEE/RJS International Conference on Intelligent RObots and Systems Repository https://github.com/xiangyanfei212/RMBench-2022 โญ 10 Last Checked 1 month ago
Abstract
Reinforcement learning is applied to solve actual complex tasks from high-dimensional, sensory inputs. The last decade has developed a long list of reinforcement learning algorithms. Recent progress benefits from deep learning for raw sensory signal representation. One question naturally arises: how well do they perform concerning different robotic manipulation tasks? Benchmarks use objective performance metrics to offer a scientific way to compare algorithms. In this paper, we present RMBench, the first benchmark for robotic manipulations, which have high-dimensional continuous action and state spaces. We implement and evaluate reinforcement learning algorithms that directly use observed pixels as inputs. We report their average performance and learning curves to show their performance and stability of training. Our study concludes that none of the studied algorithms can handle all tasks well, soft Actor-Critic outperforms most algorithms in average reward and stability, and an algorithm combined with data augmentation may facilitate learning policies. Our code is publicly available at https://github.com/xiangyanfei212/RMBench-2022, including all benchmark tasks and studied algorithms.
Community shame:
Not yet rated
Community Contributions

Found the code? Know the venue? Think something is wrong? Let us know!

๐Ÿ“œ Similar Papers

In the same crypt โ€” Robotics