Sim-to-Real Transfer in Deep Reinforcement Learning for Robotics: a Survey
September 24, 2020 Β· The Cartographer Β· π IEEE Symposium Series on Computational Intelligence
"No code URL or promise found in abstract"
"Title-pattern auto-detect: Sim-to-Real Transfer in Deep Reinforcement Learning for Robotics: a Survey"
Evidence collected by the PWNC Scanner
Authors
Wenshuai Zhao, Jorge PeΓ±a Queralta, Tomi Westerlund
arXiv ID
2009.13303
Category
cs.LG: Machine Learning
Cross-listed
cs.RO
Citations
933
Venue
IEEE Symposium Series on Computational Intelligence
Last Checked
1 day ago
Abstract
Deep reinforcement learning has recently seen huge success across multiple areas in the robotics domain. Owing to the limitations of gathering real-world data, i.e., sample inefficiency and the cost of collecting it, simulation environments are utilized for training the different agents. This not only aids in providing a potentially infinite data source, but also alleviates safety concerns with real robots. Nonetheless, the gap between the simulated and real worlds degrades the performance of the policies once the models are transferred into real robots. Multiple research efforts are therefore now being directed towards closing this sim-to-real gap and accomplish more efficient policy transfer. Recent years have seen the emergence of multiple methods applicable to different domains, but there is a lack, to the best of our knowledge, of a comprehensive review summarizing and putting into context the different methods. In this survey paper, we cover the fundamental background behind sim-to-real transfer in deep reinforcement learning and overview the main methods being utilized at the moment: domain randomization, domain adaptation, imitation learning, meta-learning and knowledge distillation. We categorize some of the most relevant recent works, and outline the main application scenarios. Finally, we discuss the main opportunities and challenges of the different approaches and point to the most promising directions.
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
π Similar Papers
In the same crypt β Machine Learning
ποΈ
ποΈ
Transcended
ποΈ
ποΈ
Transcended
Continuous control with deep reinforcement learning
π
π
Old Age
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
π
π
Old Age
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
π
π
Old Age
SGDR: Stochastic Gradient Descent with Warm Restarts
ποΈ
ποΈ
Transcended