DREAM Architecture: a Developmental Approach to Open-Ended Learning in Robotics
May 13, 2020 ยท Declared Dead ยท ๐ arXiv.org
"No code URL or promise found in abstract"
Evidence collected by the PWNC Scanner
Authors
Stephane Doncieux, Nicolas Bredeche, Lรฉni Le Goff, Benoรฎt Girard, Alexandre Coninx, Olivier Sigaud, Mehdi Khamassi, Natalia Dรญaz-Rodrรญguez, David Filliat, Timothy Hospedales, A. Eiben, Richard Duro
arXiv ID
2005.06223
Category
cs.AI: Artificial Intelligence
Cross-listed
cs.LG,
cs.NE,
cs.RO
Citations
19
Venue
arXiv.org
Last Checked
4 months ago
Abstract
Robots are still limited to controlled conditions, that the robot designer knows with enough details to endow the robot with the appropriate models or behaviors. Learning algorithms add some flexibility with the ability to discover the appropriate behavior given either some demonstrations or a reward to guide its exploration with a reinforcement learning algorithm. Reinforcement learning algorithms rely on the definition of state and action spaces that define reachable behaviors. Their adaptation capability critically depends on the representations of these spaces: small and discrete spaces result in fast learning while large and continuous spaces are challenging and either require a long training period or prevent the robot from converging to an appropriate behavior. Beside the operational cycle of policy execution and the learning cycle, which works at a slower time scale to acquire new policies, we introduce the redescription cycle, a third cycle working at an even slower time scale to generate or adapt the required representations to the robot, its environment and the task. We introduce the challenges raised by this cycle and we present DREAM (Deferred Restructuring of Experience in Autonomous Machines), a developmental cognitive architecture to bootstrap this redescription process stage by stage, build new state representations with appropriate motivations, and transfer the acquired knowledge across domains or tasks or even across robots. We describe results obtained so far with this approach and end up with a discussion of the questions it raises in Neuroscience.
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
๐ Similar Papers
In the same crypt โ Artificial Intelligence
๐
๐
The Cartographer
R.I.P.
๐ป
Ghosted
Explanation in Artificial Intelligence: Insights from the Social Sciences
R.I.P.
๐ป
Ghosted
Federated Machine Learning: Concept and Applications
R.I.P.
๐ป
Ghosted
Counterfactual Explanations without Opening the Black Box: Automated Decisions and the GDPR
R.I.P.
๐ป
Ghosted
DeepAR: Probabilistic Forecasting with Autoregressive Recurrent Networks
R.I.P.
๐ป
Ghosted
Rainbow: Combining Improvements in Deep Reinforcement Learning
Died the same way โ ๐ป Ghosted
R.I.P.
๐ป
Ghosted
Federated Learning: Strategies for Improving Communication Efficiency
R.I.P.
๐ป
Ghosted
In-Datacenter Performance Analysis of a Tensor Processing Unit
R.I.P.
๐ป
Ghosted
Deep Convolutional Neural Networks for Computer-Aided Detection: CNN Architectures, Dataset Characteristics and Transfer Learning
R.I.P.
๐ป
Ghosted