Prioritized Sweeping Neural DynaQ with Multiple Predecessors, and Hippocampal Replays
February 15, 2018 ยท Declared Dead ยท ๐ Living Machines
"No code URL or promise found in abstract"
Evidence collected by the PWNC Scanner
Authors
Lise Aubin, Mehdi Khamassi, Benoรฎt Girard
arXiv ID
1802.05594
Category
cs.AI: Artificial Intelligence
Cross-listed
cs.NE
Citations
10
Venue
Living Machines
Last Checked
4 months ago
Abstract
During sleep and awake rest, the hippocampus replays sequences of place cells that have been activated during prior experiences. These have been interpreted as a memory consolidation process, but recent results suggest a possible interpretation in terms of reinforcement learning. The Dyna reinforcement learning algorithms use off-line replays to improve learning. Under limited replay budget, a prioritized sweeping approach, which requires a model of the transitions to the predecessors, can be used to improve performance. We investigate whether such algorithms can explain the experimentally observed replays. We propose a neural network version of prioritized sweeping Q-learning, for which we developed a growing multiple expert algorithm, able to cope with multiple predecessors. The resulting architecture is able to improve the learning of simulated agents confronted to a navigation task. We predict that, in animals, learning the world model should occur during rest periods, and that the corresponding replays should be shuffled.
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
๐ Similar Papers
In the same crypt โ Artificial Intelligence
๐
๐
The Cartographer
R.I.P.
๐ป
Ghosted
Explanation in Artificial Intelligence: Insights from the Social Sciences
R.I.P.
๐ป
Ghosted
Federated Machine Learning: Concept and Applications
R.I.P.
๐ป
Ghosted
Counterfactual Explanations without Opening the Black Box: Automated Decisions and the GDPR
R.I.P.
๐ป
Ghosted
DeepAR: Probabilistic Forecasting with Autoregressive Recurrent Networks
R.I.P.
๐ป
Ghosted
Rainbow: Combining Improvements in Deep Reinforcement Learning
Died the same way โ ๐ป Ghosted
R.I.P.
๐ป
Ghosted
Federated Learning: Strategies for Improving Communication Efficiency
R.I.P.
๐ป
Ghosted
In-Datacenter Performance Analysis of a Tensor Processing Unit
R.I.P.
๐ป
Ghosted
Deep Convolutional Neural Networks for Computer-Aided Detection: CNN Architectures, Dataset Characteristics and Transfer Learning
R.I.P.
๐ป
Ghosted