What deep reinforcement learning tells us about human motor learning and vice-versa
August 23, 2022 Β· Declared Dead Β· π arXiv.org
"No code URL or promise found in abstract"
Evidence collected by the PWNC Scanner
Authors
Michele Garibbo, Casimir Ludwig, Nathan Lepora, Laurence Aitchison
arXiv ID
2208.10892
Category
q-bio.NC
Cross-listed
cs.LG,
cs.NE
Citations
1
Venue
arXiv.org
Last Checked
3 months ago
Abstract
Machine learning and specifically reinforcement learning (RL) has been extremely successful in helping us to understand neural decision making processes. However, RL's role in understanding other neural processes especially motor learning is much less well explored. To explore this connection, we investigated how recent deep RL methods correspond to the dominant motor learning framework in neuroscience, error-based learning. Error-based learning can be probed using a mirror reversal adaptation paradigm, where it produces distinctive qualitative predictions that are observed in humans. We therefore tested three major families of modern deep RL algorithm on a mirror reversal perturbation. Surprisingly, all of the algorithms failed to mimic human behaviour and indeed displayed qualitatively different behaviour from that predicted by error-based learning. To fill this gap, we introduce a novel deep RL algorithm: model-based deterministic policy gradients (MB-DPG). MB-DPG draws inspiration from error-based learning by explicitly relying on the observed outcome of actions. We show MB-DPG captures (human) error-based learning under mirror-reversal and rotational perturbation. Next, we demonstrate error-based learning in the form of MB-DPG learns faster than canonical model-free algorithms on complex arm-based reaching tasks, while being more robust to (forward) model misspecification than model-based RL. These findings highlight the gap between current deep RL methods and human motor adaptation and offer a route to closing this gap, facilitating future beneficial interaction between between the two fields.
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
π Similar Papers
In the same crypt β q-bio.NC
R.I.P.
π»
Ghosted
R.I.P.
π»
Ghosted
SuperSpike: Supervised learning in multi-layer spiking neural networks
R.I.P.
π»
Ghosted
Generic decoding of seen and imagined objects using hierarchical visual features
R.I.P.
π»
Ghosted
Convolutional Neural Networks as a Model of the Visual System: Past, Present, and Future
R.I.P.
π»
Ghosted
A probabilistic atlas of the human thalamic nuclei combining ex vivo MRI and histology
R.I.P.
π»
Ghosted
Why Neurons Have Thousands of Synapses, A Theory of Sequence Memory in Neocortex
Died the same way β π» Ghosted
R.I.P.
π»
Ghosted
Federated Learning: Strategies for Improving Communication Efficiency
R.I.P.
π»
Ghosted
In-Datacenter Performance Analysis of a Tensor Processing Unit
R.I.P.
π»
Ghosted
Deep Convolutional Neural Networks for Computer-Aided Detection: CNN Architectures, Dataset Characteristics and Transfer Learning
R.I.P.
π»
Ghosted