A Computational Theory of Learning Flexible Reward-Seeking Behavior with Place Cells
April 22, 2022 Β· Declared Dead Β· π bioRxiv
"No code URL or promise found in abstract"
Evidence collected by the PWNC Scanner
Authors
Yuanxiang Gao
arXiv ID
2204.11843
Category
q-bio.NC
Cross-listed
cs.AI,
cs.LG,
cs.NE
Citations
0
Venue
bioRxiv
Last Checked
3 months ago
Abstract
An important open question in computational neuroscience is how various spatially tuned neurons, such as place cells, are used to support the learning of reward-seeking behavior of an animal. Existing computational models either lack biological plausibility or fall short of behavioral flexibility when environments change. In this paper, we propose a computational theory that achieves behavioral flexibility with better biological plausibility. We first train a mixture of Gaussian distributions to model the ensemble of firing fields of place cells. Then we propose a Hebbian-like rule to learn the synaptic strength matrix among place cells. This matrix is interpreted as the transition rate matrix of a continuous time Markov chain to generate the sequential replay of place cells. During replay, the synaptic strengths from place cells to medium spiny neurons (MSN) are learned by a temporal-difference like rule to store place-reward associations. After replay, the activation of MSN will ramp up when an animal approaches the rewarding place, so the animal can move along the direction where the MSN activation is increasing to find the rewarding place. We implement our theory into a high-fidelity virtual rat in the MuJoCo physics simulator. In a complex maze, the rat shows significantly better learning efficiency and behavioral flexibility than a rat that implements a neuroscience-inspired reinforcement learning algorithm, deep Q-network.
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
π Similar Papers
In the same crypt β q-bio.NC
R.I.P.
π»
Ghosted
R.I.P.
π»
Ghosted
SuperSpike: Supervised learning in multi-layer spiking neural networks
R.I.P.
π»
Ghosted
Generic decoding of seen and imagined objects using hierarchical visual features
R.I.P.
π»
Ghosted
Convolutional Neural Networks as a Model of the Visual System: Past, Present, and Future
R.I.P.
π»
Ghosted
A probabilistic atlas of the human thalamic nuclei combining ex vivo MRI and histology
R.I.P.
π»
Ghosted
Why Neurons Have Thousands of Synapses, A Theory of Sequence Memory in Neocortex
Died the same way β π» Ghosted
R.I.P.
π»
Ghosted
Federated Learning: Strategies for Improving Communication Efficiency
R.I.P.
π»
Ghosted
In-Datacenter Performance Analysis of a Tensor Processing Unit
R.I.P.
π»
Ghosted
Deep Convolutional Neural Networks for Computer-Aided Detection: CNN Architectures, Dataset Characteristics and Transfer Learning
R.I.P.
π»
Ghosted