Discovering alternative solutions beyond the simplicity bias in recurrent neural networks
September 25, 2025 Β· Declared Dead Β· π arXiv.org
"No code URL or promise found in abstract"
Evidence collected by the PWNC Scanner
Authors
William Qian, Cengiz Pehlevan
arXiv ID
2509.21504
Category
q-bio.NC
Cross-listed
cs.NE
Citations
0
Venue
arXiv.org
Last Checked
3 months ago
Abstract
Training recurrent neural networks (RNNs) to perform neuroscience-style tasks has become a popular way to generate hypotheses for how neural circuits in the brain might perform computations. Recent work has demonstrated that task-trained RNNs possess a strong simplicity bias. In particular, this inductive bias often causes RNNs trained on the same task to collapse on effectively the same solution, typically comprised of fixed-point attractors or other low-dimensional dynamical motifs. While such solutions are readily interpretable, this collapse proves counterproductive for the sake of generating a set of genuinely unique hypotheses for how neural computations might be performed. Here we propose Iterative Neural Similarity Deflation (INSD), a simple method to break this inductive bias. By penalizing linear predictivity of neural activity produced by standard task-trained RNNs, we find an alternative class of solutions to classic neuroscience-style RNN tasks. These solutions appear distinct across a battery of analysis techniques, including representational similarity metrics, dynamical systems analysis, and the linear decodability of task-relevant variables. Moreover, these alternative solutions can sometimes achieve superior performance in difficult or out-of-distribution task regimes. Our findings underscore the importance of moving beyond the simplicity bias to uncover richer and more varied models of neural computation.
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
π Similar Papers
In the same crypt β q-bio.NC
R.I.P.
π»
Ghosted
R.I.P.
π»
Ghosted
SuperSpike: Supervised learning in multi-layer spiking neural networks
R.I.P.
π»
Ghosted
Generic decoding of seen and imagined objects using hierarchical visual features
R.I.P.
π»
Ghosted
Convolutional Neural Networks as a Model of the Visual System: Past, Present, and Future
R.I.P.
π»
Ghosted
A probabilistic atlas of the human thalamic nuclei combining ex vivo MRI and histology
R.I.P.
π»
Ghosted
Why Neurons Have Thousands of Synapses, A Theory of Sequence Memory in Neocortex
Died the same way β π» Ghosted
R.I.P.
π»
Ghosted
Federated Learning: Strategies for Improving Communication Efficiency
R.I.P.
π»
Ghosted
In-Datacenter Performance Analysis of a Tensor Processing Unit
R.I.P.
π»
Ghosted
Deep Convolutional Neural Networks for Computer-Aided Detection: CNN Architectures, Dataset Characteristics and Transfer Learning
R.I.P.
π»
Ghosted