Bridging Auditory Perception and Language Comprehension through MEG-Driven Encoding Models
December 22, 2024 Β· Declared Dead Β· π arXiv.org
"No code URL or promise found in abstract"
Evidence collected by the PWNC Scanner
Authors
Matteo Ciferri, Matteo Ferrante, Nicola Toschi
arXiv ID
2501.03246
Category
q-bio.NC
Cross-listed
cs.CL,
cs.LG,
cs.SD,
eess.AS,
eess.SP
Citations
0
Venue
arXiv.org
Last Checked
3 months ago
Abstract
Understanding the neural mechanisms behind auditory and linguistic processing is key to advancing cognitive neuroscience. In this study, we use Magnetoencephalography (MEG) data to analyze brain responses to spoken language stimuli. We develop two distinct encoding models: an audio-to-MEG encoder, which uses time-frequency decompositions (TFD) and wav2vec2 latent space representations, and a text-to-MEG encoder, which leverages CLIP and GPT-2 embeddings. Both models successfully predict neural activity, demonstrating significant correlations between estimated and observed MEG signals. However, the text-to-MEG model outperforms the audio-based model, achieving higher Pearson Correlation (PC) score. Spatially, we identify that auditory-based embeddings (TFD and wav2vec2) predominantly activate lateral temporal regions, which are responsible for primary auditory processing and the integration of auditory signals. In contrast, textual embeddings (CLIP and GPT-2) primarily engage the frontal cortex, particularly Broca's area, which is associated with higher-order language processing, including semantic integration and language production, especially in the 8-30 Hz frequency range. The strong involvement of these regions suggests that auditory stimuli are processed through more direct sensory pathways, while linguistic information is encoded via networks that integrate meaning and cognitive control. Our results reveal distinct neural pathways for auditory and linguistic information processing, with higher encoding accuracy for text representations in the frontal regions. These insights refine our understanding of the brain's functional architecture in processing auditory and textual information, offering quantitative advancements in the modelling of neural responses to complex language stimuli.
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
π Similar Papers
In the same crypt β q-bio.NC
R.I.P.
π»
Ghosted
R.I.P.
π»
Ghosted
SuperSpike: Supervised learning in multi-layer spiking neural networks
R.I.P.
π»
Ghosted
Generic decoding of seen and imagined objects using hierarchical visual features
R.I.P.
π»
Ghosted
Convolutional Neural Networks as a Model of the Visual System: Past, Present, and Future
R.I.P.
π»
Ghosted
A probabilistic atlas of the human thalamic nuclei combining ex vivo MRI and histology
R.I.P.
π»
Ghosted
Why Neurons Have Thousands of Synapses, A Theory of Sequence Memory in Neocortex
Died the same way β π» Ghosted
R.I.P.
π»
Ghosted
Federated Learning: Strategies for Improving Communication Efficiency
R.I.P.
π»
Ghosted
In-Datacenter Performance Analysis of a Tensor Processing Unit
R.I.P.
π»
Ghosted
Deep Convolutional Neural Networks for Computer-Aided Detection: CNN Architectures, Dataset Characteristics and Transfer Learning
R.I.P.
π»
Ghosted