🏛️ The Audio & Speech Crypt
eess.AS: Where Audio & Speech papers rest without their code.
Total Papers: 1421
No Code: 1131
Twilight: 24
Has Code: 266
Survival Rate: 18.7%
R.I.P.
👻
Ghosted
Real-Time Auralization for First-Person Vocal Interaction in Immersive Virtual Environments
R.I.P.
👻
Ghosted
Detecting the terminality of speech-turn boundary for spoken interactions in French TV and Radio content
R.I.P.
👻
Ghosted
Quality-Controlled Multimodal Emotion Recognition in Conversations with Identity-Based Transfer Learning and MAMBA Fusion
R.I.P.
👻
Ghosted
On the Difficulty of Token-Level Modeling of Dysfluency and Fluency Shaping Artifacts
R.I.P.
👻
Ghosted
Systematic Evaluation of Time-Frequency Features for Binaural Sound Source Localization
R.I.P.
👻
Ghosted
Codec2Vec: Self-Supervised Speech Representation Learning Using Neural Speech Codecs
R.I.P.
👻
Ghosted
How Far Do SSL Speech Models Listen for Tone? Temporal Focus of Tone Representation under Low-resource Transfer
R.I.P.
👻
Ghosted
Speech Recognition Model Improves Text-to-Speech Synthesis using Fine-Grained Reward
R.I.P.
👻
Ghosted
Unifying Model and Layer Fusion for Speech Foundation Models
R.I.P.
👻
Ghosted
Quantizing Whisper-small: How design choices affect ASR performance
R.I.P.
👻
Ghosted
Pruning as Regularization: Sensitivity-Aware One-Shot Pruning in ASR
R.I.P.
👻
Ghosted
Lost in Phonation: Voice Quality Variation as an Evaluation Dimension for Speech Foundation Models
R.I.P.
👻
Ghosted
Data-Centric Lessons To Improve Speech-Language Pretraining
R.I.P.
👻
Ghosted
Beyond Hearing: Learning Task-agnostic ExG Representations from Earphones via Physiology-informed Tokenization
R.I.P.
👻
Ghosted
Can large audio language models understand child stuttering speech? speech summarization, and source separation
R.I.P.
👻
Ghosted
StutterZero and StutterFormer: End-to-End Speech Conversion for Stuttering Transcription and Correction
R.I.P.
👻
Ghosted
Unsupervised lexicon learning from speech is limited by representations rather than clustering
R.I.P.
👻
Ghosted
Pseudo2Real: Task Arithmetic for Pseudo-Label Correction in Automatic Speech Recognition
R.I.P.
👻
Ghosted
TokenChain: A Discrete Speech Chain via Semantic Token Modeling
R.I.P.
👻
Ghosted
Index-MSR: A high-efficiency multimodal fusion framework for speech recognition
R.I.P.
👻
Ghosted
PerformSinger: Multimodal Singing Voice Synthesis Leveraging Synchronized Lip Cues from Singing Performance Videos
R.I.P.
👻
Ghosted
Attentive AV-FusionNet: Audio-Visual Quality Prediction with Hybrid Attention
R.I.P.
👻
Ghosted