🏛️ The Audio & Speech Crypt
eess.AS: Where Audio & Speech papers rest without their code.
1421
Total Papers
1131
No Code
24
Twilight
266
Has Code
18.7%
Survival Rate
R.I.P.
👻
Ghosted
R.I.P.
👻
Ghosted
asya: Mindful verbal communication using deep learning
R.I.P.
👻
Ghosted
Shouted Speech Compensation for Speaker Verification Robust to Vocal Effort Conditions
R.I.P.
👻
Ghosted
Audio-Visual Decision Fusion for WFST-based and seq2seq Models
R.I.P.
👻
Ghosted
Audio-Visual Target Speaker Enhancement on Multi-Talker Environment using Event-Driven Cameras
R.I.P.
👻
Ghosted
A Dataset for measuring reading levels in India at scale
R.I.P.
👻
Ghosted
Generative Audio Synthesis with a Parametric Model
R.I.P.
👻
Ghosted
Memory Requirement Reduction of Deep Neural Networks Using Low-bit Quantization of Parameters
R.I.P.
👻
Ghosted
Comparative Study between Adversarial Networks and Classical Techniques for Speech Enhancement
R.I.P.
👻
Ghosted
Improving Noise Robustness In Speaker Identification Using A Two-Stage Attention Model
R.I.P.
👻
Ghosted
Does Speech enhancement of publicly available data help build robust Speech Recognition Systems?
R.I.P.
👻
Ghosted
Non-native Speaker Verification for Spoken Language Assessment
R.I.P.
👻
Ghosted
Real to H-space Encoder for Speech Recognition
R.I.P.
👻
Ghosted
A Fully Time-domain Neural Model for Subband-based Speech Synthesizer
R.I.P.
👻
Ghosted
Automatic Organisation, Segmentation, and Filtering of User-Generated Audio Content
R.I.P.
👻
Ghosted
CrossSpeech++: Cross-lingual Speech Synthesis with Decoupled Language and Speaker Generation
R.I.P.
👻
Ghosted
Enhancing Audiovisual Speech Recognition through Bifocal Preference Optimization
R.I.P.
👻
Ghosted
SongGLM: Lyric-to-Melody Generation with 2D Alignment Encoding and Multi-Task Pre-Training
R.I.P.
👻
Ghosted
Investigating Acoustic-Textual Emotional Inconsistency Information for Automatic Depression Detection
R.I.P.
👻
Ghosted
Swin-BERT: A Feature Fusion System designed for Speech-based Alzheimer's Dementia Detection
R.I.P.
👻
Ghosted
Episodic fine-tuning prototypical networks for optimization-based few-shot learning: Application to audio classification
R.I.P.
👻
Ghosted
Dialogue Understandability: Why are we streaming movies with subtitles?
R.I.P.
👻
Ghosted
CoAVT: A Cognition-Inspired Unified Audio-Visual-Text Pre-Training Model for Multimodal Processing
🌅
💤
Eternal Rest