🏛️ The Audio & Speech Crypt
eess.AS: Where Audio & Speech papers rest without their code.
1388
Total Papers
1179
No Code
18
Twilight
191
Has Code
13.8%
Survival Rate
R.I.P.
👻
Ghosted
R.I.P.
👻
Ghosted
An Effective End-to-End Modeling Approach for Mispronunciation Detection
R.I.P.
👻
Ghosted
TTS-Portuguese Corpus: a corpus for speech synthesis in Brazilian Portuguese
R.I.P.
👻
Ghosted
Mixture of Inference Networks for VAE-based Audio-visual Speech Enhancement
R.I.P.
👻
Ghosted
Training Multi-Speaker Neural Text-to-Speech Systems using Speaker-Imbalanced Speech Corpora
R.I.P.
👻
Ghosted
Speech-Based Depression Prediction Using Encoder-Weight-Only Transfer Learning and a Large Corpus
R.I.P.
👻
Ghosted
SALMONN-omni: A Codec-free LLM for Full-duplex Speech Understanding and Generation
R.I.P.
👻
Ghosted
Boosting Cross-Domain Speech Recognition with Self-Supervision
R.I.P.
👻
Ghosted
Semi-FedSER: Semi-supervised Learning for Speech Emotion Recognition On Federated Learning using Multiview Pseudo-Labeling
R.I.P.
👻
Ghosted
Efficient neural speech synthesis for low-resource languages through multilingual modeling
R.I.P.
👻
Ghosted
A Pyramid Recurrent Network for Predicting Crowdsourced Speech-Quality Ratings of Real-World Signals
R.I.P.
👻
Ghosted
I4U Submission to NIST SRE 2018: Leveraging from a Decade of Shared Experiences
R.I.P.
👻
Ghosted
On Training Targets and Objective Functions for Deep-Learning-Based Audio-Visual Speech Enhancement
R.I.P.
👻
Ghosted
Multilingual and Unsupervised Subword Modeling for Zero-Resource Languages
R.I.P.
👻
Ghosted
Malafide: a novel adversarial convolutive noise attack against deepfake and spoofing detection systems
R.I.P.
👻
Ghosted
Adapter-Based Extension of Multi-Speaker Text-to-Speech Model for New Speakers
R.I.P.
👻
Ghosted
Simple Pooling Front-ends For Efficient Audio Classification
R.I.P.
👻
Ghosted
Smartajweed Automatic Recognition of Arabic Quranic Recitation Rules
R.I.P.
👻
Ghosted
Group Communication with Context Codec for Lightweight Source Separation
R.I.P.
👻
Ghosted
Prosodic Representation Learning and Contextual Sampling for Neural Text-to-Speech
R.I.P.
👻
Ghosted
VAW-GAN for Singing Voice Conversion with Non-parallel Training Data
R.I.P.
👻
Ghosted
Audio-visual Multi-channel Recognition of Overlapped Speech
R.I.P.
👻
Ghosted
Multi-Task Network for Noise-Robust Keyword Spotting and Speaker Verification using CTC-based Soft VAD and Global Query Attention
R.I.P.
👻
Ghosted