🏛️ The Audio & Speech Crypt
eess.AS: Where Audio & Speech papers rest without their code.
Total Papers: 1407
No Code: 1131
Twilight: 24
Has Code: 252
Survival Rate: 17.9%
R.I.P.
👻
Ghosted
Towards Generating Diverse Audio Captions via Adversarial Training
R.I.P.
👻
Ghosted
A Comparative Study of Data Augmentation Techniques for Deep Learning Based Emotion Recognition
R.I.P.
👻
Ghosted
Does Joint Training Really Help Cascaded Speech Translation?
R.I.P.
👻
Ghosted
A Study of Modeling Rising Intonation in Cantonese Neural Speech Synthesis
R.I.P.
👻
Ghosted
PoeticTTS -- Controllable Poetry Reading for Literary Studies
R.I.P.
👻
Ghosted
GlowVC: Mel-spectrogram space disentangling model for language-independent text-free voice conversion
R.I.P.
👻
Ghosted
Nonwords Pronunciation Classification in Language Development Tests for Preschool Children
R.I.P.
👻
Ghosted
End-to-end speech recognition modeling from de-identified data
R.I.P.
👻
Ghosted
Defense against Adversarial Attacks on Hybrid Speech Recognition using Joint Adversarial Fine-tuning with Denoiser
R.I.P.
👻
Ghosted
A Speech Representation Anonymization Framework via Selective Noise Perturbation
📚
📚
The Cartographer
Voice Analysis for Stress Detection and Application in Virtual Reality to Improve Public Speaking in Real-time: A Review
R.I.P.
👻
Ghosted
Device-Directed Speech Detection: Regularization via Distillation for Weakly-Supervised Models
R.I.P.
👻
Ghosted
Literary and Colloquial Tamil Dialect Identification
R.I.P.
👻
Ghosted
Comparative Analysis of Personalized Voice Activity Detection Systems: Assessing Real-World Effectiveness
R.I.P.
👻
Ghosted
Should you use a probabilistic duration model in TTS? Probably! Especially for spontaneous speech
📚
📚
The Cartographer
A Comprehensive Survey on Generative AI for Video-to-Music Generation
R.I.P.
👻
Ghosted
Speech Recognition With LLMs Adapted to Disordered Speech Using Reinforcement Learning
R.I.P.
👻
Ghosted
From KAN to GR-KAN: Advancing Speech Enhancement with KAN-Based Methodology
R.I.P.
👻
Ghosted
CA-SSLR: Condition-Aware Self-Supervised Learning Representation for Generalized Speech Processing
R.I.P.
👻
Ghosted
Open-Amp: Synthetic Data Framework for Audio Effect Foundation Models
R.I.P.
👻
Ghosted
AlignSTS: Speech-to-Singing Conversion via Cross-Modal Alignment
R.I.P.
👻
Ghosted
Few Shot Adaptive Normalization Driven Multi-Speaker Speech Synthesis
R.I.P.
👻
Ghosted