🏛️ The Audio & Speech Crypt
eess.AS: Where Audio & Speech papers rest without their code.
1421
Total Papers
1131
No Code
24
Twilight
266
Has Code
18.7%
Survival Rate
R.I.P.
💀
404 Not Found
R.I.P.
👻
Ghosted
REWIND: Speech Time Reversal for Enhancing Speaker Representations in Diffusion-based Voice Conversion
R.I.P.
⚰️
The Empty Tomb
MAVFlow: Preserving Paralinguistic Elements with Conditional Flow Matching for Zero-Shot AV2AV Multilingual Translation
R.I.P.
👻
Ghosted
Meta-Learning-Based Delayless Subband Adaptive Filter using Complex Self-Attention for Active Noise Control
🌅
💤
Eternal Rest
Speech Watermarking with Discrete Intermediate Representations
R.I.P.
👻
Ghosted
TACO: Training-free Sound Prompted Segmentation via Semantically Constrained Audio-visual CO-factorization
R.I.P.
👻
Ghosted
Automating Feedback Analysis in Surgical Training: Detection, Categorization, and Assessment
R.I.P.
👻
Ghosted
Late fusion ensembles for speech recognition on diverse input audio representations
R.I.P.
👻
Ghosted
Towards Maximum Likelihood Training for Transducer-based Streaming Speech Recognition
R.I.P.
👻
Ghosted
Enhancing Code-Switching ASR Leveraging Non-Peaky CTC Loss and Deep Language Posterior Injection
R.I.P.
👻
Ghosted
A Context-Based Numerical Format Prediction for a Text-To-Speech System
R.I.P.
👻
Ghosted
Memory-Efficient Training for Text-Dependent SV with Independent Pre-trained Models
R.I.P.
👻
Ghosted
Cluster-to-Predict Affect Contours from Speech
R.I.P.
👻
Ghosted
Improved Vocal Effort Transfer Vector Estimation for Vocal Effort-Robust Speaker Verification
R.I.P.
👻
Ghosted
Speaker Diaphragm Excursion Prediction: deep attention and online adaptation
R.I.P.
👻
Ghosted
Improved Lossless Coding for Storage and Transmission of Multichannel Immersive Audio
R.I.P.
👻
Ghosted
Text-to-speech for the hearing impaired
R.I.P.
👻
Ghosted
Comparison of Speaker Role Recognition and Speaker Enrollment Protocol for conversational Clinical Interviews
R.I.P.
👻
Ghosted
BERT for Joint Multichannel Speech Dereverberation with Spatial-aware Tasks
R.I.P.
👻
Ghosted
End-to-End Trainable Self-Attentive Shallow Network for Text-Independent Speaker Verification
R.I.P.
👻
Ghosted
Deep F-measure Maximization for End-to-End Speech Understanding
R.I.P.
👻
Ghosted
Applying Speech Tempo-Derived Features, BoAW and Fisher Vectors to Detect Elderly Emotion and Speech in Surgical Masks
R.I.P.
👻
Ghosted
Weakly Supervised Construction of ASR Systems with Massive Video Data
R.I.P.
👻
Ghosted