🏛️ The Sound Crypt
cs.SD: Where Sound papers rest without their code.
2521
Total Papers
1771
No Code
61
Twilight
689
Has Code
27.3%
Survival Rate
R.I.P.
👻
Ghosted
R.I.P.
👻
Ghosted
Improving Transformer-based Conversational ASR by Inter-Sentential Attention Mechanism
R.I.P.
👻
Ghosted
Real-time Speech Emotion Recognition Based on Syllable-Level Feature Extraction
R.I.P.
👻
Ghosted
Parameter-Efficient Learning for Text-to-Speech Accent Adaptation
R.I.P.
👻
Ghosted
Seeing Sound, Hearing Sight: Uncovering Modality Bias and Conflict of AI models in Sound Localization
R.I.P.
👻
Ghosted
Automatic source localization and spectra generation from sparse beamforming maps
R.I.P.
👻
Ghosted
Phonetic Posteriorgrams based Many-to-Many Singing Voice Conversion via Adversarial Training
R.I.P.
👻
Ghosted
WaveNODE: A Continuous Normalizing Flow for Speech Synthesis
R.I.P.
👻
Ghosted
Machine learning for the recognition of emotion in the speech of couples in psychotherapy using the Stanford Suppes Brain Lab Psychotherapy Dataset
R.I.P.
👻
Ghosted
Weakly Supervised Training of Speaker Identification Models
R.I.P.
👻
Ghosted
A Convex Approximation of the Relaxed Binaural Beamforming Optimization Problem
R.I.P.
👻
Ghosted
Deep Learning of Human Perception in Audio Event Classification
R.I.P.
👻
Ghosted
On Residual CNN in text-dependent speaker verification task
R.I.P.
👻
Ghosted
OBTAIN: Real-Time Beat Tracking in Audio Signals
R.I.P.
👻
Ghosted
Melody Generation for Pop Music via Word Representation of Musical Properties
R.I.P.
👻
Ghosted
PCA/LDA Approach for Text-Independent Speaker Recognition
R.I.P.
👻
Ghosted
Max-margin Metric Learning for Speaker Recognition
R.I.P.
👻
Ghosted
Speech Separation with Pretrained Frontend to Minimize Domain Mismatch
R.I.P.
💀
404 Not Found
Contextual Cross-Modal Attention for Audio-Visual Deepfake Detection and Localization
R.I.P.
👻
Ghosted
Zero-Shot Fake Video Detection by Audio-Visual Consistency
R.I.P.
👻
Ghosted
Separate in the Speech Chain: Cross-Modal Conditional Audio-Visual Target Speech Extraction
R.I.P.
👻
Ghosted
Self-Supervised Disentangled Representation Learning for Robust Target Speech Extraction
R.I.P.
👻
Ghosted
StyleSpeech: Self-supervised Style Enhancing with VQ-VAE-based Pre-training for Expressive Audiobook Speech Synthesis
R.I.P.
👻
Ghosted