🏛️ The Sound Crypt
cs.SD: Where Sound papers rest without their code.
2574
Total Papers
1771
No Code
61
Twilight
742
Has Code
28.8%
Survival Rate
R.I.P.
👻
Ghosted
R.I.P.
👻
Ghosted
Rhythmic Foley: A Framework For Seamless Audio-Visual Alignment In Video-to-Audio Synthesis
R.I.P.
👻
Ghosted
FoVNet: Configurable Field-of-View Speech Enhancement with Low Computation and Distortion for Smart Glasses
R.I.P.
💀
404 Not Found
SaMoye: Zero-shot Singing Voice Conversion Model Based on Feature Disentanglement and Enhancement
R.I.P.
👻
Ghosted
HybridVC: Efficient Voice Style Conversion with Text and Audio Prompts
R.I.P.
👻
Ghosted
Anchor-aware Deep Metric Learning for Audio-visual Retrieval
R.I.P.
👻
Ghosted
SongBsAb: A Dual Prevention Approach against Singing Voice Conversion based Illegal Song Covers
📚
📚
The Cartographer
A review-based study on different Text-to-Speech technologies
🌅
💤
Eternal Rest
D4AM: A General Denoising Framework for Downstream Acoustic Models
R.I.P.
👻
Ghosted
Seq2seq for Automatic Paraphasia Detection in Aphasic Speech
R.I.P.
👻
Ghosted
Audio-visual fine-tuning of audio-only ASR models
R.I.P.
👻
Ghosted
On The Open Prompt Challenge In Conditional Audio Generation
R.I.P.
👻
Ghosted
High-Fidelity Speech Synthesis with Minimal Supervision: All Using Diffusion Models
R.I.P.
👻
Ghosted
Representation Learning for Audio Privacy Preservation using Source Separation and Robust Adversarial Learning
R.I.P.
👻
Ghosted
ASTER: Automatic Speech Recognition System Accessibility Testing for Stutterers
R.I.P.
👻
Ghosted
Robust and lightweight audio fingerprint for Automatic Content Recognition
R.I.P.
👻
Ghosted
Heterogeneous Graph Learning for Acoustic Event Classification
R.I.P.
👻
Ghosted
TimbreCLIP: Connecting Timbre to Text and Images
🌅
💤
Eternal Rest
Vis2Mus: Exploring Multimodal Representation Mapping for Controllable Music Generation
R.I.P.
👻
Ghosted
Fast and efficient speech enhancement with variational autoencoders
R.I.P.
👻
Ghosted
Back-Translation-Style Data Augmentation for Mandarin Chinese Polyphone Disambiguation
R.I.P.
👻
Ghosted
Large-scale learning of generalised representations for speaker recognition
R.I.P.
💀
404 Not Found
The Efficacy of Self-Supervised Speech Models for Audio Representations
R.I.P.
👻
Ghosted