🏛️ The Sound Crypt
cs.SD: Where Sound papers rest without their code.
2574
Total Papers
1771
No Code
61
Twilight
742
Has Code
28.8%
Survival Rate
R.I.P.
💀
404 Not Found
R.I.P.
👻
Ghosted
A large-scale and PCR-referenced vocal audio dataset for COVID-19
R.I.P.
👻
Ghosted
Impact of annotation modality on label quality and model performance in the automatic assessment of laughter in-the-wild
R.I.P.
👻
Ghosted
Style-Label-Free: Cross-Speaker Style Transfer by Quantized VAE and Speaker-wise Normalization in Speech Synthesis
R.I.P.
👻
Ghosted
Explicit Intensity Control for Accented Text-to-speech
R.I.P.
👻
Ghosted
Knowledge Transfer For On-Device Speech Emotion Recognition with Neural Structured Learning
R.I.P.
👻
Ghosted
Learning Phone Recognition from Unpaired Audio and Phone Sequences Based on Generative Adversarial Network
R.I.P.
👻
Ghosted
Synthesizing Personalized Non-speech Vocalization from Discrete Speech Representations
R.I.P.
👻
Ghosted
Acoustic Modeling for End-to-End Empathetic Dialogue Speech Synthesis Using Linguistic and Prosodic Contexts of Dialogue History
R.I.P.
👻
Ghosted
From Environmental Sound Representation to Robustness of 2D CNN Models Against Adversarial Attacks
📚
📚
The Cartographer
Adversarial Attacks on Speech Recognition Systems for Mission-Critical Applications: A Survey
R.I.P.
👻
Ghosted
GMM based multi-stage Wiener filtering for low SNR speech enhancement
R.I.P.
👻
Ghosted
Musical Audio Similarity with Self-supervised Convolutional Neural Networks
R.I.P.
👻
Ghosted
The GigaMIDI Dataset with Features for Expressive Music Performance Detection
R.I.P.
👻
Ghosted
Music Discovery Dialogue Generation Using Human Intent Analysis and Large Language Models
R.I.P.
👻
Ghosted
DiffATR: Diffusion-based Generative Modeling for Audio-Text Retrieval
R.I.P.
👻
Ghosted
MIDI-to-Tab: Guitar Tablature Inference via Masked Language Modeling
R.I.P.
👻
Ghosted
Active Bird2Vec: Towards End-to-End Bird Sound Monitoring with Transformers
R.I.P.
👻
Ghosted
EnchantDance: Unveiling the Potential of Music-Driven Dance Movement
R.I.P.
👻
Ghosted
Low-latency Speech Enhancement via Speech Token Generation
🌅
💤
Eternal Rest
Bridging High-Quality Audio and Video via Language for Sound Effects Retrieval from Visual Queries
R.I.P.
👻
Ghosted
MSAC: Multiple Speech Attribute Control Method for Reliable Speech Emotion Recognition
R.I.P.
👻
Ghosted
Transavs: End-To-End Audio-Visual Segmentation With Transformer
R.I.P.
👻
Ghosted