🏛️ The Sound Crypt
cs.SD: Where Sound papers rest without their code.
2574
Total Papers
1771
No Code
61
Twilight
742
Has Code
28.8%
Survival Rate
R.I.P.
👻
Ghosted
R.I.P.
👻
Ghosted
Hindi audio-video-Deepfake (HAV-DF): A Hindi language-based Audio-video Deepfake Dataset
R.I.P.
👻
Ghosted
Conditional GAN for Enhancing Diffusion Models in Efficient and Authentic Global Gesture Generation from Audios
R.I.P.
👻
Ghosted
UALM: Unified Audio Language Model for Understanding, Generation and Reasoning
R.I.P.
👻
Ghosted
STOPA: A Database of Systematic VariaTion Of DeePfake Audio for Open-Set Source Tracing and Attribution
R.I.P.
👻
Ghosted
Stereo Sound Event Localization and Detection with Onscreen/offscreen Classification
R.I.P.
👻
Ghosted
Can Large Language Models Predict Audio Effects Parameters from Natural Language?
R.I.P.
👻
Ghosted
Representation Learning for Semantic Alignment of Language, Audio, and Visual Modalities
R.I.P.
👻
Ghosted
Text2midi-InferAlign: Improving Symbolic Music Generation with Inference-Time Alignment
R.I.P.
👻
Ghosted
SteerMusic: Enhanced Musical Consistency for Zero-shot Text-guided and Personalized Music Editing
R.I.P.
👻
Ghosted
Do Audio-Visual Segmentation Models Truly Segment Sounding Objects?
R.I.P.
👻
Ghosted
FolAI: Synchronized Foley Sound Generation with Semantic and Temporal Alignment
R.I.P.
👻
Ghosted
Missing Melodies: AI Music Generation and its "Nearly" Complete Omission of the Global South
R.I.P.
👻
Ghosted
A Theory-Based Explainable Deep Learning Architecture for Music Emotion
R.I.P.
👻
Ghosted
Morse Code-Enabled Speech Recognition for Individuals with Visual and Hearing Impairments
R.I.P.
💀
404 Not Found
Deep Learning Models in Speech Recognition: Measuring GPU Energy Consumption, Impact of Noise and Model Quantization for Edge Deployment
R.I.P.
👻
Ghosted
An Extended Variational Mode Decomposition Algorithm Developed Speech Emotion Recognition Performance
R.I.P.
💀
404 Not Found
DanceAnyWay: Synthesizing Beat-Guided 3D Dances with Randomized Temporal Contrastive Learning
R.I.P.
👻
Ghosted
Parallel and Limited Data Voice Conversion Using Stochastic Variational Deep Kernel Learning
R.I.P.
👻
Ghosted
Knowledge-based Multimodal Music Similarity
R.I.P.
👻
Ghosted
Multi-view Temporal Alignment for Non-parallel Articulatory-to-Acoustic Speech Synthesis
R.I.P.
👻
Ghosted
Improving the Classification of Rare Chords with Unlabeled Data
R.I.P.
👻
Ghosted
Semi-supervised Learning for Singing Synthesis Timbre
R.I.P.
👻
Ghosted