🏛️ The Sound Crypt
cs.SD: Where Sound papers rest without their code.
2574
Total Papers
1771
No Code
61
Twilight
742
Has Code
28.8%
Survival Rate
R.I.P.
👻
Ghosted
R.I.P.
👻
Ghosted
MotionRAG-Diff: A Retrieval-Augmented Diffusion Framework for Long-Term Music-to-Dance Generation
R.I.P.
👻
Ghosted
Segment-Factorized Full-Song Generation on Symbolic Piano Music
R.I.P.
👻
Ghosted
Prompt-aware classifier free guidance for diffusion models
R.I.P.
👻
Ghosted
Pay More Attention To Audio: Mitigating Imbalance of Cross-Modal Attention in Large Audio Language Models
R.I.P.
👻
Ghosted
StereoFoley: Object-Aware Stereo Audio Generation from Video
R.I.P.
👻
Ghosted
PianoVAM: A Multimodal Piano Performance Dataset
R.I.P.
👻
Ghosted
Multi-level SSL Feature Gating for Audio Deepfake Detection
R.I.P.
👻
Ghosted
MoTAS: MoE-Guided Feature Selection from TTS-Augmented Speech for Enhanced Multimodal Alzheimer's Early Screening
R.I.P.
👻
Ghosted
Face2VoiceSync: Lightweight Face-Voice Consistency for Text-Driven Talking Face Generation
R.I.P.
👻
Ghosted
MLLM-based Speech Recognition: When and How is Multimodality Beneficial?
R.I.P.
👻
Ghosted
Improving BERT for Symbolic Music Understanding Using Token Denoising and Pianoroll Prediction
R.I.P.
👻
Ghosted
Exploring Classical Piano Performance Generation with Expressive Music Variational AutoEncoder
R.I.P.
👻
Ghosted
On the Design of Diffusion-based Neural Speech Codecs
R.I.P.
👻
Ghosted
Latent Swap Joint Diffusion for 2D Long-Form Latent Generation
R.I.P.
👻
Ghosted
Whisper-GPT: A Hybrid Representation Audio Large Language Model
R.I.P.
👻
Ghosted
Aligner-Guided Training Paradigm: Advancing Text-to-Speech Models with Aligner Guided Duration
R.I.P.
👻
Ghosted
Efficient VoIP Communications through LLM-based Real-Time Speech Reconstruction and Call Prioritization for Emergency Services
R.I.P.
👻
Ghosted
MusicGen-Chord: Advancing Music Generation through Chord Progressions and Interactive Web-UI
🌅
💤
Eternal Rest
Towards Speaker Identification with Minimal Dataset and Constrained Resources using 1D-Convolution Neural Network
R.I.P.
👻
Ghosted
Generative AI for Music and Audio
R.I.P.
👻
Ghosted
Attention-guided Spectrogram Sequence Modeling with CNNs for Music Genre Classification
R.I.P.
👻
Ghosted
Multiple Consistency-guided Test-Time Adaptation for Contrastive Audio-Language Models with Unlabeled Audio
R.I.P.
👻
Ghosted