🏛️ The Sound Crypt
cs.SD: Where Sound papers rest without their code.
Total Papers: 2574
No Code: 1771
Twilight: 61
Has Code: 742
Survival Rate: 28.8%
R.I.P.
👻
Ghosted
Tiny-Align: Bridging Automatic Speech Recognition and Large Language Model on the Edge
R.I.P.
👻
Ghosted
A Novel Speech Analysis and Correction Tool for Arabic-Speaking Children
R.I.P.
👻
Ghosted
Multi-class Decoding of Attended Speaker Direction Using Electroencephalogram and Audio Spatial Spectrum
R.I.P.
👻
Ghosted
CAFE A Novel Code switching Dataset for Algerian Dialect French and English
R.I.P.
👻
Ghosted
Investigation of Speaker Representation for Target-Speaker Speech Processing
R.I.P.
👻
Ghosted
Full-Rank No More: Low-Rank Weight Training for Modern Speech Recognition Models
R.I.P.
👻
Ghosted
Training Large ASR Encoders with Differential Privacy
R.I.P.
👻
Ghosted
Mel-Refine: A Plug-and-Play Approach to Refine Mel-Spectrogram in Audio Generation
R.I.P.
👻
Ghosted
PIAST: A Multimodal Piano Dataset with Audio, Symbolic and Text
R.I.P.
👻
Ghosted
Leveraging LLM Embeddings for Cross Dataset Label Alignment and Zero Shot Music Emotion Prediction
R.I.P.
👻
Ghosted
Quantitative Analysis of Audio-Visual Tasks: An Information-Theoretic Perspective
R.I.P.
👻
Ghosted
ChordSync: Conformer-Based Alignment of Chord Annotations to Music Audio
R.I.P.
👻
Ghosted
Improving Speech Enhancement by Integrating Inter-Channel and Band Features with Dual-branch Conformer
R.I.P.
👻
Ghosted
Straight Through Gumbel Softmax Estimator based Bimodal Neural Architecture Search for Audio-Visual Deepfake Detection
R.I.P.
👻
Ghosted
FastAST: Accelerating Audio Spectrogram Transformer via Token Merging and Cross-Model Knowledge Distillation
R.I.P.
👻
Ghosted
Carnatic Raga Identification System using Rigorous Time-Delay Neural Network
R.I.P.
👻
Ghosted
Music Enhancement with Deep Filters: A Technical Report for The ICASSP 2024 Cadenza Challenge
R.I.P.
👻
Ghosted
MR-MT3: Memory Retaining Multi-Track Music Transcription to Mitigate Instrument Leakage
R.I.P.
👻
Ghosted
Robust Wake Word Spotting With Frame-Level Cross-Modal Attention Based Audio-Visual Conformer
R.I.P.
👻
Ghosted
Acoustic models of Brazilian Portuguese Speech based on Neural Transformers
R.I.P.
👻
Ghosted
The AeroSonicDB (YPAD-0523) Dataset for Acoustic Detection and Classification of Aircraft
R.I.P.
👻
Ghosted
Combinatorial music generation model with song structure graph analysis
R.I.P.
👻
Ghosted