🏛️ The Sound Crypt
cs.SD: Where Sound papers rest without their code.
2574
Total Papers
1771
No Code
61
Twilight
742
Has Code
28.8%
Survival Rate
R.I.P.
👻
Ghosted
R.I.P.
👻
Ghosted
Weak Alignment Supervision from Hybrid Model Improves End-to-end ASR
R.I.P.
👻
Ghosted
EmoDiarize: Speaker Diarization and Emotion Identification from Speech Signals using Convolutional Neural Networks
R.I.P.
👻
Ghosted
Psychoacoustic Challenges Of Speech Enhancement On VoIP Platforms
R.I.P.
👻
Ghosted
Comparative Analysis of Transfer Learning in Deep Learning Text-to-Speech Models on a Few-Shot, Low-Resource, Customized Dataset
R.I.P.
👻
Ghosted
On the Relation between Internal Language Model and Sequence Discriminative Training for Neural Transducers
R.I.P.
👻
Ghosted
Audio Contrastive-based Fine-tuning: Decoupling Representation Learning and Classification
🌅
💤
Eternal Rest
Differentially Private Adapters for Parameter Efficient Acoustic Modeling
R.I.P.
👻
Ghosted
Barwise Music Structure Analysis with the Correlation Block-Matching Segmentation Algorithm
R.I.P.
👻
Ghosted
Music Augmentation and Denoising For Peak-Based Audio Fingerprinting
R.I.P.
👻
Ghosted
A Long-Tail Friendly Representation Framework for Artist and Music Similarity
R.I.P.
👻
Ghosted
Transfer Learning with Semi-Supervised Dataset Annotation for Birdcall Classification
R.I.P.
👻
Ghosted
An Order-Complexity Model for Aesthetic Quality Assessment of Homophony Music Performance
R.I.P.
💀
404 Not Found
Approach to Learning Generalized Audio Representation Through Batch Embedding Covariance Regularization and Constant-Q Transforms
R.I.P.
👻
Ghosted
Model Extraction Attack against Self-supervised Speech Models
R.I.P.
👻
Ghosted
Efficient Incremental Text-to-Speech on GPUs
R.I.P.
👻
Ghosted
Conditional variational autoencoder to improve neural audio synthesis for polyphonic music sound
R.I.P.
👻
Ghosted
Exploiting Device and Audio Data to Tag Music with User-Aware Listening Contexts
R.I.P.
👻
Ghosted
"Seeing Sound": Audio Classification with the Wigner-Wille Distribution and Convolutional Neural Networks
R.I.P.
👻
Ghosted
Predicting phoneme-level prosody latents using AR and flow-based Prior Networks for expressive speech synthesis
R.I.P.
👻
Ghosted
Learning utterance-level representations through token-level acoustic latents prediction for Expressive Speech Synthesis
R.I.P.
👻
Ghosted
Learning Invariant Representation and Risk Minimized for Unsupervised Accent Domain Adaptation
R.I.P.
👻
Ghosted
Individualized Conditioning and Negative Distances for Speaker Separation
R.I.P.
👻
Ghosted