🏛️ The Sound Crypt
cs.SD: Where Sound papers rest without their code.
2574
Total Papers
1771
No Code
61
Twilight
742
Has Code
28.8%
Survival Rate
R.I.P.
👻
Ghosted
R.I.P.
👻
Ghosted
Time-frequency Network for Robust Speaker Recognition
R.I.P.
👻
Ghosted
Automated Arrangements of Multi-Part Music for Sets of Monophonic Instruments
R.I.P.
👻
Ghosted
Audio Latent Space Cartography
R.I.P.
👻
Ghosted
SIMD-size aware weight regularization for fast neural vocoding on CPU
R.I.P.
👻
Ghosted
Self-Supervised Hierarchical Metrical Structure Modeling
R.I.P.
👻
Ghosted
Speech MOS multi-task learning and rater bias correction
R.I.P.
👻
Ghosted
Adaptive Speech Quality Aware Complex Neural Network for Acoustic Echo Cancellation with Supervised Contrastive Learning
R.I.P.
👻
Ghosted
Relating Human Perception of Musicality to Prediction in a Predictive Coding Model
R.I.P.
👻
Ghosted
Spectrograms Are Sequences of Patches
R.I.P.
👻
Ghosted
V-Cloak: Intelligibility-, Naturalness- & Timbre-Preserving Real-Time Voice Anonymization
R.I.P.
👻
Ghosted
Adaptive re-calibration of channel-wise features for Adversarial Audio Classification
R.I.P.
👻
Ghosted
Speech Enhancement with Perceptually-motivated Optimization and Dual Transformations
R.I.P.
👻
Ghosted
Dynamic Time-Alignment of Dimensional Annotations of Emotion using Recurrent Neural Networks
R.I.P.
👻
Ghosted
SSCFormer: Push the Limit of Chunk-wise Conformer for Streaming ASR Using Sequentially Sampled Chunks and Chunked Causal Convolution
R.I.P.
👻
Ghosted
Dynamic Kernels and Channel Attention for Low Resource Speaker Verification
R.I.P.
👻
Ghosted
Towards zero-shot Text-based voice editing using acoustic context conditioning, utterance embeddings, and reference encoders
R.I.P.
👻
Ghosted
Pronunciation Generation for Foreign Language Words in Intra-Sentential Code-Switching Speech Recognition
R.I.P.
👻
Ghosted
Speaker- and Age-Invariant Training for Child Acoustic Modeling Using Adversarial Multi-Task Learning
R.I.P.
👻
Ghosted
Read it to me: An emotionally aware Speech Narration Application
R.I.P.
👻
Ghosted
Bridging Music and Text with Crowdsourced Music Comments: A Sequence-to-Sequence Framework for Thematic Music Comments Generation
R.I.P.
👻
Ghosted
Chronological Self-Training for Real-Time Speaker Diarization
R.I.P.
👻
Ghosted
WOLONet: Wave Outlooker for Efficient and High Fidelity Speech Synthesis
R.I.P.
👻
Ghosted