🏛️ The Sound Crypt
cs.SD: Where Sound papers rest without their code.
Total Papers: 2427
No Code: 1844
Twilight: 37
Has Code: 546
Survival Rate: 22.5%
R.I.P.
👻
Ghosted
Freeze-Omni: A Smart and Low Latency Speech-to-speech Dialogue Model with Frozen LLM
R.I.P.
👻
Ghosted
Indoor Sound Source Localization with Probabilistic Neural Network
R.I.P.
👻
Ghosted
The Jazz Transformer on the Front Line: Exploring the Shortcomings of AI-composed Music through Quantitative Measures
R.I.P.
👻
Ghosted
The Deterministic plus Stochastic Model of the Residual Signal and its Applications
R.I.P.
👻
Ghosted
TimbreTron: A WaveNet(CycleGAN(CQT(Audio))) Pipeline for Musical Timbre Transfer
R.I.P.
👻
Ghosted
A Survey on Audio Diffusion Models: Text To Speech Synthesis and Enhancement in Generative AI
R.I.P.
👻
Ghosted
Convolutional Recurrent Neural Networks for Bird Audio Detection
R.I.P.
👻
Ghosted
Convolutional Gated Recurrent Neural Network Incorporating Spatial Features for Audio Tagging
R.I.P.
👻
Ghosted
LauraGPT: Listen, Attend, Understand, and Regenerate Audio with GPT
R.I.P.
👻
Ghosted
Multi-microphone Complex Spectral Mapping for Utterance-wise and Continuous Speech Separation
R.I.P.
👻
Ghosted
AI Song Contest: Human-AI Co-Creation in Songwriting
R.I.P.
👻
Ghosted
MMM : Exploring Conditional Multi-Track Music Generation with the Transformer
R.I.P.
👻
Ghosted
Variational Autoencoders for Learning Latent Representations of Speech Emotion: A Preliminary Study
R.I.P.
👻
Ghosted
Virufy: Global Applicability of Crowdsourced and Clinical Datasets for AI Detection of COVID-19 from Cough
R.I.P.
👻
Ghosted
Deep Karaoke: Extracting Vocals from Musical Mixtures Using a Convolutional Deep Neural Network
R.I.P.
👻
Ghosted
A Deterministic plus Stochastic Model of the Residual Signal for Improved Parametric Speech Synthesis
R.I.P.
👻
Ghosted
Recognizing Multi-talker Speech with Permutation Invariant Training
R.I.P.
👻
Ghosted
Encoding Musical Style with Transformer Autoencoders
R.I.P.
👻
Ghosted
Deep Learning Based Phase Reconstruction for Speaker Separation: A Trigonometric Perspective
R.I.P.
👻
Ghosted
A Neural Parametric Singing Synthesizer
🌅
💤
Eternal Rest
Efficient Emotional Adaptation for Audio-Driven Talking-Head Generation
R.I.P.
👻
Ghosted
Sample-level CNN Architectures for Music Auto-tagging Using Raw Waveforms
R.I.P.
👻
Ghosted