🏛️ The Sound Crypt
cs.SD: Where Sound papers rest without their code.
2437
Total Papers
1844
No Code
37
Twilight
556
Has Code
22.8%
Survival Rate
R.I.P.
👻
Ghosted
R.I.P.
👻
Ghosted
BigEAR: Inferring the Ambient and Emotional Correlates from Smartphone-based Acoustic Big Data
R.I.P.
👻
Ghosted
CLIPSonic: Text-to-Audio Synthesis with Unlabeled Videos and Pretrained Language-Vision Models
R.I.P.
👻
Ghosted
AdaDurIAN: Few-shot Adaptation for Neural Text-to-Speech with DurIAN
R.I.P.
👻
Ghosted
The Bach Doodle: Approachable music composition with machine learning at scale
R.I.P.
👻
Ghosted
MuSE-ing on the Impact of Utterance Ordering On Crowdsourced Emotion Annotations
R.I.P.
👻
Ghosted
Improved Chord Recognition by Combining Duration and Harmonic Language Models
R.I.P.
👻
Ghosted
Generating music with sentiment using Transformer-GANs
R.I.P.
👻
Ghosted
Unified Mandarin TTS Front-end Based on Distilled BERT Model
R.I.P.
👻
Ghosted
Towards Robust Neural Vocoding for Speech Generation: A Survey
R.I.P.
👻
Ghosted
STC Speaker Recognition Systems for the VOiCES From a Distance Challenge
R.I.P.
👻
Ghosted
A Hybrid Approach with Multi-channel I-Vectors and Convolutional Neural Networks for Acoustic Scene Classification
R.I.P.
👻
Ghosted
Probabilistic Binary-Mask Cocktail-Party Source Separation in a Convolutional Deep Neural Network
R.I.P.
👻
Ghosted
ERNIE-Music: Text-to-Waveform Music Generation with Diffusion Models
R.I.P.
👻
Ghosted
Fall Detection from Audios with Audio Transformers
R.I.P.
👻
Ghosted
A Unified Audio-Visual Learning Framework for Localization, Separation, and Recognition
R.I.P.
👻
Ghosted
Between Homomorphic Signal Processing and Deep Neural Networks: Constructing Deep Algorithms for Polyphonic Music Transcription
R.I.P.
👻
Ghosted
Histogram of gradients of Time-Frequency Representations for Audio scene detection
R.I.P.
👻
Ghosted
Source localization and denoising: a perspective from the TDOA space
R.I.P.
👻
Ghosted
Multi-source Domain Adaptation for Text-independent Forensic Speaker Recognition
R.I.P.
👻
Ghosted
ParaTTS: Learning Linguistic and Prosodic Cross-sentence Information in Paragraph-based TTS
R.I.P.
👻
Ghosted
Multimodal Fish Feeding Intensity Assessment in Aquaculture
R.I.P.
👻
Ghosted
V2Meow: Meowing to the Visual Beat via Video-to-Music Generation
R.I.P.
👻
Ghosted