🏛️ The Audio & Speech Crypt
eess.AS: Where Audio & Speech papers rest without their code.
Total Papers: 1388
No Code: 1179
Twilight: 18
Has Code: 191
Survival Rate: 13.8%
R.I.P.
👻
Ghosted
Unsupervised adversarial domain adaptation for acoustic scene classification
R.I.P.
👻
Ghosted
Listen Attentively, and Spell Once: Whole Sentence Generation via a Non-Autoregressive Architecture for Low-Latency Speech Recognition
R.I.P.
👻
Ghosted
Emotional Voice Conversion using Multitask Learning with Text-to-speech
R.I.P.
👻
Ghosted
Contextual Speech Recognition with Difficult Negative Training Examples
R.I.P.
👻
Ghosted
WaveCycleGAN: Synthetic-to-natural speech waveform conversion using cycle-consistent adversarial networks
R.I.P.
👻
Ghosted
SpeechLMScore: Evaluating speech generation using speech language model
R.I.P.
👻
Ghosted
Masked Modeling Duo: Learning Representations by Encouraging Both Networks to Model the Input
R.I.P.
👻
Ghosted
CASS-NAT: CTC Alignment-based Single Step Non-autoregressive Transformer for Speech Recognition
R.I.P.
👻
Ghosted
Self-Supervised Representations Improve End-to-End Speech Translation
R.I.P.
👻
Ghosted
Maximum Voiced Frequency Estimation: Exploiting Amplitude and Phase Spectra
R.I.P.
👻
Ghosted
FoleyGen: Visually-Guided Audio Generation
R.I.P.
👻
Ghosted
Hierarchical Prosody Modeling for Non-Autoregressive Speech Synthesis
R.I.P.
👻
Ghosted
TaL: a synchronised multi-speaker corpus of ultrasound tongue imaging, audio, and lip videos
R.I.P.
👻
Ghosted
Relative Positional Encoding for Speech Recognition and Direct Translation
R.I.P.
👻
Ghosted
Unsupervised Speech Recognition via Segmental Empirical Output Distribution Matching
R.I.P.
👻
Ghosted
Modeling of nonlinear audio effects with end-to-end deep neural networks
R.I.P.
👻
Ghosted
Hybrid CTC-Attention based End-to-End Speech Recognition using Subword Units
R.I.P.
👻
Ghosted
End-to-End Multimodal Speech Recognition
R.I.P.
👻
Ghosted
S3T: Self-Supervised Pre-training with Swin Transformer for Music Classification
R.I.P.
👻
Ghosted
Universal MelGAN: A Robust Neural Vocoder for High-Fidelity Waveform Generation in Multiple Domains
R.I.P.
👻
Ghosted
Tie Your Embeddings Down: Cross-Modal Latent Spaces for End-to-end Spoken Language Understanding
R.I.P.
👻
Ghosted
Multistream CNN for Robust Acoustic Modeling
R.I.P.
👻
Ghosted