🏛️ The Audio & Speech Crypt
eess.AS: Where Audio & Speech papers rest without their code.
1388
Total Papers
1179
No Code
18
Twilight
191
Has Code
13.8%
Survival Rate
R.I.P.
👻
Ghosted
R.I.P.
👻
Ghosted
A Comparison of Modeling Units in Sequence-to-Sequence Speech Recognition with the Transformer on Mandarin Chinese
R.I.P.
👻
Ghosted
You Do Not Need More Data: Improving End-To-End Speech Recognition by Text-To-Speech Data Augmentation
R.I.P.
👻
Ghosted
Muse: Multi-modal target speaker extraction with visual cues
R.I.P.
👻
Ghosted
Diffusion-based Generative Speech Source Separation
R.I.P.
👻
Ghosted
Audio Retrieval with WavText5K and CLAP Training
R.I.P.
👻
Ghosted
Streaming Transformer-based Acoustic Models Using Self-attention with Augmented Memory
R.I.P.
👻
Ghosted
A Multi-Phase Gammatone Filterbank for Speech Separation via TasNet
R.I.P.
👻
Ghosted
Small-Footprint Keyword Spotting on Raw Audio Data with Sinc-Convolutions
R.I.P.
👻
Ghosted
Quaternion Convolutional Neural Networks for Detection and Localization of 3D Sound Events
R.I.P.
👻
Ghosted
VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching
R.I.P.
👻
Ghosted
PIANOTREE VAE: Structured Representation Learning for Polyphonic Music
R.I.P.
👻
Ghosted
Generative Adversarial Speaker Embedding Networks for Domain Robust End-to-End Speaker Verification
R.I.P.
👻
Ghosted
Low-resource expressive text-to-speech using data augmentation
R.I.P.
👻
Ghosted
Deep Learning based Emotion Recognition System Using Speech Features and Transcriptions
R.I.P.
👻
Ghosted
Grad-StyleSpeech: Any-speaker Adaptive Text-to-Speech Synthesis with Diffusion Models
R.I.P.
👻
Ghosted
Speaker-invariant Affective Representation Learning via Adversarial Training
R.I.P.
👻
Ghosted
DiPCo -- Dinner Party Corpus
R.I.P.
👻
Ghosted
The DIRHA-English corpus and related tasks for distant-speech recognition in domestic environments
R.I.P.
👻
Ghosted
EmoDiff: Intensity Controllable Emotional Text-to-Speech with Soft-Label Guidance
R.I.P.
👻
Ghosted
Multi-stage Speaker Extraction with Utterance and Frame-Level Reference Signals
R.I.P.
👻
Ghosted
Align-Refine: Non-Autoregressive Speech Recognition via Iterative Realignment
R.I.P.
👻
Ghosted
Confidence Estimation for Attention-based Sequence-to-sequence Models for Speech Recognition
R.I.P.
👻
Ghosted