🏛️ The Audio & Speech Crypt
eess.AS: Where Audio & Speech papers rest without their code.
1392
Total Papers
1179
No Code
18
Twilight
195
Has Code
14.0%
Survival Rate
R.I.P.
👻
Ghosted
R.I.P.
👻
Ghosted
Multi-Tones' Phase Coding (MTPC) of Interaural Time Difference by Spiking Neural Network
R.I.P.
👻
Ghosted
Deep Attention Fusion Feature for Speech Separation with End-to-End Post-filter Method
R.I.P.
👻
Ghosted
Multimodal active speaker detection and virtual cinematography for video conferencing
R.I.P.
👻
Ghosted
power-law nonlinearity with maximally uniform distribution criterion for improved neural network training in automatic speech recognition
R.I.P.
👻
Ghosted
Independent and automatic evaluation of acoustic-to-articulatory inversion models
R.I.P.
👻
Ghosted
MIDI-Sandwich: Multi-model Multi-task Hierarchical Conditional VAE-GAN networks for Symbolic Single-track Music Generation
R.I.P.
👻
Ghosted
Tied Hidden Factors in Neural Networks for End-to-End Speaker Recognition
R.I.P.
👻
Ghosted
Speech Retrieval-Augmented Generation without Automatic Speech Recognition
R.I.P.
👻
Ghosted
Failing Forward: Improving Generative Error Correction for ASR with Synthetic Data and Retrieval Augmentation
R.I.P.
👻
Ghosted
Detecting the presence of sperm whales echolocation clicks in noisy environments
R.I.P.
👻
Ghosted
Diffusion-Based Speech Enhancement in Matched and Mismatched Conditions Using a Heun-Based Sampler
R.I.P.
👻
Ghosted
Efficient Speech Quality Assessment using Self-supervised Framewise Embeddings
R.I.P.
👻
Ghosted
A unified one-shot prosody and speaker conversion system with self-supervised discrete speech units
R.I.P.
👻
Ghosted
Enhancing and Adversarial: Improve ASR with Speaker Labels
R.I.P.
👻
Ghosted
Text-to-speech synthesis based on latent variable conversion using diffusion probabilistic model and variational autoencoder
R.I.P.
👻
Ghosted
Accidental Learners: Spoken Language Identification in Multilingual Self-Supervised Models
R.I.P.
👻
Ghosted
Speech-text based multi-modal training with bidirectional attention for improved speech recognition
R.I.P.
👻
Ghosted
Unify and Conquer: How Phonetic Feature Representation Affects Polyglot Text-To-Speech (TTS)
R.I.P.
👻
Ghosted
AdvEst: Adversarial Perturbation Estimation to Classify and Detect Adversarial Attacks against Speaker Identification
R.I.P.
👻
Ghosted
Scalable Speech Enhancement with Dynamic Channel Pruning
R.I.P.
👻
Ghosted
On the Use of Self-Supervised Speech Representations in Spontaneous Speech Synthesis
R.I.P.
👻
Ghosted
Spectrum and Prosody Conversion for Cross-lingual Voice Conversion with CycleGAN
R.I.P.
👻
Ghosted