🏛️ The Audio & Speech Crypt
eess.AS: Where Audio & Speech papers rest without their code.
1392
Total Papers
1179
No Code
18
Twilight
195
Has Code
14.0%
Survival Rate
R.I.P.
👻
Ghosted
R.I.P.
👻
Ghosted
Acoustic Model Adaptation from Raw Waveforms with SincNet
R.I.P.
👻
Ghosted
Many-to-Many Voice Conversion using Cycle-Consistent Variational Autoencoder with Multiple Decoders
R.I.P.
👻
Ghosted
Speaker-Targeted Audio-Visual Models for Speech Recognition in Cocktail-Party Environments
R.I.P.
👻
Ghosted
Meeting Transcription Using Virtual Microphone Arrays
R.I.P.
👻
Ghosted
Compression of Acoustic Event Detection Models with Low-rank Matrix Factorization and Quantization Training
R.I.P.
👻
Ghosted
Additional Shared Decoder on Siamese Multi-view Encoders for Learning Acoustic Word Embeddings
R.I.P.
👻
Ghosted
Generative x-vectors for text-independent speaker verification
R.I.P.
👻
Ghosted
Unsupervised Representation Learning of Speech for Dialect Identification
R.I.P.
👻
Ghosted
A study on speech enhancement using exponent-only floating point quantized neural network (EOFP-QNN)
R.I.P.
👻
Ghosted
Automatic context window composition for distant speech recognition
R.I.P.
👻
Ghosted
Investigating Zero-Shot Generalizability on Mandarin-English Code-Switched ASR and Speech-to-text Translation of Recent Foundation Models with Self-Supervision and Weak Supervision
R.I.P.
👻
Ghosted
End-to-End Continuous Speech Emotion Recognition in Real-life Customer Service Call Center Conversations
R.I.P.
👻
Ghosted
Augmenting Transformer-Transducer Based Speaker Change Detection With Token-Level Training Loss
R.I.P.
👻
Ghosted
DiffPhase: Generative Diffusion-based STFT Phase Retrieval
R.I.P.
👻
Ghosted
Hearing Lips in Noise: Universal Viseme-Phoneme Mapping and Transfer for Robust Audio-Visual Speech Recognition
R.I.P.
👻
Ghosted
Cross-Modal Global Interaction and Local Alignment for Audio-Visual Speech Recognition
R.I.P.
👻
Ghosted
Dyadic Speech-based Affect Recognition using DAMI-P2C Parent-child Multimodal Interaction Dataset
R.I.P.
👻
Ghosted
Speaker-Utterance Dual Attention for Speaker and Utterance Verification
R.I.P.
👻
Ghosted
Laughter Synthesis: Combining Seq2seq modeling with Transfer Learning
R.I.P.
👻
Ghosted
Peking Opera Synthesis via Duration Informed Attention Network
R.I.P.
👻
Ghosted
Streaming Language Identification using Combination of Acoustic Representations and ASR Hypotheses
R.I.P.
👻
Ghosted
Score-informed Networks for Music Performance Assessment
R.I.P.
👻
Ghosted