💀 The Wall of Shame

The most cited papers with no code. Sorted by the weight of their sins.

Page 5, showing 50 papers

| # | Paper | Authors | Cause of Death | Category | Citations | Published |
|---|---|---|---|---|---|---|
| 201 | Speech To Semantics: Improve ASR and NLU Jointly via All-Neural Interfaces | Milind Rao, Anirudh Raju, ... (+3 more) | 👻 Ghosted | cs.CL | 44 | 5 years ago |
| 202 | DNN driven Speaker Independent Audio-Visual Mask Estimation for Speech Separation | Mandar Gogate, Ahsan Adeel, ... (+3 more) | 👻 Ghosted | cs.SD | 43 | 7 years ago |
| 203 | Listen Attentively, and Spell Once: Whole Sentence Generation via a Non-Autoregressive Architecture for Low-Latency Speech Recognition | Ye Bai, Jiangyan Yi, ... (+4 more) | 👻 Ghosted | eess.AS | 43 | 6 years ago |
| 204 | Super-Human Performance in Online Low-latency Recognition of Conversational Speech | Thai-Son Nguyen, Sebastian Stueker, Alex Waibel | 👻 Ghosted | cs.CV | 43 | 5 years ago |
| 205 | Transformer VQ-VAE for Unsupervised Unit Discovery and Speech Synthesis: ZeroSpeech 2020 Challenge | Andros Tjandra, Sakriani Sakti, Satoshi Nakamura | 👻 Ghosted | cs.CL | 42 | 6 years ago |
| 206 | Self-Supervised Representations Improve End-to-End Speech Translation | Anne Wu, Changhan Wang, ... (+2 more) | 👻 Ghosted | eess.AS | 42 | 6 years ago |
| 207 | Contextualized Attention-based Knowledge Transfer for Spoken Conversational Question Answering | Chenyu You, Nuo Chen, Yuexian Zou | 👻 Ghosted | cs.CL | 42 | 5 years ago |
| 208 | Capturing Long-term Temporal Dependencies with Convolutional Networks for Continuous Emotion Recognition | Soheil Khorram, Zakaria Aldeneh, ... (+3 more) | 👻 Ghosted | cs.SD | 41 | 8 years ago |
| 209 | Embedding-Based Speaker Adaptive Training of Deep Neural Networks | Xiaodong Cui, Vaibhava Goel, George Saon | 👻 Ghosted | cs.CL | 41 | 8 years ago |
| 210 | Conditional End-to-End Audio Transforms | Albert Haque, Michelle Guo, Prateek Verma | 👻 Ghosted | cs.SD | 41 | 8 years ago |
| 211 | Analyzing Phonetic and Graphemic Representations in End-to-End Automatic Speech Recognition | Yonatan Belinkov, Ahmed Ali, James Glass | 👻 Ghosted | cs.CL | 41 | 6 years ago |
| 212 | Relative Positional Encoding for Speech Recognition and Direct Translation | Ngoc-Quan Pham, Thanh-Le Ha, ... (+6 more) | 👻 Ghosted | eess.AS | 41 | 6 years ago |
| 213 | Speaker Recognition for Children's Speech | Saeid Safavi, Maryam Najafian, ... (+4 more) | 👻 Ghosted | cs.SD | 40 | 9 years ago |
| 214 | Visualization and Interpretation of Latent Spaces for Controlling Expressive Speech Synthesis through Audio Analysis | Noé Tits, Fengna Wang, ... (+3 more) | 👻 Ghosted | cs.CL | 40 | 7 years ago |
| 215 | Minimum Bayes Risk Training of RNN-Transducer for End-to-End Speech Recognition | Chao Weng, Chengzhu Yu, ... (+3 more) | 👻 Ghosted | cs.CL | 40 | 6 years ago |
| 216 | Investigating Generative Adversarial Networks based Speech Dereverberation for Robust Speech Recognition | Ke Wang, Junbo Zhang, ... (+4 more) | 👻 Ghosted | cs.SD | 39 | 8 years ago |
| 217 | Investigating Speech Features for Continuous Turn-Taking Prediction Using LSTMs | Matthew Roddy, Gabriel Skantze, Naomi Harte | 👻 Ghosted | cs.CL | 39 | 7 years ago |
| 218 | Learn Spelling from Teachers: Transferring Knowledge from Language Models to Sequence-to-Sequence Speech Recognition | Ye Bai, Jiangyan Yi, ... (+3 more) | 👻 Ghosted | eess.AS | 39 | 6 years ago |
| 219 | Deep speech inpainting of time-frequency masks | Mikolaj Kegler, Pierre Beckmann, Milos Cernak | 👻 Ghosted | cs.SD | 39 | 6 years ago |
| 220 | JDI-T: Jointly trained Duration Informed Transformer for Text-To-Speech without Explicit Alignment | Dan Lim, Won Jang, ... (+4 more) | 👻 Ghosted | eess.AS | 39 | 6 years ago |
| 221 | PyChain: A Fully Parallelized PyTorch Implementation of LF-MMI for End-to-End ASR | Yiwen Shao, Yiming Wang, ... (+2 more) | 👻 Ghosted | eess.AS | 39 | 6 years ago |
| 222 | Dialogue Session Segmentation by Embedding-Enhanced TextTiling | Yiping Song, Lili Mou, ... (+5 more) | 👻 Ghosted | cs.CL | 38 | 9 years ago |
| 223 | Analysis of Deep Clustering as Preprocessing for Automatic Speech Recognition of Sparsely Overlapping Speech | Tobias Menne, Ilya Sklyar, ... (+2 more) | 👻 Ghosted | cs.SD | 38 | 7 years ago |
| 224 | Auxiliary Interference Speaker Loss for Target-Speaker Speech Recognition | Naoyuki Kanda, Shota Horiguchi, ... (+4 more) | 👻 Ghosted | cs.CL | 38 | 6 years ago |
| 225 | Speaker Adaptation for Attention-Based End-to-End Speech Recognition | Zhong Meng, Yashesh Gaur, ... (+2 more) | 👻 Ghosted | cs.CL | 38 | 6 years ago |
| 226 | Towards unsupervised phone and word segmentation using self-supervised vector-quantized neural networks | Herman Kamper, Benjamin van Niekerk | 👻 Ghosted | cs.CL | 38 | 5 years ago |
| 227 | Multi-Modal Data Augmentation for End-to-End ASR | Adithya Renduchintala, Shuoyang Ding, ... (+2 more) | 👻 Ghosted | cs.CL | 37 | 8 years ago |
| 228 | Enhancing Monotonic Multihead Attention for Streaming ASR | Hirofumi Inaguma, Masato Mimura, Tatsuya Kawahara | 👻 Ghosted | eess.AS | 36 | 6 years ago |
| 229 | ASR error management for improving spoken language understanding | Edwin Simonnet, Sahar Ghannay, ... (+3 more) | 👻 Ghosted | cs.CL | 35 | 9 years ago |
| 230 | Adversarial Feature-Mapping for Speech Enhancement | Zhong Meng, Jinyu Li, ... (+3 more) | 👻 Ghosted | eess.AS | 35 | 7 years ago |
| 231 | Multi-Reference Neural TTS Stylization with Adversarial Cycle Consistency | Matt Whitehill, Shuang Ma, ... (+2 more) | 👻 Ghosted | cs.LG | 35 | 6 years ago |
| 232 | Lite Audio-Visual Speech Enhancement | Shang-Yi Chuang, Yu Tsao, ... (+2 more) | 👻 Ghosted | eess.AS | 35 | 6 years ago |
| 233 | Language-specific Characteristic Assistance for Code-switching Speech Recognition | Tongtong Song, Qiang Xu, ... (+6 more) | 👻 Ghosted | cs.CL | 35 | 3 years ago |
| 234 | Speech Pseudonymisation Assessment Using Voice Similarity Matrices | Paul-Gauthier Noé, Jean-François Bonastre, ... (+4 more) | 👻 Ghosted | eess.AS | 34 | 5 years ago |
| 235 | Deep versus Wide: An Analysis of Student Architectures for Task-Agnostic Knowledge Distillation of Self-Supervised Speech Models | Takanori Ashihara, Takafumi Moriya, ... (+2 more) | 👻 Ghosted | cs.CL | 34 | 3 years ago |
| 236 | Streaming End-to-End Multilingual Speech Recognition with Joint Language Identification | Chao Zhang, Bo Li, ... (+5 more) | 👻 Ghosted | eess.AS | 34 | 3 years ago |
| 237 | Joint Learning of Correlated Sequence Labelling Tasks Using Bidirectional Recurrent Neural Networks | Vardaan Pahuja, Anirban Laha, ... (+4 more) | 👻 Ghosted | cs.CL | 33 | 9 years ago |
| 238 | Subword and Crossword Units for CTC Acoustic Models | Thomas Zenkel, Ramon Sanabria, ... (+2 more) | 👻 Ghosted | cs.CL | 33 | 8 years ago |
| 239 | Cumulative Adaptation for BLSTM Acoustic Models | Markus Kitza, Pavel Golik, ... (+2 more) | 👻 Ghosted | cs.CL | 33 | 7 years ago |
| 240 | Transfer Learning from Audio-Visual Grounding to Speech Recognition | Wei-Ning Hsu, David Harwath, James Glass | 👻 Ghosted | cs.CL | 33 | 6 years ago |
| 241 | DARTS-ASR: Differentiable Architecture Search for Multilingual Speech Recognition and Adaptation | Yi-Chen Chen, Jui-Yang Hsu, ... (+2 more) | 👻 Ghosted | eess.AS | 33 | 6 years ago |
| 242 | The Privacy ZEBRA: Zero Evidence Biometric Recognition Assessment | Andreas Nautsch, Jose Patino, ... (+6 more) | 👻 Ghosted | cs.CR | 33 | 6 years ago |
| 243 | Improving Cross-Lingual Transfer Learning for End-to-End Speech Recognition with Speech Translation | Changhan Wang, Juan Pino, Jiatao Gu | 👻 Ghosted | eess.AS | 33 | 6 years ago |
| 244 | Automatic Speech Recognition Benchmark for Air-Traffic Communications | Juan Zuluaga-Gomez, Petr Motlicek, ... (+3 more) | 👻 Ghosted | cs.CL | 33 | 6 years ago |
| 245 | Adversarial Disentanglement of Speaker Representation for Attribute-Driven Privacy Preservation | Paul-Gauthier Noé, Mohammad Mohammadamini, ... (+4 more) | 👻 Ghosted | eess.AS | 33 | 5 years ago |
| 246 | Are disentangled representations all you need to build speaker anonymization systems? | Pierre Champion, Denis Jouvet, Anthony Larcher | 👻 Ghosted | cs.SD | 33 | 3 years ago |
| 247 | COSMIC: Data Efficient Instruction-tuning For Speech In-Context Learning | Jing Pan, Jian Wu, ... (+5 more) | 👻 Ghosted | cs.CL | 33 | 2 years ago |
| 248 | Contaminated speech training methods for robust DNN-HMM distant speech recognition | Mirco Ravanelli, Maurizio Omologo | 👻 Ghosted | eess.AS | 32 | 8 years ago |
| 249 | Learning Acoustic Word Embeddings with Temporal Context for Query-by-Example Speech Search | Yougen Yuan, Cheung-Chi Leung, ... (+4 more) | 👻 Ghosted | cs.CL | 32 | 8 years ago |
| 250 | Building a Unified Code-Switching ASR System for South African Languages | Emre Yılmaz, Astik Biswas, ... (+3 more) | 👻 Ghosted | cs.CL | 32 | 7 years ago |