💀 The Wall of Shame

The most cited papers with no code. Sorted by the weight of their sins.

Page 4, showing 50 papers

# Paper Cause of Death Category Citations Published
151 Converting Anyone's Emotion: Towards Speaker-Independent Emotional Voice Conversion
Kun Zhou, Berrak Sisman, ... (+2 more)
👻 Ghosted cs.SD 62 6 years ago
152 Exploiting Multi-Modal Features From Pre-trained Networks for Alzheimer's Dementia Recognition
Junghyun Koo, Jie Hwan Lee, ... (+3 more)
👻 Ghosted cs.SD 62 5 years ago
153 Sequential Convolutional Neural Networks for Slot Filling in Spoken Language Understanding
Ngoc Thang Vu
👻 Ghosted cs.CL 61 9 years ago
154 Low-Latency Sequence-to-Sequence Speech Recognition and Translation by Partial Hypothesis Selection
Danni Liu, Gerasimos Spanakis, Jan Niehues
👻 Ghosted cs.CL 60 6 years ago
155 Unsupervised acoustic unit discovery for speech synthesis using discrete latent-variable neural networks
Ryan Eloff, André Nortje, ... (+8 more)
👻 Ghosted cs.CL 59 7 years ago
156 Machine Speech Chain with One-shot Speaker Adaptation
Andros Tjandra, Sakriani Sakti, Satoshi Nakamura
👻 Ghosted cs.CL 58 8 years ago
157 Curriculum-based transfer learning for an effective end-to-end spoken language understanding and domain portability
Antoine Caubrière, Natalia Tomashenko, ... (+4 more)
👻 Ghosted cs.CL 58 7 years ago
158 The DKU Replay Detection System for the ASVspoof 2019 Challenge: On Data Augmentation, Feature Representation, Classification, and Fusion
Weicheng Cai, Haiwei Wu, ... (+2 more)
👻 Ghosted eess.AS 58 6 years ago
159 Class LM and word mapping for contextual biasing in End-to-End ASR
Rongqing Huang, Ossama Abdel-hamid, ... (+2 more)
👻 Ghosted cs.CL 58 5 years ago
160 Diff-HierVC: Diffusion-based Hierarchical Voice Conversion with Robust Pitch Generation and Masked Prior for Zero-shot Speaker Adaptation
Ha-Yeong Choi, Sang-Hoon Lee, Seong-Whan Lee
👻 Ghosted eess.AS 58 2 years ago
161 Optimizing expected word error rate via sampling for speech recognition
Matt Shannon
👻 Ghosted cs.CL 57 9 years ago
162 Joint training framework for text-to-speech and voice conversion using multi-source Tacotron and WaveNet
Mingyang Zhang, Xin Wang, ... (+3 more)
👻 Ghosted eess.AS 57 7 years ago
163 On the efficient representation and execution of deep acoustic models
Raziel Alvarez, Rohit Prabhavalkar, Anton Bakhtin
👻 Ghosted cs.LG 56 9 years ago
164 Improving speech recognition by revising gated recurrent units
Mirco Ravanelli, Philemon Brakel, ... (+2 more)
👻 Ghosted cs.CL 56 8 years ago
165 A Multi-Discriminator CycleGAN for Unsupervised Non-Parallel Speech Domain Adaptation
Ehsan Hosseini-Asl, Yingbo Zhou, ... (+2 more)
👻 Ghosted cs.CL 56 8 years ago
166 Punctuation Prediction Model for Conversational Speech
Piotr Żelasko, Piotr Szymański, ... (+4 more)
👻 Ghosted cs.CL 56 7 years ago
167 Efficient minimum word error rate training of RNN-Transducer for end-to-end speech recognition
Jinxi Guo, Gautam Tiwari, ... (+5 more)
👻 Ghosted eess.AS 56 5 years ago
168 Distilling the Knowledge of BERT for Sequence-to-Sequence ASR
Hayato Futami, Hirofumi Inaguma, ... (+4 more)
👻 Ghosted cs.CL 56 5 years ago
169 SPEAK YOUR MIND! Towards Imagined Speech Recognition With Hierarchical Deep Learning
Pramit Saha, Muhammad Abdul-Mageed, Sidney Fels
👻 Ghosted cs.LG 55 7 years ago
170 Spike-Triggered Non-Autoregressive Transformer for End-to-End Speech Recognition
Zhengkun Tian, Jiangyan Yi, ... (+4 more)
👻 Ghosted eess.AS 55 6 years ago
171 Comparing Natural Language Processing Techniques for Alzheimer's Dementia Prediction in Spontaneous Speech
Thomas Searle, Zina Ibrahim, Richard Dobson
👻 Ghosted cs.LG 55 6 years ago
172 Maximum a Posteriori Adaptation of Network Parameters in Deep Models
Zhen Huang, Sabato Marco Siniscalchi, ... (+3 more)
👻 Ghosted cs.LG 54 11 years ago
173 Speech-XLNet: Unsupervised Acoustic Model Pretraining For Self-Attention Networks
Xingchen Song, Guangsen Wang, ... (+5 more)
👻 Ghosted cs.CL 54 6 years ago
174 Joint Learning of Domain Classification and Out-of-Domain Detection with Dynamic Class Weighting for Satisficing False Acceptance Rates
Joo-Kyung Kim, Young-Bum Kim
👻 Ghosted cs.CL 53 7 years ago
175 Semantic Mask for Transformer based End-to-End Speech Recognition
Chengyi Wang, Yu Wu, ... (+8 more)
👻 Ghosted cs.CL 53 6 years ago
176 A New Training Pipeline for an Improved Neural Transducer
Albert Zeyer, André Merboldt, ... (+2 more)
👻 Ghosted eess.AS 53 6 years ago
177 Adapt-and-Adjust: Overcoming the Long-Tail Problem of Multilingual Speech Recognition
Genta Indra Winata, Guangsen Wang, ... (+2 more)
👻 Ghosted cs.CL 53 5 years ago
178 The PRIORI Emotion Dataset: Linking Mood to Emotion Detected In-the-Wild
Soheil Khorram, Mimansa Jaiswal, ... (+3 more)
👻 Ghosted cs.HC 52 8 years ago
179 Cycle-Consistent Speech Enhancement
Zhong Meng, Jinyu Li, ... (+3 more)
👻 Ghosted eess.AS 52 7 years ago
180 Conv-Transformer Transducer: Low Latency, Low Frame Rate, Streamable End-to-End Speech Recognition
Wenyong Huang, Wenchao Hu, ... (+2 more)
👻 Ghosted eess.AS 52 5 years ago
181 On Enhancing Speech Emotion Recognition using Generative Adversarial Networks
Saurabh Sahu, Rahul Gupta, Carol Espy-Wilson
👻 Ghosted cs.CL 51 8 years ago
182 Completely Unsupervised Phoneme Recognition by Adversarially Learning Mapping Relationships from Audio Embeddings
Da-Rong Liu, Kuan-Yu Chen, ... (+2 more)
👻 Ghosted cs.CL 50 8 years ago
183 Device-directed Utterance Detection
Sri Harish Mallidi, Roland Maas, ... (+4 more)
👻 Ghosted cs.CL 50 7 years ago
184 Generative adversarial network-based glottal waveform model for statistical parametric speech synthesis
Bajibabu Bollepalli, Lauri Juvela, Paavo Alku
👻 Ghosted eess.AS 49 7 years ago
185 A New GAN-based End-to-End TTS Training Algorithm
Haohan Guo, Frank K. Soong, ... (+2 more)
👻 Ghosted cs.CL 49 7 years ago
186 An Online Attention-based Model for Speech Recognition
Ruchao Fan, Pan Zhou, ... (+3 more)
👻 Ghosted cs.CL 48 7 years ago
187 Guiding CTC Posterior Spike Timings for Improved Posterior Fusion and Knowledge Distillation
Gakuto Kurata, Kartik Audhkhasi
👻 Ghosted cs.CL 48 7 years ago
188 An End-to-End Mispronunciation Detection System for L2 English Speech Leveraging Novel Anti-Phone Modeling
Bi-Cheng Yan, Meng-Che Wu, ... (+2 more)
👻 Ghosted eess.AS 48 6 years ago
189 Structured-based Curriculum Learning for End-to-end English-Japanese Speech Translation
Takatomo Kano, Sakriani Sakti, Satoshi Nakamura
👻 Ghosted cs.CL 46 8 years ago
190 Study of Semi-supervised Approaches to Improving English-Mandarin Code-Switching Speech Recognition
Pengcheng Guo, Haihua Xu, ... (+2 more)
👻 Ghosted cs.CL 46 8 years ago
191 Toward Fairness in Speech Recognition: Discovery and mitigation of performance disparities
Pranav Dheram, Murugesan Ramakrishnan, ... (+7 more)
👻 Ghosted cs.CL 46 3 years ago
192 Improved training for online end-to-end speech recognition systems
Suyoun Kim, Michael L. Seltzer, ... (+2 more)
👻 Ghosted cs.CL 45 8 years ago
193 Disfluencies and Human Speech Transcription Errors
Vicky Zayats, Trang Tran, ... (+3 more)
👻 Ghosted cs.CL 45 7 years ago
194 Language learning using Speech to Image retrieval
Danny Merkx, Stefan L. Frank, Mirjam Ernestus
👻 Ghosted cs.CL 45 6 years ago
195 Exploring Transformers for Large-Scale Speech Recognition
Liang Lu, Changliang Liu, ... (+2 more)
👻 Ghosted eess.AS 45 6 years ago
196 Multi-talker ASR for an unknown number of sources: Joint training of source counting, separation and ASR
Thilo von Neumann, Christoph Boeddeker, ... (+5 more)
👻 Ghosted eess.AS 45 6 years ago
197 Multi-level Fusion of Wav2vec 2.0 and BERT for Multimodal Emotion Recognition
Zihan Zhao, Yanfeng Wang, Yu Wang
👻 Ghosted cs.CL 45 3 years ago
198 Advances in Very Deep Convolutional Neural Networks for LVCSR
Tom Sercu, Vaibhava Goel
👻 Ghosted cs.CL 44 10 years ago
199 Improving Speaker-Independent Lipreading with Domain-Adversarial Training
Michael Wand, Juergen Schmidhuber
👻 Ghosted cs.CV 44 8 years ago
200 Comparison of Decoding Strategies for CTC Acoustic Models
Thomas Zenkel, Ramon Sanabria, ... (+5 more)
👻 Ghosted cs.CL 44 8 years ago