💀 The Wall of Shame

The most cited papers with no code. Sorted by the weight of their sins.

Page 1, showing 50 papers

# Paper Cause of Death Category Citations Published
1 SEGAN: Speech Enhancement Generative Adversarial Network
Santiago Pascual, Antonio Bonafonte, Joan Serrà
👻 Ghosted cs.LG 1.3K 9 years ago
2 ASVspoof 2019: Future Horizons in Spoofed and Fake Audio Detection
Massimiliano Todisco, Xin Wang, ... (+8 more)
👻 Ghosted eess.AS 736 7 years ago
3 The fifth 'CHiME' Speech Separation and Recognition Challenge: Dataset, task and baselines
Jon Barker, Shinji Watanabe, ... (+2 more)
👻 Ghosted cs.SD 714 8 years ago
4 Attention-Based Recurrent Neural Network Models for Joint Intent Detection and Slot Filling
Bing Liu, Ian Lane
👻 Ghosted cs.CL 708 9 years ago
5 Single-Channel Multi-Speaker Separation using Deep Clustering
Yusuf Isik, Jonathan Le Roux, ... (+3 more)
👻 Ghosted cs.LG 447 9 years ago
6 Fast and Accurate Recurrent Neural Network Acoustic Models for Speech Recognition
Haşim Sak, Andrew Senior, ... (+2 more)
👻 Ghosted cs.CL 441 10 years ago
7 An Unsupervised Autoregressive Model for Speech Representation Learning
Yu-An Chung, Wei-Ning Hsu, ... (+2 more)
🌅 Old Age cs.CL 425 7 years ago
8 VoiceFilter: Targeted Voice Separation by Speaker-Conditioned Spectrogram Masking
Quan Wang, Hannah Muckenhirn, ... (+8 more)
👻 Ghosted eess.AS 413 7 years ago
9 A Fully Convolutional Neural Network for Speech Enhancement
Se Rim Park, Jinwon Lee
👻 Ghosted cs.LG 391 9 years ago
10 Towards End-to-End Speech Recognition with Deep Convolutional Neural Networks
Ying Zhang, Mohammad Pezeshki, ... (+4 more)
👻 Ghosted cs.CL 383 9 years ago
11 Towards better decoding and language model integration in sequence to sequence models
Jan Chorowski, Navdeep Jaitly
👻 Ghosted cs.NE 381 9 years ago
12 English Conversational Telephone Speech Recognition by Humans and Machines
George Saon, Gakuto Kurata, ... (+10 more)
👻 Ghosted cs.CL 371 9 years ago
13 Sequence-to-Sequence Models Can Directly Translate Foreign Speech
Ron J. Weiss, Jan Chorowski, ... (+3 more)
👻 Ghosted cs.CL 363 9 years ago
14 Combining Residual Networks with LSTMs for Lipreading
Themos Stafylakis, Georgios Tzimiropoulos
👻 Ghosted cs.CV 335 9 years ago
15 Voice Conversion from Unaligned Corpora using Variational Autoencoding Wasserstein Generative Adversarial Networks
Chin-Cheng Hsu, Hsin-Te Hwang, ... (+3 more)
👻 Ghosted cs.CL 325 9 years ago
16 Neural Speech Recognizer: Acoustic-to-Word LSTM Model for Large Vocabulary Speech Recognition
Hagen Soltau, Hank Liao, Hasim Sak
👻 Ghosted cs.CL 316 9 years ago
17 Advances in Joint CTC-Attention based End-to-End Speech Recognition with a Deep CNN Encoder and RNN-LM
Takaaki Hori, Shinji Watanabe, ... (+2 more)
👻 Ghosted cs.CL 306 8 years ago
18 Cold Fusion: Training Seq2Seq Models Together with Language Models
Anuroop Sriram, Heewoo Jun, ... (+2 more)
👻 Ghosted cs.CL 301 8 years ago
19 ContextNet: Improving Convolutional Neural Networks for Automatic Speech Recognition with Global Context
Wei Han, Zhengdong Zhang, ... (+7 more)
👻 Ghosted eess.AS 298 6 years ago
20 STC Antispoofing Systems for the ASVspoof2019 Challenge
Galina Lavrentyeva, Sergey Novoselov, ... (+4 more)
👻 Ghosted cs.SD 286 7 years ago
21 Improved training of end-to-end attention models for speech recognition
Albert Zeyer, Kazuki Irie, ... (+2 more)
👻 Ghosted cs.CL 279 8 years ago
22 Jasper: An End-to-End Convolutional Neural Acoustic Model
Jason Li, Vitaly Lavrukhin, ... (+6 more)
👻 Ghosted eess.AS 278 7 years ago
23 Direct speech-to-speech translation with a sequence-to-sequence model
Ye Jia, Ron J. Weiss, ... (+5 more)
👻 Ghosted cs.CL 262 7 years ago
24 Joint Robust Voicing Detection and Pitch Estimation Based on Residual Harmonics
Thomas Drugman, Abeer Alwan
👻 Ghosted cs.SD 259 6 years ago
25 RWTH ASR Systems for LibriSpeech: Hybrid vs Attention -- w/o Data Augmentation
Christoph Lüscher, Eugen Beck, ... (+6 more)
👻 Ghosted cs.CL 240 7 years ago
26 Target-Speaker Voice Activity Detection: a Novel Approach for Multi-Speaker Diarization in a Dinner Party Scenario
Ivan Medennikov, Maxim Korenevsky, ... (+10 more)
👻 Ghosted eess.AS 236 6 years ago
27 Exploring wav2vec 2.0 on speaker verification and language identification
Zhiyun Fan, Meng Li, ... (+2 more)
👻 Ghosted cs.SD 231 5 years ago
28 Attentive Convolutional Neural Network based Speech Emotion Recognition: A Study on the Impact of Input Features, Signal Length, and Acted Speech
Michael Neumann, Ngoc Thang Vu
👻 Ghosted cs.CL 228 9 years ago
29 End-to-End Speaker Diarization for an Unknown Number of Speakers with Encoder-Decoder Based Attractors
Shota Horiguchi, Yusuke Fujita, ... (+3 more)
👻 Ghosted eess.AS 223 6 years ago
30 Conditional Generative Adversarial Networks for Speech Enhancement and Noise-Robust Speaker Verification
Daniel Michelsanti, Zheng-Hua Tan
👻 Ghosted eess.AS 221 8 years ago
31 The Second DIHARD Diarization Challenge: Dataset, task, and baselines
Neville Ryant, Kenneth Church, ... (+5 more)
👻 Ghosted eess.AS 196 6 years ago
32 Convolutional Recurrent Neural Networks for Small-Footprint Keyword Spotting
Sercan O. Arik, Markus Kliegl, ... (+6 more)
👻 Ghosted cs.CL 191 9 years ago
33 Audio Word2Vec: Unsupervised Learning of Audio Segment Representations using Sequence-to-sequence Autoencoder
Yu-An Chung, Chao-Chung Wu, ... (+3 more)
👻 Ghosted cs.SD 191 10 years ago
34 Language Modeling with Deep Transformers
Kazuki Irie, Albert Zeyer, ... (+2 more)
👻 Ghosted cs.CL 188 7 years ago
35 Quality-Net: An End-to-End Non-intrusive Speech Quality Assessment Model based on BLSTM
Szu-Wei Fu, Yu Tsao, ... (+2 more)
👻 Ghosted cs.SD 187 7 years ago
36 Speech2Vec: A Sequence-to-Sequence Framework for Learning Word Embeddings from Speech
Yu-An Chung, James Glass
👻 Ghosted cs.CL 187 8 years ago
37 Residual LSTM: Design of a Deep Recurrent Architecture for Distant Speech Recognition
Jaeyoung Kim, Mostafa El-Khamy, Jungwon Lee
👻 Ghosted cs.LG 187 9 years ago
38 Powerset multi-class cross entropy loss for neural speaker diarization
Alexis Plaquet, Hervé Bredin
👻 Ghosted cs.SD 185 2 years ago
39 Temporal Convolution for Real-time Keyword Spotting on Mobile Devices
Seungwoo Choi, Seokjun Seo, ... (+6 more)
👻 Ghosted cs.SD 179 7 years ago
40 ASSERT: Anti-Spoofing with Squeeze-Excitation and Residual neTworks
Cheng-I Lai, Nanxin Chen, ... (+2 more)
👻 Ghosted cs.CL 176 7 years ago
41 Glottal Closure and Opening Instant Detection from Speech Signals
Thomas Drugman, Thierry Dutoit
👻 Ghosted cs.SD 170 6 years ago
42 End-to-End Speech Translation with Knowledge Distillation
Yuchen Liu, Hao Xiong, ... (+5 more)
👻 Ghosted cs.CL 170 7 years ago
43 Very Deep Self-Attention Networks for End-to-End Speech Recognition
Ngoc-Quan Pham, Thai-Son Nguyen, ... (+4 more)
👻 Ghosted cs.CL 168 7 years ago
44 Sequence-to-Sequence Neural Net Models for Grapheme-to-Phoneme Conversion
Kaisheng Yao, Geoffrey Zweig
👻 Ghosted cs.CL 168 11 years ago
45 Large-Scale Visual Speech Recognition
Brendan Shillingford, Yannis Assael, ... (+13 more)
👻 Ghosted cs.CV 166 7 years ago
46 A Unified Deep Neural Network for Speaker and Language Recognition
Fred Richardson, Douglas Reynolds, Najim Dehak
👻 Ghosted cs.CL 163 11 years ago
47 Massively Multilingual ASR: 50 Languages, 1 Model, 1 Billion Parameters
Vineel Pratap, Anuroop Sriram, ... (+5 more)
👻 Ghosted eess.AS 159 5 years ago
48 Self-Attentional Acoustic Models
Matthias Sperber, Jan Niehues, ... (+3 more)
👻 Ghosted cs.CL 159 8 years ago
49 Two-Pass End-to-End Speech Recognition
Tara N. Sainath, Ruoming Pang, ... (+10 more)
👻 Ghosted cs.CL 158 6 years ago
50 AVLnet: Learning Audio-Visual Language Representations from Instructional Videos
Andrew Rouditchenko, Angie Boggust, ... (+12 more)
👻 Ghosted cs.CV 147 5 years ago