💀 The Wall of Shame

The most cited papers with no code. Sorted by the weight of their sins.

Page 2, showing 50 papers

| # | Paper | Authors | Cause of Death | Category | Citations | Published |
|---|---|---|---|---|---|---|
| 51 | End-to-End Automatic Speech Translation of Audiobooks | Alexandre Bérard, Laurent Besacier, ... (+2 more) | 👻 Ghosted | cs.CL | 206 | 8 years ago |
| 52 | End-to-end Microphone Permutation and Number Invariant Multi-channel Speech Separation | Yi Luo, Zhuo Chen, ... (+2 more) | 👻 Ghosted | eess.AS | 206 | 6 years ago |
| 53 | Europarl-ST: A Multilingual Corpus For Speech Translation Of Parliamentary Debates | Javier Iranzo-Sánchez, Joan Albert Silvestre-Cerdà, ... (+6 more) | 👻 Ghosted | cs.CL | 204 | 6 years ago |
| 54 | Developing Real-time Streaming Transformer Transducer for Speech Recognition on Large-scale Dataset | Xie Chen, Yu Wu, ... (+3 more) | 👻 Ghosted | cs.CL | 203 | 5 years ago |
| 55 | Personalized Speech recognition on mobile devices | Ian McGraw, Rohit Prabhavalkar, ... (+9 more) | 👻 Ghosted | cs.CL | 198 | 10 years ago |
| 56 | Emformer: Efficient Memory Transformer Based Acoustic Model For Low Latency Streaming Speech Recognition | Yangyang Shi, Yongqiang Wang, ... (+6 more) | 👻 Ghosted | cs.SD | 195 | 5 years ago |
| 57 | Replay and Synthetic Speech Detection with Res2net Architecture | Xu Li, Na Li, ... (+5 more) | 👻 Ghosted | eess.AS | 192 | 5 years ago |
| 58 | Bias Mitigation Post-processing for Individual and Group Fairness | Pranay K. Lohia, Karthikeyan Natesan Ramamurthy, ... (+4 more) | 👻 Ghosted | cs.LG | 187 | 7 years ago |
| 59 | Convolutional-Recurrent Neural Networks for Speech Enhancement | Han Zhao, Shuayb Zarar, ... (+2 more) | 👻 Ghosted | cs.SD | 182 | 8 years ago |
| 60 | Age-Based Scheduling Policy for Federated Learning in Mobile Edge Networks | Howard H. Yang, Ahmed Arafa, ... (+2 more) | 👻 Ghosted | cs.IT | 182 | 6 years ago |
| 61 | A Graph-CNN for 3D Point Cloud Classification | Yingxue Zhang, Michael Rabbat | 👻 Ghosted | cs.CV | 178 | 7 years ago |
| 62 | Deep convolutional acoustic word embeddings using word-pair side information | Herman Kamper, Weiran Wang, Karen Livescu | 👻 Ghosted | cs.CL | 176 | 10 years ago |
| 63 | Attention-Based Models for Text-Dependent Speaker Verification | F A Rezaur Rahman Chowdhury, Quan Wang, ... (+2 more) | 👻 Ghosted | eess.AS | 174 | 8 years ago |
| 64 | A Comprehensive Study of Deep Bidirectional LSTM RNNs for Acoustic Modeling in Speech Recognition | Albert Zeyer, Patrick Doetsch, ... (+3 more) | 👻 Ghosted | cs.NE | 172 | 9 years ago |
| 65 | Minimum Word Error Rate Training for Attention-based Sequence-to-Sequence Models | Rohit Prabhavalkar, Tara N. Sainath, ... (+5 more) | 👻 Ghosted | cs.CL | 171 | 8 years ago |
| 66 | Leveraging Weakly Supervised Data to Improve End-to-End Speech-to-Text Translation | Ye Jia, Melvin Johnson, ... (+7 more) | 👻 Ghosted | cs.CL | 171 | 7 years ago |
| 67 | Deep Clustering and Conventional Networks for Music Separation: Stronger Together | Yi Luo, Zhuo Chen, ... (+3 more) | 👻 Ghosted | stat.ML | 170 | 9 years ago |
| 68 | PromptTTS: Controllable Text-to-Speech with Text Descriptions | Zhifang Guo, Yichong Leng, ... (+3 more) | 👻 Ghosted | eess.AS | 167 | 3 years ago |
| 69 | AVA-ActiveSpeaker: An Audio-Visual Dataset for Active Speaker Detection | Joseph Roth, Sourish Chaudhuri, ... (+9 more) | 👻 Ghosted | cs.CV | 165 | 7 years ago |
| 70 | Mellotron: Multispeaker expressive voice synthesis by conditioning on rhythm, pitch and global style tokens | Rafael Valle, Jason Li, ... (+2 more) | 👻 Ghosted | cs.SD | 162 | 6 years ago |
| 71 | wav2letter++: The Fastest Open-source Speech Recognition System | Vineel Pratap, Awni Hannun, ... (+6 more) | 👻 Ghosted | cs.CL | 161 | 7 years ago |
| 72 | RoIMix: Proposal-Fusion among Multiple Images for Underwater Object Detection | Wei-Hong Lin, Jia-Xing Zhong, ... (+3 more) | 👻 Ghosted | cs.CV | 160 | 6 years ago |
| 73 | Ordered Reliability Bits Guessing Random Additive Noise Decoding | Ken R. Duffy | 👻 Ghosted | cs.IT | 156 | 6 years ago |
| 74 | Robust and fine-grained prosody control of end-to-end speech synthesis | Younggun Lee, Taesu Kim | 👻 Ghosted | cs.CL | 155 | 7 years ago |
| 75 | SpecAugment on Large Scale Datasets | Daniel S. Park, Yu Zhang, ... (+6 more) | 👻 Ghosted | eess.AS | 154 | 6 years ago |
| 76 | Sound Event Detection Using Spatial Features and Convolutional Recurrent Neural Network | Sharath Adavanne, Pasi Pertilä, Tuomas Virtanen | 👻 Ghosted | cs.SD | 153 | 9 years ago |
| 77 | Building competitive direct acoustics-to-word models for English conversational speech recognition | Kartik Audhkhasi, Brian Kingsbury, ... (+3 more) | 👻 Ghosted | cs.CL | 152 | 8 years ago |
| 78 | A spelling correction model for end-to-end speech recognition | Jinxi Guo, Tara N. Sainath, Ron J. Weiss | 👻 Ghosted | eess.AS | 151 | 7 years ago |
| 79 | ASR is all you need: cross-modal distillation for lip reading | Triantafyllos Afouras, Joon Son Chung, Andrew Zisserman | 👻 Ghosted | cs.CV | 149 | 6 years ago |
| 80 | Trainable Frontend For Robust and Far-Field Keyword Spotting | Yuxuan Wang, Pascal Getreuer, ... (+3 more) | 👻 Ghosted | cs.CL | 147 | 9 years ago |
| 81 | CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition | Linhao Dong, Bo Xu | 👻 Ghosted | cs.CL | 146 | 7 years ago |
| 82 | Continuous Speech Separation with Conformer | Sanyuan Chen, Yu Wu, ... (+7 more) | 👻 Ghosted | eess.AS | 145 | 5 years ago |
| 83 | A Co-Interactive Transformer for Joint Slot Filling and Intent Detection | Libo Qin, Tailu Liu, ... (+4 more) | 👻 Ghosted | cs.CL | 145 | 5 years ago |
| 84 | High-quality nonparallel voice conversion based on cycle-consistent adversarial network | Fuming Fang, Junichi Yamagishi, ... (+2 more) | 👻 Ghosted | eess.AS | 144 | 8 years ago |
| 85 | Strong Data Augmentation Sanitizes Poisoning and Backdoor Attacks Without an Accuracy Tradeoff | Eitan Borgnia, Valeriia Cherepanova, ... (+6 more) | 👻 Ghosted | cs.CR | 143 | 5 years ago |
| 86 | Domain Adversarial Training for Accented Speech Recognition | Sining Sun, Ching-Feng Yeh, ... (+3 more) | 👻 Ghosted | cs.CL | 139 | 8 years ago |
| 87 | Learning Sparse Graphs Under Smoothness Prior | Sundeep Prabhakar Chepuri, Sijia Liu, ... (+2 more) | 👻 Ghosted | cs.LG | 137 | 9 years ago |
| 88 | Bytes are All You Need: End-to-End Multilingual Speech Recognition and Synthesis with Bytes | Bo Li, Yu Zhang, ... (+3 more) | 👻 Ghosted | eess.AS | 136 | 7 years ago |
| 89 | A Comparison of deep learning methods for environmental sound | Juncheng Li, Wei Dai, ... (+3 more) | 👻 Ghosted | cs.SD | 135 | 9 years ago |
| 90 | Speech Emotion Recognition with Dual-Sequence LSTM Architecture | Jianyou Wang, Michael Xue, ... (+4 more) | 👻 Ghosted | eess.AS | 132 | 6 years ago |
| 91 | End-to-End Audio Visual Scene-Aware Dialog using Multimodal Attention-Based Video Features | Chiori Hori, Huda Alamri, ... (+11 more) | 👻 Ghosted | cs.CL | 131 | 7 years ago |
| 92 | Perfect match: Improved cross-modal embeddings for audio-visual synchronisation | Soo-Whan Chung, Joon Son Chung, Hong-Goo Kang | 👻 Ghosted | cs.CV | 130 | 7 years ago |
| 93 | Speech Emotion Recognition Using Multi-hop Attention Mechanism | Seunghyun Yoon, Seokhyun Byun, ... (+2 more) | 👻 Ghosted | eess.AS | 130 | 7 years ago |
| 94 | Exposing GAN-generated Faces Using Inconsistent Corneal Specular Highlights | Shu Hu, Yuezun Li, Siwei Lyu | 👻 Ghosted | cs.CV | 129 | 5 years ago |
| 95 | Statistical Speech Enhancement Based on Probabilistic Integration of Variational Autoencoder and Non-Negative Matrix Factorization | Yoshiaki Bando, Masato Mimura, ... (+3 more) | 👻 Ghosted | cs.SD | 128 | 8 years ago |
| 96 | Learning Filterbanks from Raw Speech for Phone Recognition | Neil Zeghidour, Nicolas Usunier, ... (+4 more) | 👻 Ghosted | cs.CL | 127 | 8 years ago |
| 97 | Random Projections through multiple optical scattering: Approximating kernels at the speed of light | Alaa Saade, Francesco Caltagirone, ... (+5 more) | 👻 Ghosted | cs.ET | 125 | 10 years ago |
| 98 | Semi-Supervised Training for Improving Data Efficiency in End-to-End Speech Synthesis | Yu-An Chung, Yuxuan Wang, ... (+3 more) | 👻 Ghosted | cs.CL | 121 | 7 years ago |
| 99 | End-To-End Visual Speech Recognition With LSTMs | Stavros Petridis, Zuwei Li, Maja Pantic | 👻 Ghosted | cs.CV | 120 | 9 years ago |
| 100 | FedPrompt: Communication-Efficient and Privacy Preserving Prompt Tuning in Federated Learning | Haodong Zhao, Wei Du, ... (+3 more) | 👻 Ghosted | cs.LG | 116 | 3 years ago |