| 1 |
Listen, Attend and Spell
William Chan, Navdeep Jaitly, ... (+2 more)
|
👻
Ghosted
|
cs.CL
|
2.4K |
10 years ago |
| 2 |
Deep clustering: Discriminative embeddings for segmentation and separation
John R. Hershey, Zhuo Chen, ... (+2 more)
|
👻
Ghosted
|
cs.NE
|
1.4K |
10 years ago |
| 3 |
End-to-End Attention-based Large Vocabulary Speech Recognition
Dzmitry Bahdanau, Jan Chorowski, ... (+3 more)
|
👻
Ghosted
|
cs.CL
|
1.2K |
10 years ago |
| 4 |
State-of-the-art Speech Recognition With Sequence-to-Sequence Models
Chung-Cheng Chiu, Tara N. Sainath, ... (+12 more)
|
📚
The Cartographer
|
cs.CL
|
1.2K |
8 years ago |
| 5 |
Exposing Deep Fakes Using Inconsistent Head Poses
Xin Yang, Yuezun Li, Siwei Lyu
|
👻
Ghosted
|
cs.CV
|
1.0K |
7 years ago |
| 6 |
Joint CTC-Attention based End-to-End Speech Recognition using Multi-task Learning
Suyoun Kim, Takaaki Hori, Shinji Watanabe
|
👻
Ghosted
|
cs.CL
|
982 |
9 years ago |
| 7 |
Permutation Invariant Training of Deep Models for Speaker-Independent Multi-talker Speech Separation
Dong Yu, Morten Kolbæk, ... (+2 more)
|
👻
Ghosted
|
cs.CL
|
929 |
9 years ago |
| 8 |
Capsule-Forensics: Using Capsule Networks to Detect Forged Images and Videos
Huy H. Nguyen, Junichi Yamagishi, Isao Echizen
|
👻
Ghosted
|
cs.CV
|
713 |
7 years ago |
| 9 |
TasNet: time-domain audio separation network for real-time, single-channel speech separation
Yi Luo, Nima Mesgarani
|
👻
Ghosted
|
cs.SD
|
711 |
8 years ago |
| 10 |
Streaming End-to-end Speech Recognition For Mobile Devices
Yanzhang He, Tara N. Sainath, ... (+18 more)
|
👻
Ghosted
|
cs.CL
|
664 |
7 years ago |
| 11 |
End-to-End Text-Dependent Speaker Verification
Georg Heigold, Ignacio Moreno, ... (+2 more)
|
👻
Ghosted
|
cs.LG
|
607 |
10 years ago |
| 12 |
Convolutional Recurrent Neural Networks for Music Classification
Keunwoo Choi, George Fazekas, ... (+2 more)
|
👻
Ghosted
|
cs.NE
|
518 |
9 years ago |
| 13 |
LPCNet: Improving Neural Speech Synthesis Through Linear Prediction
Jean-Marc Valin, Jan Skoglund
|
👻
Ghosted
|
eess.AS
|
489 |
7 years ago |
| 14 |
The Microsoft 2017 Conversational Speech Recognition System
W. Xiong, L. Wu, ... (+4 more)
|
👻
Ghosted
|
cs.CL
|
481 |
8 years ago |
| 15 |
DNSMOS: A Non-Intrusive Perceptual Objective Speech Quality metric to evaluate Noise Suppressors
Chandan K A Reddy, Vishak Gopal, Ross Cutler
|
👻
Ghosted
|
cs.SD
|
461 |
5 years ago |
| 16 |
Deep attractor network for single-microphone speaker separation
Zhuo Chen, Yi Luo, Nima Mesgarani
|
👻
Ghosted
|
cs.SD
|
420 |
9 years ago |
| 17 |
Very Deep Convolutional Networks for End-to-End Speech Recognition
Yu Zhang, William Chan, Navdeep Jaitly
|
👻
Ghosted
|
cs.CL
|
419 |
9 years ago |
| 18 |
Deep Learning for Joint Source-Channel Coding of Text
Nariman Farsad, Milind Rao, Andrea Goldsmith
|
👻
Ghosted
|
cs.IT
|
418 |
8 years ago |
| 19 |
Mockingjay: Unsupervised Speech Representation Learning with Deep Bidirectional Transformer Encoders
Andy T. Liu, Shu-wen Yang, ... (+3 more)
|
👻
Ghosted
|
eess.AS
|
393 |
6 years ago |
| 20 |
Hierarchical Federated Learning Across Heterogeneous Cellular Networks
Mehdi Salehi Heydar Abad, Emre Ozfatura, ... (+2 more)
|
👻
Ghosted
|
cs.LG
|
366 |
6 years ago |
| 21 |
Utterance-level Aggregation For Speaker Recognition In The Wild
Weidi Xie, Arsha Nagrani, ... (+2 more)
|
👻
Ghosted
|
eess.AS
|
365 |
7 years ago |
| 22 |
Speaker Diarization with LSTM
Quan Wang, Carlton Downey, ... (+3 more)
|
👻
Ghosted
|
eess.AS
|
341 |
8 years ago |
| 23 |
Recurrent Neural Networks for Polyphonic Sound Event Detection in Real Life Recordings
Giambattista Parascandolo, Heikki Huttunen, Tuomas Virtanen
|
👻
Ghosted
|
cs.SD
|
334 |
10 years ago |
| 24 |
Capsule Networks for Brain Tumor Classification based on MRI Images and Course Tumor Boundaries
Parnian Afshar, Konstantinos N. Plataniotis, Arash Mohammadi
|
👻
Ghosted
|
cs.CV
|
314 |
7 years ago |
| 25 |
Federated Learning for Keyword Spotting
David Leroy, Alice Coucke, ... (+3 more)
|
👻
Ghosted
|
eess.AS
|
310 |
7 years ago |
| 26 |
Highway Long Short-Term Memory RNNs for Distant Speech Recognition
Yu Zhang, Guoguo Chen, ... (+4 more)
|
👻
Ghosted
|
cs.NE
|
295 |
10 years ago |
| 27 |
Beamforming Optimization for Intelligent Reflecting Surface with Discrete Phase Shifts
Qingqing Wu, Rui Zhang
|
👻
Ghosted
|
cs.IT
|
293 |
7 years ago |
| 28 |
Learning to Invert: Signal Recovery via Deep Convolutional Networks
Ali Mousavi, Richard G. Baraniuk
|
👻
Ghosted
|
stat.ML
|
293 |
9 years ago |
| 29 |
The Microsoft 2016 Conversational Speech Recognition System
W. Xiong, J. Droppo, ... (+6 more)
|
👻
Ghosted
|
cs.CL
|
291 |
9 years ago |
| 30 |
Multilingual Speech Recognition With A Single End-To-End Model
Shubham Toshniwal, Tara N. Sainath, ... (+5 more)
|
👻
Ghosted
|
eess.AS
|
283 |
8 years ago |
| 31 |
Compressed Sensing Based Multi-User Millimeter Wave Systems: How Many Measurements Are Needed?
Ahmed Alkhateeb, Geert Leus, Robert W. Heath
|
👻
Ghosted
|
cs.IT
|
281 |
11 years ago |
| 32 |
Achievable Rate maximization by Passive Intelligent Mirrors
Chongwen Huang, Alessio Zappone, ... (+2 more)
|
👻
Ghosted
|
cs.IT
|
276 |
7 years ago |
| 33 |
An analysis of incorporating an external language model into a sequence-to-sequence model
Anjuli Kannan, Yonghui Wu, ... (+4 more)
|
👻
Ghosted
|
eess.AS
|
275 |
8 years ago |
| 34 |
Yedrouj-Net: An efficient CNN for spatial steganalysis
Mehdi Yedroudj, Frederic Comby, Marc Chaumont
|
👻
Ghosted
|
cs.CV
|
264 |
8 years ago |
| 35 |
Transformer-based Acoustic Modeling for Hybrid Speech Recognition
Yongqiang Wang, Abdelrahman Mohamed, ... (+11 more)
|
👻
Ghosted
|
cs.CL
|
259 |
6 years ago |
| 36 |
Self-Training for End-to-End Speech Recognition
Jacob Kahn, Ann Lee, Awni Hannun
|
👻
Ghosted
|
cs.CL
|
254 |
6 years ago |
| 37 |
Deep Residual Learning for Small-Footprint Keyword Spotting
Raphael Tang, Jimmy Lin
|
👻
Ghosted
|
cs.CL
|
253 |
8 years ago |
| 38 |
Seen and Unseen emotional style transfer for voice conversion with a new emotional speech dataset
Kun Zhou, Berrak Sisman, ... (+2 more)
|
👻
Ghosted
|
cs.SD
|
245 |
5 years ago |
| 39 |
Learning latent representations for style control and transfer in end-to-end speech synthesis
Ya-Jie Zhang, Shifeng Pan, ... (+2 more)
|
👻
Ghosted
|
cs.CL
|
242 |
7 years ago |
| 40 |
APE-GAN: Adversarial Perturbation Elimination with GAN
Shiwei Shen, Guoqing Jin, ... (+2 more)
|
👻
Ghosted
|
cs.CV
|
242 |
8 years ago |
| 41 |
Deep Multimodal Learning for Audio-Visual Speech Recognition
Youssef Mroueh, Etienne Marcheret, Vaibhava Goel
|
👻
Ghosted
|
cs.CL
|
242 |
11 years ago |
| 42 |
CN-CELEB: a challenging Chinese speaker recognition dataset
Yue Fan, Jiawen Kang, ... (+8 more)
|
👻
Ghosted
|
eess.AS
|
241 |
6 years ago |
| 43 |
Towards end-to-end spoken language understanding
Dmitriy Serdyuk, Yongqiang Wang, ... (+4 more)
|
👻
Ghosted
|
cs.CL
|
241 |
8 years ago |
| 44 |
The PyTorch-Kaldi Speech Recognition Toolkit
Mirco Ravanelli, Titouan Parcollet, Yoshua Bengio
|
👻
Ghosted
|
eess.AS
|
236 |
7 years ago |
| 45 |
Fully Supervised Speaker Diarization
Aonan Zhang, Quan Wang, ... (+3 more)
|
👻
Ghosted
|
eess.AS
|
229 |
7 years ago |
| 46 |
A Hardware Architecture for Reconfigurable Intelligent Surfaces with Minimal Active Elements for Explicit Channel Estimation
George C. Alexandropoulos, Evangelos Vlachos
|
👻
Ghosted
|
cs.IT
|
225 |
6 years ago |
| 47 |
Very Deep Multilingual Convolutional Neural Networks for LVCSR
Tom Sercu, Christian Puhrsch, ... (+2 more)
|
👻
Ghosted
|
cs.CL
|
225 |
10 years ago |
| 48 |
Lipreading with Long Short-Term Memory
Michael Wand, Jan Koutník, Jürgen Schmidhuber
|
👻
Ghosted
|
cs.CV
|
223 |
10 years ago |
| 49 |
Exploring Speech Enhancement with Generative Adversarial Networks for Robust Speech Recognition
Chris Donahue, Bo Li, Rohit Prabhavalkar
|
👻
Ghosted
|
cs.SD
|
220 |
8 years ago |
| 50 |
Fooling End-to-end Speaker Verification by Adversarial Examples
Felix Kreuk, Yossi Adi, ... (+2 more)
|
👻
Ghosted
|
cs.LG
|
218 |
8 years ago |