| 51 |
End-to-End Automatic Speech Translation of Audiobooks
Alexandre Bérard, Laurent Besacier, ... (+2 more)
|
👻
Ghosted
|
cs.CL
|
206 |
8 years ago |
| 52 |
End-to-end Microphone Permutation and Number Invariant Multi-channel Speech Separation
Yi Luo, Zhuo Chen, ... (+2 more)
|
👻
Ghosted
|
eess.AS
|
206 |
6 years ago |
| 53 |
Europarl-ST: A Multilingual Corpus For Speech Translation Of Parliamentary Debates
Javier Iranzo-Sánchez, Joan Albert Silvestre-Cerdà, ... (+6 more)
|
👻
Ghosted
|
cs.CL
|
204 |
6 years ago |
| 54 |
Developing Real-time Streaming Transformer Transducer for Speech Recognition on Large-scale Dataset
Xie Chen, Yu Wu, ... (+3 more)
|
👻
Ghosted
|
cs.CL
|
203 |
5 years ago |
| 55 |
Personalized Speech recognition on mobile devices
Ian McGraw, Rohit Prabhavalkar, ... (+9 more)
|
👻
Ghosted
|
cs.CL
|
198 |
10 years ago |
| 56 |
Emformer: Efficient Memory Transformer Based Acoustic Model For Low Latency Streaming Speech Recognition
Yangyang Shi, Yongqiang Wang, ... (+6 more)
|
👻
Ghosted
|
cs.SD
|
195 |
5 years ago |
| 57 |
Replay and Synthetic Speech Detection with Res2net Architecture
Xu Li, Na Li, ... (+5 more)
|
👻
Ghosted
|
eess.AS
|
192 |
5 years ago |
| 58 |
Bias Mitigation Post-processing for Individual and Group Fairness
Pranay K. Lohia, Karthikeyan Natesan Ramamurthy, ... (+4 more)
|
👻
Ghosted
|
cs.LG
|
187 |
7 years ago |
| 59 |
Convolutional-Recurrent Neural Networks for Speech Enhancement
Han Zhao, Shuayb Zarar, ... (+2 more)
|
👻
Ghosted
|
cs.SD
|
182 |
8 years ago |
| 60 |
Age-Based Scheduling Policy for Federated Learning in Mobile Edge Networks
Howard H. Yang, Ahmed Arafa, ... (+2 more)
|
👻
Ghosted
|
cs.IT
|
182 |
6 years ago |
| 61 |
A Graph-CNN for 3D Point Cloud Classification
Yingxue Zhang, Michael Rabbat
|
👻
Ghosted
|
cs.CV
|
178 |
7 years ago |
| 62 |
Deep convolutional acoustic word embeddings using word-pair side information
Herman Kamper, Weiran Wang, Karen Livescu
|
👻
Ghosted
|
cs.CL
|
176 |
10 years ago |
| 63 |
Attention-Based Models for Text-Dependent Speaker Verification
F A Rezaur Rahman Chowdhury, Quan Wang, ... (+2 more)
|
👻
Ghosted
|
eess.AS
|
174 |
8 years ago |
| 64 |
A Comprehensive Study of Deep Bidirectional LSTM RNNs for Acoustic Modeling in Speech Recognition
Albert Zeyer, Patrick Doetsch, ... (+3 more)
|
👻
Ghosted
|
cs.NE
|
172 |
9 years ago |
| 65 |
Minimum Word Error Rate Training for Attention-based Sequence-to-Sequence Models
Rohit Prabhavalkar, Tara N. Sainath, ... (+5 more)
|
👻
Ghosted
|
cs.CL
|
171 |
8 years ago |
| 66 |
Leveraging Weakly Supervised Data to Improve End-to-End Speech-to-Text Translation
Ye Jia, Melvin Johnson, ... (+7 more)
|
👻
Ghosted
|
cs.CL
|
171 |
7 years ago |
| 67 |
Deep Clustering and Conventional Networks for Music Separation: Stronger Together
Yi Luo, Zhuo Chen, ... (+3 more)
|
👻
Ghosted
|
stat.ML
|
170 |
9 years ago |
| 68 |
PromptTTS: Controllable Text-to-Speech with Text Descriptions
Zhifang Guo, Yichong Leng, ... (+3 more)
|
👻
Ghosted
|
eess.AS
|
167 |
3 years ago |
| 69 |
AVA-ActiveSpeaker: An Audio-Visual Dataset for Active Speaker Detection
Joseph Roth, Sourish Chaudhuri, ... (+9 more)
|
👻
Ghosted
|
cs.CV
|
165 |
7 years ago |
| 70 |
Mellotron: Multispeaker expressive voice synthesis by conditioning on rhythm, pitch and global style tokens
Rafael Valle, Jason Li, ... (+2 more)
|
👻
Ghosted
|
cs.SD
|
162 |
6 years ago |
| 71 |
wav2letter++: The Fastest Open-source Speech Recognition System
Vineel Pratap, Awni Hannun, ... (+6 more)
|
👻
Ghosted
|
cs.CL
|
161 |
7 years ago |
| 72 |
RoIMix: Proposal-Fusion among Multiple Images for Underwater Object Detection
Wei-Hong Lin, Jia-Xing Zhong, ... (+3 more)
|
👻
Ghosted
|
cs.CV
|
160 |
6 years ago |
| 73 |
Ordered Reliability Bits Guessing Random Additive Noise Decoding
Ken R. Duffy
|
👻
Ghosted
|
cs.IT
|
156 |
6 years ago |
| 74 |
Robust and fine-grained prosody control of end-to-end speech synthesis
Younggun Lee, Taesu Kim
|
👻
Ghosted
|
cs.CL
|
155 |
7 years ago |
| 75 |
SpecAugment on Large Scale Datasets
Daniel S. Park, Yu Zhang, ... (+6 more)
|
👻
Ghosted
|
eess.AS
|
154 |
6 years ago |
| 76 |
Sound Event Detection Using Spatial Features and Convolutional Recurrent Neural Network
Sharath Adavanne, Pasi Pertilä, Tuomas Virtanen
|
👻
Ghosted
|
cs.SD
|
153 |
9 years ago |
| 77 |
Building competitive direct acoustics-to-word models for English conversational speech recognition
Kartik Audhkhasi, Brian Kingsbury, ... (+3 more)
|
👻
Ghosted
|
cs.CL
|
152 |
8 years ago |
| 78 |
A spelling correction model for end-to-end speech recognition
Jinxi Guo, Tara N. Sainath, Ron J. Weiss
|
👻
Ghosted
|
eess.AS
|
151 |
7 years ago |
| 79 |
ASR is all you need: cross-modal distillation for lip reading
Triantafyllos Afouras, Joon Son Chung, Andrew Zisserman
|
👻
Ghosted
|
cs.CV
|
149 |
6 years ago |
| 80 |
Trainable Frontend For Robust and Far-Field Keyword Spotting
Yuxuan Wang, Pascal Getreuer, ... (+3 more)
|
👻
Ghosted
|
cs.CL
|
147 |
9 years ago |
| 81 |
CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition
Linhao Dong, Bo Xu
|
👻
Ghosted
|
cs.CL
|
146 |
7 years ago |
| 82 |
Continuous Speech Separation with Conformer
Sanyuan Chen, Yu Wu, ... (+7 more)
|
👻
Ghosted
|
eess.AS
|
145 |
5 years ago |
| 83 |
A Co-Interactive Transformer for Joint Slot Filling and Intent Detection
Libo Qin, Tailu Liu, ... (+4 more)
|
👻
Ghosted
|
cs.CL
|
145 |
5 years ago |
| 84 |
High-quality nonparallel voice conversion based on cycle-consistent adversarial network
Fuming Fang, Junichi Yamagishi, ... (+2 more)
|
👻
Ghosted
|
eess.AS
|
144 |
8 years ago |
| 85 |
Strong Data Augmentation Sanitizes Poisoning and Backdoor Attacks Without an Accuracy Tradeoff
Eitan Borgnia, Valeriia Cherepanova, ... (+6 more)
|
👻
Ghosted
|
cs.CR
|
143 |
5 years ago |
| 86 |
Domain Adversarial Training for Accented Speech Recognition
Sining Sun, Ching-Feng Yeh, ... (+3 more)
|
👻
Ghosted
|
cs.CL
|
139 |
8 years ago |
| 87 |
Learning Sparse Graphs Under Smoothness Prior
Sundeep Prabhakar Chepuri, Sijia Liu, ... (+2 more)
|
👻
Ghosted
|
cs.LG
|
137 |
9 years ago |
| 88 |
Bytes are All You Need: End-to-End Multilingual Speech Recognition and Synthesis with Bytes
Bo Li, Yu Zhang, ... (+3 more)
|
👻
Ghosted
|
eess.AS
|
136 |
7 years ago |
| 89 |
A Comparison of deep learning methods for environmental sound
Juncheng Li, Wei Dai, ... (+3 more)
|
👻
Ghosted
|
cs.SD
|
135 |
9 years ago |
| 90 |
Speech Emotion Recognition with Dual-Sequence LSTM Architecture
Jianyou Wang, Michael Xue, ... (+4 more)
|
👻
Ghosted
|
eess.AS
|
132 |
6 years ago |
| 91 |
End-to-End Audio Visual Scene-Aware Dialog using Multimodal Attention-Based Video Features
Chiori Hori, Huda Alamri, ... (+11 more)
|
👻
Ghosted
|
cs.CL
|
131 |
7 years ago |
| 92 |
Perfect match: Improved cross-modal embeddings for audio-visual synchronisation
Soo-Whan Chung, Joon Son Chung, Hong-Goo Kang
|
👻
Ghosted
|
cs.CV
|
130 |
7 years ago |
| 93 |
Speech Emotion Recognition Using Multi-hop Attention Mechanism
Seunghyun Yoon, Seokhyun Byun, ... (+2 more)
|
👻
Ghosted
|
eess.AS
|
130 |
7 years ago |
| 94 |
Exposing GAN-generated Faces Using Inconsistent Corneal Specular Highlights
Shu Hu, Yuezun Li, Siwei Lyu
|
👻
Ghosted
|
cs.CV
|
129 |
5 years ago |
| 95 |
Statistical Speech Enhancement Based on Probabilistic Integration of Variational Autoencoder and Non-Negative Matrix Factorization
Yoshiaki Bando, Masato Mimura, ... (+3 more)
|
👻
Ghosted
|
cs.SD
|
128 |
8 years ago |
| 96 |
Learning Filterbanks from Raw Speech for Phone Recognition
Neil Zeghidour, Nicolas Usunier, ... (+4 more)
|
👻
Ghosted
|
cs.CL
|
127 |
8 years ago |
| 97 |
Random Projections through multiple optical scattering: Approximating kernels at the speed of light
Alaa Saade, Francesco Caltagirone, ... (+5 more)
|
👻
Ghosted
|
cs.ET
|
125 |
10 years ago |
| 98 |
Semi-Supervised Training for Improving Data Efficiency in End-to-End Speech Synthesis
Yu-An Chung, Yuxuan Wang, ... (+3 more)
|
👻
Ghosted
|
cs.CL
|
121 |
7 years ago |
| 99 |
End-To-End Visual Speech Recognition With LSTMs
Stavros Petridis, Zuwei Li, Maja Pantic
|
👻
Ghosted
|
cs.CV
|
120 |
9 years ago |
| 100 |
FedPrompt: Communication-Efficient and Privacy Preserving Prompt Tuning in Federated Learning
Haodong Zhao, Wei Du, ... (+3 more)
|
👻
Ghosted
|
cs.LG
|
116 |
3 years ago |