| 151 |
Attentive Filtering Networks for Audio Replay Attack Detection
Cheng-I Lai, Alberto Abad, ... (+4 more)
|
👻
Ghosted
|
eess.AS
|
84 |
7 years ago |
| 152 |
Detecting Multiple Speech Disfluencies using a Deep Residual Network with Bidirectional Long Short-Term Memory
Tedd Kourkounakis, Amirhossein Hajavi, Ali Etemad
|
👻
Ghosted
|
eess.AS
|
84 |
6 years ago |
| 153 |
Speech Sentiment Analysis via Pre-trained Features from End-to-end ASR Models
Zhiyun Lu, Liangliang Cao, ... (+3 more)
|
👻
Ghosted
|
cs.CL
|
84 |
6 years ago |
| 154 |
End-to-End Joint Learning of Natural Language Understanding and Dialogue Manager
Xuesong Yang, Yun-Nung Chen, ... (+5 more)
|
👻
Ghosted
|
cs.CL
|
82 |
9 years ago |
| 155 |
Parsimonious Online Learning with Kernels via Sparse Projections in Function Space
Alec Koppel, Garrett Warnell, ... (+2 more)
|
👻
Ghosted
|
stat.ML
|
82 |
9 years ago |
| 156 |
Cross-lingual and Multilingual Speech Emotion Recognition on English and French
Michael Neumann, Ngoc Thang Vu
|
👻
Ghosted
|
cs.CL
|
81 |
8 years ago |
| 157 |
Scalable Mutual Information Estimation using Dependence Graphs
Morteza Noshad, Yu Zeng, Alfred O. Hero
|
👻
Ghosted
|
cs.IT
|
80 |
8 years ago |
| 158 |
A General Multi-Task Learning Framework to Leverage Text Data for Speech to Text Tasks
Yun Tang, Juan Pino, ... (+3 more)
|
👻
Ghosted
|
cs.CL
|
80 |
5 years ago |
| 159 |
Sound Source Localization in a Multipath Environment Using Convolutional Neural Networks
Eric L. Ferguson, Stefan B. Williams, Craig T. Jin
|
👻
Ghosted
|
cs.SD
|
79 |
8 years ago |
| 160 |
Improving Universal Sound Separation Using Sound Classification
Efthymios Tzinis, Scott Wisdom, ... (+3 more)
|
👻
Ghosted
|
cs.SD
|
79 |
6 years ago |
| 161 |
STC Anti-spoofing Systems for the ASVspoof 2015 Challenge
Sergey Novoselov, Alexandr Kozlov, ... (+3 more)
|
👻
Ghosted
|
cs.SD
|
78 |
10 years ago |
| 162 |
Knowledge Distillation for Small-footprint Highway Networks
Liang Lu, Michelle Guo, Steve Renals
|
👻
Ghosted
|
cs.CL
|
78 |
9 years ago |
| 163 |
RETURNN: The RWTH Extensible Training framework for Universal Recurrent Neural Networks
Patrick Doetsch, Albert Zeyer, ... (+4 more)
|
👻
Ghosted
|
cs.LG
|
78 |
9 years ago |
| 164 |
Emotion recognition by fusing time synchronous and time asynchronous representations
Wen Wu, Chao Zhang, Philip C. Woodland
|
👻
Ghosted
|
cs.CL
|
78 |
5 years ago |
| 165 |
A Probabilistic Interpretation of Sampling Theory of Graph Signals
Akshay Gadde, Antonio Ortega
|
👻
Ghosted
|
cs.LG
|
77 |
11 years ago |
| 166 |
Sketching for Large-Scale Learning of Mixture Models
Nicolas Keriven, Anthony Bourrier, ... (+2 more)
|
👻
Ghosted
|
cs.LG
|
77 |
10 years ago |
| 167 |
Breast density classification with deep convolutional neural networks
Nan Wu, Krzysztof J. Geras, ... (+7 more)
|
👻
Ghosted
|
cs.CV
|
77 |
8 years ago |
| 168 |
The CORAL+ Algorithm for Unsupervised Domain Adaptation of PLDA
Kong Aik Lee, Qiongqiong Wang, Takafumi Koshinaka
|
👻
Ghosted
|
cs.LG
|
77 |
7 years ago |
| 169 |
Synchronous Transformers for End-to-End Speech Recognition
Zhengkun Tian, Jiangyan Yi, ... (+4 more)
|
👻
Ghosted
|
eess.AS
|
77 |
6 years ago |
| 170 |
Attention Driven Fusion for Multi-Modal Emotion Recognition
Darshana Priyasad, Tharindu Fernando, ... (+3 more)
|
👻
Ghosted
|
eess.AS
|
77 |
5 years ago |
| 171 |
End-to-End Optimized Speech Coding with Deep Neural Networks
Srihari Kankanahalli
|
👻
Ghosted
|
cs.SD
|
74 |
8 years ago |
| 172 |
Generative Adversarial Source Separation
Cem Subakan, Paris Smaragdis
|
👻
Ghosted
|
cs.SD
|
74 |
8 years ago |
| 173 |
Filterbank design for end-to-end speech separation
Manuel Pariente, Samuele Cornell, ... (+2 more)
|
👻
Ghosted
|
cs.SD
|
74 |
6 years ago |
| 174 |
Encoder-decoder with Focus-mechanism for Sequence Labelling Based Spoken Language Understanding
Su Zhu, Kai Yu
|
👻
Ghosted
|
cs.CL
|
73 |
9 years ago |
| 175 |
Adversarial Advantage Actor-Critic Model for Task-Completion Dialogue Policy Learning
Baolin Peng, Xiujun Li, ... (+4 more)
|
👻
Ghosted
|
cs.CL
|
73 |
8 years ago |
| 176 |
Joint Modeling of Accents and Acoustics for Multi-Accent Speech Recognition
Xuesong Yang, Kartik Audhkhasi, ... (+4 more)
|
👻
Ghosted
|
cs.CL
|
73 |
8 years ago |
| 177 |
End-to-End Streaming Keyword Spotting
Alvarez Raziel, Park Hyun-Jin
|
👻
Ghosted
|
cs.CL
|
73 |
7 years ago |
| 178 |
Scaling Recurrent Neural Network Language Models
Will Williams, Niranjani Prasad, ... (+3 more)
|
👻
Ghosted
|
cs.CL
|
72 |
11 years ago |
| 179 |
Adversarial Inpainting of Medical Image Modalities
Karim Armanious, Youssef Mecky, ... (+2 more)
|
👻
Ghosted
|
cs.CV
|
72 |
7 years ago |
| 180 |
Modality Attention for End-to-End Audio-visual Speech Recognition
Pan Zhou, Wenwen Yang, ... (+3 more)
|
👻
Ghosted
|
cs.CL
|
72 |
7 years ago |
| 181 |
Truly unsupervised acoustic word embeddings using weak top-down constraints in encoder-decoder models
Herman Kamper
|
👻
Ghosted
|
cs.CL
|
71 |
7 years ago |
| 182 |
Analyzing ASR pretraining for low-resource speech-to-text translation
Mihaela C. Stoian, Sameer Bansal, Sharon Goldwater
|
👻
Ghosted
|
cs.CL
|
71 |
6 years ago |
| 183 |
Unsupervised Contrastive Learning of Sound Event Representations
Eduardo Fonseca, Diego Ortego, ... (+3 more)
|
👻
Ghosted
|
cs.SD
|
71 |
5 years ago |
| 184 |
Towards Language-Universal End-to-End Speech Recognition
Suyoun Kim, Michael L. Seltzer
|
👻
Ghosted
|
cs.CL
|
70 |
8 years ago |
| 185 |
Two-Step Sound Source Separation: Training on Learned Latent Targets
Efthymios Tzinis, Shrikant Venkataramani, ... (+3 more)
|
👻
Ghosted
|
cs.LG
|
70 |
6 years ago |
| 186 |
FPGA Based Implementation of Deep Neural Networks Using On-chip Memory Only
Jinhwan Park, Wonyong Sung
|
👻
Ghosted
|
cs.AR
|
69 |
10 years ago |
| 187 |
A comparison of recent waveform generation and acoustic modeling methods for neural-network-based speech synthesis
Xin Wang, Jaime Lorenzo-Trueba, ... (+3 more)
|
👻
Ghosted
|
eess.AS
|
69 |
8 years ago |
| 188 |
Character-Level Language Modeling with Hierarchical Recurrent Neural Networks
Kyuyeon Hwang, Wonyong Sung
|
👻
Ghosted
|
cs.LG
|
68 |
9 years ago |
| 189 |
Towards Audio to Scene Image Synthesis using Generative Adversarial Network
Chia-Hung Wan, Shun-Po Chuang, Hung-Yi Lee
|
👻
Ghosted
|
cs.CL
|
68 |
7 years ago |
| 190 |
Deep Joint Source-Channel Coding for Wireless Image Retrieval
Mikolaj Jankowski, Deniz Gunduz, Krystian Mikolajczyk
|
👻
Ghosted
|
cs.IT
|
68 |
6 years ago |
| 191 |
Unsupervised Domain Adaptation for Speech Recognition via Uncertainty Driven Self-Training
Sameer Khurana, Niko Moritz, ... (+2 more)
|
👻
Ghosted
|
cs.CL
|
68 |
5 years ago |
| 192 |
When BERT Meets Quantum Temporal Convolution Learning for Text Classification in Heterogeneous Computing
Chao-Han Huck Yang, Jun Qi, ... (+3 more)
|
👻
Ghosted
|
cs.CL
|
68 |
4 years ago |
| 193 |
Adversarial Speaker Verification
Zhong Meng, Yong Zhao, ... (+2 more)
|
👻
Ghosted
|
cs.SD
|
67 |
7 years ago |
| 194 |
Leveraging Unpaired Text Data for Training End-to-End Speech-to-Intent Systems
Yinghui Huang, Hong-Kwang Kuo, ... (+6 more)
|
👻
Ghosted
|
cs.CL
|
67 |
5 years ago |
| 195 |
Improved Mask-CTC for Non-Autoregressive End-to-End ASR
Yosuke Higuchi, Hirofumi Inaguma, ... (+3 more)
|
👻
Ghosted
|
eess.AS
|
67 |
5 years ago |
| 196 |
Exploring Speech Recognition, Translation, and Understanding with Discrete Speech Units: A Comparative Study
Xuankai Chang, Brian Yan, ... (+15 more)
|
👻
Ghosted
|
cs.CL
|
67 |
2 years ago |
| 197 |
Invariances and Data Augmentation for Supervised Music Transcription
John Thickstun, Zaid Harchaoui, ... (+2 more)
|
👻
Ghosted
|
stat.ML
|
66 |
8 years ago |
| 198 |
Muse: Multi-modal target speaker extraction with visual cues
Zexu Pan, Ruijie Tao, ... (+2 more)
|
👻
Ghosted
|
eess.AS
|
65 |
5 years ago |
| 199 |
Diffusion-based Generative Speech Source Separation
Robin Scheibler, Youna Ji, ... (+4 more)
|
👻
Ghosted
|
eess.AS
|
65 |
3 years ago |
| 200 |
Fixed-Point Performance Analysis of Recurrent Neural Networks
Sungho Shin, Kyuyeon Hwang, Wonyong Sung
|
👻
Ghosted
|
cs.LG
|
64 |
10 years ago |