| 101 | Speaker-Invariant Training via Adversarial Learning<br>Zhong Meng, Jinyu Li, ... (+6 more) | 👻 Ghosted | eess.AS | 115 | 8 years ago |
| 102 | AttS2S-VC: Sequence-to-Sequence Voice Conversion with Attention and Context Preservation Mechanisms<br>Kou Tanaka, Hirokazu Kameoka, ... (+2 more) | 👻 Ghosted | eess.AS | 114 | 7 years ago |
| 103 | Meta Learning for End-to-End Low-Resource Speech Recognition<br>Jui-Yang Hsu, Yuan-Jui Chen, Hung-yi Lee | 👻 Ghosted | cs.SD | 114 | 6 years ago |
| 104 | Connecting Speech Encoder and Large Language Model for ASR<br>Wenyi Yu, Changli Tang, ... (+7 more) | 👻 Ghosted | eess.AS | 114 | 2 years ago |
| 105 | End-to-End ASR-free Keyword Search from Speech<br>Kartik Audhkhasi, Andrew Rosenberg, ... (+3 more) | 👻 Ghosted | cs.CL | 113 | 9 years ago |
| 106 | Recurrent Neural Network Training with Dark Knowledge Transfer<br>Zhiyuan Tang, Dong Wang, Zhiyong Zhang | 👻 Ghosted | stat.ML | 112 | 11 years ago |
| 107 | Using Intelligent Reflecting Surfaces for Rank Improvement in MIMO Communications<br>Özgecan Özdogan, Emil Björnson, Erik G. Larsson | 👻 Ghosted | eess.SP | 110 | 6 years ago |
| 108 | Transformer-based Online CTC/attention End-to-End Speech Recognition Architecture<br>Haoran Miao, Gaofeng Cheng, ... (+3 more) | 👻 Ghosted | eess.AS | 109 | 6 years ago |
| 109 | Efficient keyword spotting using dilated convolutions and gating<br>Alice Coucke, Mohammed Chlieh, ... (+4 more) | 👻 Ghosted | cs.LG | 108 | 7 years ago |
| 110 | Self-supervised Learning for ECG-based Emotion Recognition<br>Pritam Sarkar, Ali Etemad | 👻 Ghosted | cs.LG | 108 | 6 years ago |
| 111 | MMSE precoder for massive MIMO using 1-bit quantization<br>Ovais Bin Usman, Hela Jedda, ... (+2 more) | 👻 Ghosted | cs.IT | 107 | 8 years ago |
| 112 | Quaternion Convolutional Neural Networks for Heterogeneous Image Processing<br>Titouan Parcollet, Mohamed Morchid, Georges Linarès | 👻 Ghosted | cs.CV | 106 | 7 years ago |
| 113 | Wave-Tacotron: Spectrogram-free end-to-end text-to-speech synthesis<br>Ron J. Weiss, RJ Skerry-Ryan, ... (+3 more) | 👻 Ghosted | cs.CL | 106 | 5 years ago |
| 114 | Learning Representations of Emotional Speech with Deep Convolutional Generative Adversarial Networks<br>Jonathan Chang, Stefan Scherer | 👻 Ghosted | cs.CL | 105 | 9 years ago |
| 115 | On the Compression of Recurrent Neural Networks with an Application to LVCSR acoustic modeling for Embedded Speech Recognition<br>Rohit Prabhavalkar, Ouais Alsharif, ... (+2 more) | 👻 Ghosted | cs.CL | 104 | 10 years ago |
| 116 | Conditional Teacher-Student Learning<br>Zhong Meng, Jinyu Li, ... (+2 more) | 👻 Ghosted | cs.LG | 104 | 7 years ago |
| 117 | Improving sequence-to-sequence speech recognition training with on-the-fly data augmentation<br>Thai-Son Nguyen, Sebastian Stueker, ... (+2 more) | 👻 Ghosted | eess.AS | 104 | 6 years ago |
| 118 | Evaluating Voice Conversion-based Privacy Protection against Informed Attackers<br>Brij Mohan Lal Srivastava, Nathalie Vauquier, ... (+4 more) | 👻 Ghosted | cs.CL | 104 | 6 years ago |
| 119 | Convolutional Neural Network Approach for EEG-based Emotion Recognition using Brain Connectivity and its Spatial Information<br>Seong-Eun Moon, Soobeom Jang, Jong-Seok Lee | 👻 Ghosted | cs.HC | 101 | 7 years ago |
| 120 | Correction of Automatic Speech Recognition with Transformer Sequence-to-sequence Model<br>Oleksii Hrinchuk, Mariya Popova, Boris Ginsburg | 👻 Ghosted | cs.CL | 101 | 6 years ago |
| 121 | Advancing Acoustic-to-Word CTC Model<br>Jinyu Li, Guoli Ye, ... (+3 more) | 👻 Ghosted | cs.CL | 100 | 8 years ago |
| 122 | Adversarial Teacher-Student Learning for Unsupervised Domain Adaptation<br>Zhong Meng, Jinyu Li, ... (+3 more) | 👻 Ghosted | eess.AS | 99 | 8 years ago |
| 123 | Deep Learning Based Phase Reconstruction for Speaker Separation: A Trigonometric Perspective<br>Zhong-Qiu Wang, Ke Tan, DeLiang Wang | 👻 Ghosted | cs.SD | 99 | 7 years ago |
| 124 | Sample-level CNN Architectures for Music Auto-tagging Using Raw Waveforms<br>Taejun Kim, Jongpil Lee, Juhan Nam | 👻 Ghosted | cs.SD | 98 | 8 years ago |
| 125 | Sequence-based Multi-lingual Low Resource Speech Recognition<br>Siddharth Dalmia, Ramon Sanabria, ... (+2 more) | 👻 Ghosted | cs.CL | 98 | 8 years ago |
| 126 | This dataset does not exist: training models from generated images<br>Victor Besnier, Himalaya Jain, ... (+3 more) | 👻 Ghosted | cs.CV | 98 | 6 years ago |
| 127 | LoRa Digital Receiver Analysis and Implementation<br>Reza Ghanaatian, Orion Afisiadis, ... (+2 more) | 👻 Ghosted | cs.IT | 97 | 7 years ago |
| 128 | Music Understanding LLaMA: Advancing Text-to-Music Generation with Question Answering and Captioning<br>Shansong Liu, Atin Sakkeer Hussain, ... (+2 more) | 👻 Ghosted | cs.SD | 96 | 2 years ago |
| 129 | Advances in All-Neural Speech Recognition<br>G. Zweig, C. Yu, ... (+2 more) | 👻 Ghosted | cs.CL | 95 | 9 years ago |
| 130 | Bootstrapping Graph Convolutional Neural Networks for Autism Spectrum Disorder Classification<br>Rushil Anirudh, Jayaraman J. Thiagarajan | 👻 Ghosted | stat.ML | 95 | 9 years ago |
| 131 | Short-segment heart sound classification using an ensemble of deep convolutional neural networks<br>Fuad Noman, Chee-Ming Ting, ... (+2 more) | 👻 Ghosted | cs.SD | 95 | 7 years ago |
| 132 | Generalized linear mixing model accounting for endmember variability<br>Tales Imbiriba, Ricardo Augusto Borsoi, José Carlos Moreira Bermudez | 👻 Ghosted | cs.CV | 94 | 8 years ago |
| 133 | Lip2AudSpec: Speech reconstruction from silent lip movements video<br>Hassan Akbari, Himani Arora, ... (+2 more) | 👻 Ghosted | cs.CV | 94 | 8 years ago |
| 134 | Adversarial Attacks on GMM i-vector based Speaker Verification Systems<br>Xu Li, Jinghua Zhong, ... (+4 more) | 👻 Ghosted | eess.AS | 93 | 6 years ago |
| 135 | A Recurrent Variational Autoencoder for Speech Enhancement<br>Simon Leglaive, Xavier Alameda-Pineda, ... (+2 more) | 👻 Ghosted | cs.LG | 91 | 6 years ago |
| 136 | Generating Synthetic Audio Data for Attention-Based Speech Recognition Systems<br>Nick Rossenbach, Albert Zeyer, ... (+2 more) | 👻 Ghosted | cs.CL | 91 | 6 years ago |
| 137 | StyleMelGAN: An Efficient High-Fidelity Adversarial Vocoder with Temporal Adaptive Normalization<br>Ahmed Mustafa, Nicola Pia, Guillaume Fuchs | 👻 Ghosted | eess.AS | 91 | 5 years ago |
| 138 | Self-supervised Text-independent Speaker Verification using Prototypical Momentum Contrastive Learning<br>Wei Xia, Chunlei Zhang, ... (+3 more) | 👻 Ghosted | eess.AS | 91 | 5 years ago |
| 139 | Deep Long Short-Term Memory Adaptive Beamforming Networks For Multichannel Robust Speech Recognition<br>Zhong Meng, Shinji Watanabe, ... (+2 more) | 👻 Ghosted | eess.AS | 90 | 8 years ago |
| 140 | Cycle-consistency training for end-to-end speech recognition<br>Takaaki Hori, Ramon Astudillo, ... (+4 more) | 👻 Ghosted | cs.CL | 90 | 7 years ago |
| 141 | How should we evaluate supervised hashing?<br>Alexandre Sablayrolles, Matthijs Douze, ... (+2 more) | 👻 Ghosted | cs.CV | 89 | 9 years ago |
| 142 | Investigation of enhanced Tacotron text-to-speech synthesis systems with self-attention for pitch accent language<br>Yusuke Yasuda, Xin Wang, ... (+2 more) | 👻 Ghosted | eess.AS | 89 | 7 years ago |
| 143 | Cascaded encoders for unifying streaming and non-streaming ASR<br>Arun Narayanan, Tara N. Sainath, ... (+6 more) | 👻 Ghosted | eess.AS | 88 | 5 years ago |
| 144 | Training Speech Recognition Models with Federated Learning: A Quality/Cost Framework<br>Dhruv Guliani, Francoise Beaufays, Giovanni Motta | 👻 Ghosted | cs.LG | 88 | 5 years ago |
| 145 | Learning Compact Recurrent Neural Networks<br>Zhiyun Lu, Vikas Sindhwani, Tara N. Sainath | 👻 Ghosted | cs.LG | 87 | 10 years ago |
| 146 | Lightweight and Efficient End-to-End Speech Recognition Using Low-Rank Transformer<br>Genta Indra Winata, Samuel Cahyawijaya, ... (+3 more) | 👻 Ghosted | cs.CL | 87 | 6 years ago |
| 147 | Forward Attention in Sequence-to-sequence Acoustic Modelling for Speech Synthesis<br>Jing-Xuan Zhang, Zhen-Hua Ling, Li-Rong Dai | 👻 Ghosted | cs.CL | 86 | 7 years ago |
| 148 | Attention-Augmented End-to-End Multi-Task Learning for Emotion Prediction from Speech<br>Zixing Zhang, Bingwen Wu, Bjoern Schuller | 👻 Ghosted | cs.CL | 86 | 7 years ago |
| 149 | End-to-End Monaural Multi-speaker ASR System without Pretraining<br>Xuankai Chang, Yanmin Qian, ... (+2 more) | 👻 Ghosted | cs.CL | 85 | 7 years ago |
| 150 | Advanced LSTM: A Study about Better Time Dependency Modeling in Emotion Recognition<br>Fei Tao, Gang Liu | 👻 Ghosted | cs.LG | 84 | 8 years ago |