| 301 | End-to-End Feedback Loss in Speech Chain Framework via Straight-Through Estimator / Andros Tjandra, Sakriani Sakti, Satoshi Nakamura | 👻 Ghosted | cs.CL | 45 | 7 years ago |
| 302 | A Recurrent Graph Neural Network for Multi-Relational Data / Vassilis N. Ioannidis, Antonio G. Marques, Georgios B. Giannakis | 👻 Ghosted | cs.LG | 45 | 7 years ago |
| 303 | Similarity Learning for Authorship Verification in Social Media / Benedikt Boenninghoff, Robert M. Nickel, ... (+2 more) | 👻 Ghosted | cs.CL | 45 | 6 years ago |
| 304 | End-to-End Speaker Diarization as Post-Processing / Shota Horiguchi, Paola Garcia, ... (+3 more) | 👻 Ghosted | eess.AS | 45 | 5 years ago |
| 305 | Evidence of Vocal Tract Articulation in Self-Supervised Learning of Speech / Cheol Jun Cho, Peter Wu, ... (+2 more) | 👻 Ghosted | eess.AS | 45 | 3 years ago |
| 306 | Learning From Yourself: A Self-Distillation Method for Fake Speech Detection / Jun Xue, Cunhang Fan, ... (+5 more) | 👻 Ghosted | cs.SD | 45 | 3 years ago |
| 307 | AMC-Net: An Effective Network for Automatic Modulation Classification / Jiawei Zhang, Tiantian Wang, ... (+2 more) | 👻 Ghosted | eess.SP | 45 | 3 years ago |
| 308 | Deep Multimodal Learning for Emotion Recognition in Spoken Language / Yue Gu, Shuhong Chen, Ivan Marsic | 👻 Ghosted | cs.CL | 44 | 8 years ago |
| 309 | Extracting Domain Invariant Features by Unsupervised Learning for Robust Automatic Speech Recognition / Wei-Ning Hsu, James Glass | 👻 Ghosted | cs.CL | 44 | 8 years ago |
| 310 | Towards Unsupervised Speech-to-Text Translation / Yu-An Chung, Wei-Hung Weng, ... (+2 more) | 👻 Ghosted | cs.CL | 44 | 7 years ago |
| 311 | Transfer learning of language-independent end-to-end ASR with language model fusion / Hirofumi Inaguma, Jaejin Cho, ... (+3 more) | 👻 Ghosted | cs.CL | 44 | 7 years ago |
| 312 | Deep geometric knowledge distillation with graphs / Carlos Lassance, Myriam Bontonou, ... (+4 more) | 👻 Ghosted | cs.LG | 44 | 6 years ago |
| 313 | Dynamic Sparsity Neural Networks for Automatic Speech Recognition / Zhaofeng Wu, Ding Zhao, ... (+4 more) | 👻 Ghosted | eess.AS | 44 | 6 years ago |
| 314 | Visual Prompting for Adversarial Robustness / Aochuan Chen, Peter Lorenz, ... (+3 more) | 👻 Ghosted | cs.CV | 44 | 3 years ago |
| 315 | Egocentric Activity Recognition with Multimodal Fisher Vector / Sibo Song, Ngai-Man Cheung, ... (+3 more) | 👻 Ghosted | cs.MM | 43 | 10 years ago |
| 316 | Son of Zorn's Lemma: Targeted Style Transfer Using Instance-aware Semantic Segmentation / Carlos Castillo, Soham De, ... (+4 more) | 👻 Ghosted | cs.CV | 43 | 9 years ago |
| 317 | Revisiting the problem of audio-based hit song prediction using convolutional neural networks / Li-Chia Yang, Szu-Yu Chou, ... (+3 more) | 👻 Ghosted | cs.SD | 43 | 9 years ago |
| 318 | Visual Features for Context-Aware Speech Recognition / Abhinav Gupta, Yajie Miao, ... (+2 more) | 👻 Ghosted | cs.CL | 43 | 8 years ago |
| 319 | Dynamic Temporal Alignment of Speech to Lips / Tavi Halperin, Ariel Ephrat, Shmuel Peleg | 👻 Ghosted | cs.CV | 43 | 7 years ago |
| 320 | Geometry of Deep Learning for Magnetic Resonance Fingerprinting / Mohammad Golbabaee, Dongdong Chen, ... (+3 more) | 👻 Ghosted | cs.LG | 43 | 7 years ago |
| 321 | Contextual Speech Recognition with Difficult Negative Training Examples / Uri Alon, Golan Pundak, Tara N. Sainath | 👻 Ghosted | eess.AS | 43 | 7 years ago |
| 322 | To Reverse the Gradient or Not: An Empirical Comparison of Adversarial and Multi-task Learning in Speech Recognition / Yossi Adi, Neil Zeghidour, ... (+4 more) | 👻 Ghosted | cs.LG | 43 | 7 years ago |
| 323 | Emotional Voice Conversion using Multitask Learning with Text-to-speech / Tae-Ho Kim, Sungjae Cho, ... (+3 more) | 👻 Ghosted | eess.AS | 43 | 6 years ago |
| 324 | Streaming Simultaneous Speech Translation with Augmented Memory Transformer / Xutai Ma, Yongqiang Wang, ... (+3 more) | 👻 Ghosted | cs.CL | 43 | 5 years ago |
| 325 | Masked Modeling Duo: Learning Representations by Encouraging Both Networks to Model the Input / Daisuke Niizumi, Daiki Takeuchi, ... (+3 more) | 👻 Ghosted | eess.AS | 43 | 3 years ago |
| 326 | SpeechLMScore: Evaluating speech generation using speech language model / Soumi Maiti, Yifan Peng, ... (+2 more) | 👻 Ghosted | eess.AS | 43 | 3 years ago |
| 327 | Distributed Gradient Descent with Coded Partial Gradient Computations / Emre Ozfatura, Sennur Ulukus, Deniz Gunduz | 👻 Ghosted | cs.LG | 42 | 7 years ago |
| 328 | Nose, eyes and ears: Head pose estimation by locating facial keypoints / Aryaman Gupta, Kalpit Thakkar, ... (+2 more) | 👻 Ghosted | cs.CV | 42 | 7 years ago |
| 329 | SNDCNN: Self-normalizing deep CNNs with scaled exponential linear units for speech recognition / Zhen Huang, Tim Ng, ... (+4 more) | 👻 Ghosted | cs.LG | 42 | 6 years ago |
| 330 | CASS-NAT: CTC Alignment-based Single Step Non-autoregressive Transformer for Speech Recognition / Ruchao Fan, Wei Chu, ... (+2 more) | 👻 Ghosted | eess.AS | 42 | 5 years ago |
| 331 | PPG-based singing voice conversion with adversarial representation learning / Zhonghao Li, Benlai Tang, ... (+5 more) | 👻 Ghosted | cs.SD | 42 | 5 years ago |
| 332 | Cross-Domain Sentiment Classification with Contrastive Learning and Mutual Information Maximization / Tian Li, Xiang Chen, ... (+3 more) | 👻 Ghosted | cs.CL | 42 | 5 years ago |
| 333 | Toward Universal Text-to-Music Retrieval / SeungHeon Doh, Minz Won, ... (+2 more) | 👻 Ghosted | cs.IR | 42 | 3 years ago |
| 334 | Generative AI-aided Joint Training-free Secure Semantic Communications via Multi-modal Prompts / Hongyang Du, Guangyuan Liu, ... (+6 more) | 👻 Ghosted | eess.IV | 42 | 2 years ago |
| 335 | Language-Oriented Communication with Semantic Coding and Knowledge Distillation for Text-to-Image Generation / Hyelin Nam, Jihong Park, ... (+3 more) | 👻 Ghosted | eess.SP | 42 | 2 years ago |
| 336 | Pixel-Superpixel Contrastive Learning and Pseudo-Label Correction for Hyperspectral Image Clustering / Renxiang Guan, Zihao Li, ... (+2 more) | 👻 Ghosted | cs.CV | 42 | 2 years ago |
| 337 | A context-aware matching game for user association in wireless small cell networks / Nima Namvar, Walid Saad, ... (+2 more) | 👻 Ghosted | cs.NI | 41 | 10 years ago |
| 338 | End-to-End Multimodal Speech Recognition / Shruti Palaskar, Ramon Sanabria, Florian Metze | 👻 Ghosted | eess.AS | 41 | 8 years ago |
| 339 | Deep Transfer Learning for EEG-based Brain Computer Interface / Chuanqi Tan, Fuchun Sun, Wenchang Zhang | 👻 Ghosted | cs.CV | 41 | 7 years ago |
| 340 | A Learning-Based Framework for Line-Spectra Super-resolution / Gautier Izacard, Brett Bernstein, Carlos Fernandez-Granda | 👻 Ghosted | cs.LG | 41 | 7 years ago |
| 341 | Balanced Binary Neural Networks with Gated Residual / Mingzhu Shen, Xianglong Liu, ... (+2 more) | 👻 Ghosted | cs.CV | 41 | 6 years ago |
| 342 | Federated Neuromorphic Learning of Spiking Neural Networks for Low-Power Edge Intelligence / Nicolas Skatchkovsky, Hyeryung Jang, Osvaldo Simeone | 👻 Ghosted | cs.LG | 41 | 6 years ago |
| 343 | Non-Autoregressive Transformer ASR with CTC-Enhanced Decoder Input / Xingchen Song, Zhiyong Wu, ... (+4 more) | 👻 Ghosted | cs.SD | 41 | 5 years ago |
| 344 | S3T: Self-Supervised Pre-training with Swin Transformer for Music Classification / Hang Zhao, Chen Zhang, ... (+3 more) | 👻 Ghosted | eess.AS | 41 | 4 years ago |
| 345 | AV-SepFormer: Cross-Attention SepFormer for Audio-Visual Target Speaker Extraction / Jiuxin Lin, Xinyu Cai, ... (+8 more) | 👻 Ghosted | cs.MM | 41 | 2 years ago |
| 346 | A network of deep neural networks for distant speech recognition / Mirco Ravanelli, Philemon Brakel, ... (+2 more) | 👻 Ghosted | cs.CL | 40 | 9 years ago |
| 347 | Improving End-to-End Speech Recognition with Policy Learning / Yingbo Zhou, Caiming Xiong, Richard Socher | 👻 Ghosted | cs.CL | 40 | 8 years ago |
| 348 | Adapting End-to-End Neural Speaker Verification to New Languages and Recording Conditions with Adversarial Training / Gautam Bhattacharya, Jahangir Alam, Patrick Kenny | 👻 Ghosted | eess.AS | 40 | 7 years ago |
| 349 | Deep Ptych: Subsampled Fourier Ptychography using Generative Priors / Fahad Shamshad, Farwa Abbas, Ali Ahmed | 👻 Ghosted | cs.LG | 40 | 7 years ago |
| 350 | Teacher-Student Training for Robust Tacotron-based TTS / Rui Liu, Berrak Sisman, ... (+4 more) | 👻 Ghosted | cs.CL | 40 | 6 years ago |