| 851 |
Identifying Source Speakers for Voice Conversion based Spoofing Attacks on Speaker Verification Systems
Danwei Cai, Zexin Cai, Ming Li
|
👻
Ghosted
|
eess.AS
|
14 |
4 years ago |
| 852 |
MetricBERT: Text Representation Learning via Self-Supervised Triplet Training
Itzik Malkiel, Dvir Ginzburg, ... (+4 more)
|
👻
Ghosted
|
cs.CL
|
14 |
3 years ago |
| 853 |
Learning ASR pathways: A sparse multilingual ASR model
Mu Yang, Andros Tjandra, ... (+4 more)
|
👻
Ghosted
|
eess.AS
|
14 |
3 years ago |
| 854 |
Structured State Space Decoder for Speech Recognition and Synthesis
Koichi Miyazaki, Masato Murata, Tomoki Koriyama
|
👻
Ghosted
|
cs.SD
|
14 |
3 years ago |
| 855 |
Neural Fourier Shift for Binaural Speech Rendering
Jin Woo Lee, Kyogu Lee
|
👻
Ghosted
|
eess.AS
|
14 |
3 years ago |
| 856 |
Bridging Speech and Textual Pre-trained Models with Unsupervised ASR
Jiatong Shi, Chan-Jan Hsu, ... (+6 more)
|
👻
Ghosted
|
cs.CL
|
14 |
3 years ago |
| 857 |
Backdoor Defense via Suppressing Model Shortcuts
Sheng Yang, Yiming Li, ... (+2 more)
|
👻
Ghosted
|
cs.CV
|
14 |
3 years ago |
| 858 |
In-Sensor & Neuromorphic Computing are all you need for Energy Efficient Computer Vision
Gourav Datta, Zeyu Liu, ... (+7 more)
|
👻
Ghosted
|
cs.CV
|
14 |
3 years ago |
| 859 |
NeRF-Gaze: A Head-Eye Redirection Parametric Model for Gaze Estimation
Pengwei Yin, Jiawu Dai, ... (+3 more)
|
👻
Ghosted
|
cs.CV
|
14 |
3 years ago |
| 860 |
GEmo-CLAP: Gender-Attribute-Enhanced Contrastive Language-Audio Pretraining for Accurate Speech Emotion Recognition
Yu Pan, Yanni Hu, ... (+6 more)
|
👻
Ghosted
|
cs.CL
|
14 |
3 years ago |
| 861 |
Communication Efficient Private Federated Learning Using Dithering
Burak Hasircioglu, Deniz Gunduz
|
👻
Ghosted
|
cs.LG
|
14 |
2 years ago |
| 862 |
SD-HuBERT: Sentence-Level Self-Distillation Induces Syllabic Organization in HuBERT
Cheol Jun Cho, Abdelrahman Mohamed, ... (+3 more)
|
👻
Ghosted
|
cs.CL
|
14 |
2 years ago |
| 863 |
Recognition-Guided Diffusion Model for Scene Text Image Super-Resolution
Yuxuan Zhou, Liangcai Gao, ... (+2 more)
|
👻
Ghosted
|
cs.CV
|
14 |
2 years ago |
| 864 |
Classification-Oriented Semantic Wireless Communications
Emrecan Kutay, Aylin Yener
|
👻
Ghosted
|
cs.IT
|
14 |
2 years ago |
| 865 |
sVAD: A Robust, Low-Power, and Light-Weight Voice Activity Detection with Spiking Neural Networks
Qu Yang, Qianhui Liu, ... (+4 more)
|
👻
Ghosted
|
cs.SD
|
14 |
2 years ago |
| 866 |
Enhancing Low-Resource ASR through Versatile TTS: Bridging the Data Gap
Guanrou Yang, Fan Yu, ... (+5 more)
|
👻
Ghosted
|
eess.AS
|
14 |
1 year ago |
| 867 |
Bayesian and hybrid Cramer-Rao bounds for QAM dynamical phase estimation
Jianxiao Yang, Benoit Geller, A Wei
|
👻
Ghosted
|
cs.IT
|
13 |
10 years ago |
| 868 |
Mobile Beamforming & Spatially Controlled Relay Communications
Dionysios S. Kalogerias, Athina P. Petropulu
|
👻
Ghosted
|
eess.SY
|
13 |
10 years ago |
| 869 |
Robust On-line Matrix Completion on Graphs
Symeon Chouvardas, Mohammed Amin Abdullah, ... (+2 more)
|
👻
Ghosted
|
cs.IT
|
13 |
10 years ago |
| 870 |
An FFT-based Synchronization Approach to Recognize Human Behaviors using STN-LFP Signal
Hosein M. Golshan, Adam O. Hebb, ... (+3 more)
|
👻
Ghosted
|
cs.CV
|
13 |
9 years ago |
| 871 |
Analysis of Distributed ADMM Algorithm for Consensus Optimization in Presence of Error
Layla Majzoobi, Farshad Lahouti
|
👻
Ghosted
|
cs.DC
|
13 |
9 years ago |
| 872 |
Hybrid Deep-Semantic Matrix Factorization for Tag-Aware Personalized Recommendation
Zhenghua Xu, Cheng Chen, ... (+2 more)
|
👻
Ghosted
|
cs.IR
|
13 |
8 years ago |
| 873 |
Full-info Training for Deep Speaker Feature Learning
Lantian Li, Zhiyuan Tang, ... (+2 more)
|
👻
Ghosted
|
cs.SD
|
13 |
8 years ago |
| 874 |
Investigating the Effects of Word Substitution Errors on Sentence Embeddings
Rohit Voleti, Julie M. Liss, Visar Berisha
|
👻
Ghosted
|
cs.CL
|
13 |
7 years ago |
| 875 |
Semi-Supervised Monaural Singing Voice Separation With a Masking Network Trained on Synthetic Mixtures
Michael Michelashvili, Sagie Benaim, Lior Wolf
|
👻
Ghosted
|
cs.SD
|
13 |
7 years ago |
| 876 |
Stability of Graph Neural Networks to Relative Perturbations
Fernando Gama, Joan Bruna, Alejandro Ribeiro
|
👻
Ghosted
|
cs.LG
|
13 |
6 years ago |
| 877 |
GPU-Accelerated Viterbi Exact Lattice Decoder for Batched Online and Offline Speech Recognition
Hugo Braun, Justin Luitjens, ... (+3 more)
|
👻
Ghosted
|
cs.CL
|
13 |
6 years ago |
| 878 |
Accurate 6D Object Pose Estimation by Pose Conditioned Mesh Reconstruction
Pedro Castro, Anil Armagan, Tae-Kyun Kim
|
👻
Ghosted
|
cs.CV
|
13 |
6 years ago |
| 879 |
Sequence-to-sequence Automatic Speech Recognition with Word Embedding Regularization and Fused Decoding
Alexander H. Liu, Tzu-Wei Sung, ... (+3 more)
|
👻
Ghosted
|
cs.CL
|
13 |
6 years ago |
| 880 |
DFSMN-SAN with Persistent Memory Model for Automatic Speech Recognition
Zhao You, Dan Su, ... (+3 more)
|
👻
Ghosted
|
eess.AS
|
13 |
6 years ago |
| 881 |
Learning To Characterize Adversarial Subspaces
Xiaofeng Mao, Yuefeng Chen, ... (+3 more)
|
👻
Ghosted
|
cs.CV
|
13 |
6 years ago |
| 882 |
WITCHcraft: Efficient PGD attacks with random step size
Ping-Yeh Chiang, Jonas Geiping, ... (+5 more)
|
👻
Ghosted
|
cs.LG
|
13 |
6 years ago |
| 883 |
Compositional embedding models for speaker identification and diarization with simultaneous speech from 2+ speakers
Zeqian Li, Jacob Whitehill
|
👻
Ghosted
|
cs.SD
|
13 |
5 years ago |
| 884 |
Robust Latent Representations via Cross-Modal Translation and Alignment
Vandana Rajan, Alessio Brutti, Andrea Cavallaro
|
👻
Ghosted
|
cs.LG
|
13 |
5 years ago |
| 885 |
End-to-end spoken language understanding using transformer networks and self-supervised pre-trained features
Edmilson Morais, Hong-Kwang J. Kuo, ... (+3 more)
|
👻
Ghosted
|
cs.CL
|
13 |
5 years ago |
| 886 |
Refining Automatic Speech Recognition System for older adults
Liu Chen, Meysam Asgari
|
👻
Ghosted
|
eess.AS
|
13 |
5 years ago |
| 887 |
Data-Efficient Framework for Real-world Multiple Sound Source 2D Localization
Guillaume Le Moing, Phongtharin Vinayavekhin, ... (+5 more)
|
👻
Ghosted
|
eess.AS
|
13 |
5 years ago |
| 888 |
Deformable Temporal Convolutional Networks for Monaural Noisy Reverberant Speech Separation
William Ravenscroft, Stefan Goetze, Thomas Hain
|
👻
Ghosted
|
cs.SD
|
13 |
3 years ago |
| 889 |
GraphMAD: Graph Mixup for Data Augmentation using Data-Driven Convex Clustering
Madeline Navarro, Santiago Segarra
|
👻
Ghosted
|
cs.LG
|
13 |
3 years ago |
| 890 |
An analysis of degenerating speech due to progressive dysarthria on ASR performance
Katrin Tomanek, Katie Seaver, ... (+4 more)
|
👻
Ghosted
|
eess.AS
|
13 |
3 years ago |
| 891 |
HDNet: Hierarchical Dynamic Network for Gait Recognition using Millimeter-Wave Radar
Yanyan Huang, Yong Wang, ... (+5 more)
|
👻
Ghosted
|
cs.CV
|
13 |
3 years ago |
| 892 |
TrimTail: Low-Latency Streaming ASR with Simple but Effective Spectrogram-Level Length Penalty
Xingchen Song, Di Wu, ... (+7 more)
|
👻
Ghosted
|
cs.SD
|
13 |
3 years ago |
| 893 |
Deep Neural Mel-Subband Beamformer for In-car Speech Separation
Vinay Kothapally, Yong Xu, ... (+3 more)
|
👻
Ghosted
|
eess.AS
|
13 |
3 years ago |
| 894 |
Adapted Multimodal BERT with Layer-wise Fusion for Sentiment Analysis
Odysseas S. Chlapanis, Georgios Paraskevopoulos, Alexandros Potamianos
|
👻
Ghosted
|
cs.CL
|
13 |
3 years ago |
| 895 |
Interference Leakage Minimization in RIS-assisted MIMO Interference Channels
Ignacio Santamaria, Mohammad Soleymani, ... (+2 more)
|
👻
Ghosted
|
cs.IT
|
13 |
3 years ago |
| 896 |
Higher-order Organization in the Human Brain from Matrix-Based Rényi's Entropy
Qiang Li, Shujian Yu, ... (+3 more)
|
👻
Ghosted
|
q-bio.NC
|
13 |
3 years ago |
| 897 |
Recovering from Privacy-Preserving Masking with Large Language Models
Arpita Vats, Zhe Liu, ... (+6 more)
|
👻
Ghosted
|
cs.CL
|
13 |
2 years ago |
| 898 |
Directional Source Separation for Robust Speech Recognition on Smart Glasses
Tiantian Feng, Ju Lin, ... (+8 more)
|
👻
Ghosted
|
cs.SD
|
13 |
2 years ago |
| 899 |
uSee: Unified Speech Enhancement and Editing with Conditional Diffusion Models
Muqiao Yang, Chunlei Zhang, ... (+5 more)
|
👻
Ghosted
|
cs.SD
|
13 |
2 years ago |
| 900 |
Zero Resource Code-switched Speech Benchmark Using Speech Utterance Pairs For Multiple Spoken Languages
Kuan-Po Huang, Chih-Kai Yang, ... (+3 more)
|
👻
Ghosted
|
eess.AS
|
13 |
2 years ago |