| 1151 |
Audio-visual speech enhancement with a deep Kalman filter generative model
Ali Golmakani, Mostafa Sadeghi, Romain Serizel
|
👻
Ghosted
|
cs.CV
|
8 |
3 years ago |
| 1152 |
Resource-Efficient Transfer Learning From Speech Foundation Model Using Hierarchical Feature Fusion
Zhouyuan Huo, Khe Chai Sim, ... (+4 more)
|
👻
Ghosted
|
cs.LG
|
8 |
3 years ago |
| 1153 |
On minimal variations for unsupervised representation learning
Vivien Cabannes, Alberto Bietti, Randall Balestriero
|
👻
Ghosted
|
cs.LG
|
8 |
3 years ago |
| 1154 |
Identifying Coordination in a Cognitive Radar Network -- A Multi-Objective Inverse Reinforcement Learning Approach
Luke Snow, Vikram Krishnamurthy, Brian M. Sadler
|
👻
Ghosted
|
eess.SP
|
8 |
3 years ago |
| 1155 |
Multilevel Transformer For Multimodal Emotion Recognition
Junyi He, Meimei Wu, ... (+3 more)
|
👻
Ghosted
|
cs.CL
|
8 |
3 years ago |
| 1156 |
Reverberation as Supervision for Speech Separation
Rohith Aralikatti, Christoph Boeddeker, ... (+3 more)
|
👻
Ghosted
|
eess.AS
|
8 |
3 years ago |
| 1157 |
Simultaneously Learning Robust Audio Embeddings and balanced Hash codes for Query-by-Example
Anup Singh, Kris Demuynck, Vipul Arora
|
👻
Ghosted
|
eess.AS
|
8 |
3 years ago |
| 1158 |
Embedding a Differentiable Mel-cepstral Synthesis Filter to a Neural Speech Synthesis System
Takenori Yoshimura, Shinji Takaki, ... (+6 more)
|
👻
Ghosted
|
eess.AS
|
8 |
3 years ago |
| 1159 |
Rethinking Implicit Neural Representations for Vision Learners
Yiran Song, Qianyu Zhou, Lizhuang Ma
|
👻
Ghosted
|
cs.CV
|
8 |
3 years ago |
| 1160 |
Deep Unfolded Tensor Robust PCA with Self-supervised Learning
Harry Dong, Megna Shah, ... (+2 more)
|
👻
Ghosted
|
stat.ML
|
8 |
3 years ago |
| 1161 |
CRC-Aided Learned Ensembles of Belief-Propagation Polar Decoders
Tomer Raviv, Alon Goldman, ... (+3 more)
|
👻
Ghosted
|
cs.IT
|
8 |
3 years ago |
| 1162 |
Joint Data Association, NLOS Mitigation, and Clutter Suppression for Networked Device-Free Sensing in 6G Cellular Network
Qin Shi, Liang Liu, Shuowen Zhang
|
👻
Ghosted
|
eess.SP
|
8 |
3 years ago |
| 1163 |
Dynamic Privacy Allocation for Locally Differentially Private Federated Learning with Composite Objectives
Jiaojiao Zhang, Dominik Fay, Mikael Johansson
|
👻
Ghosted
|
cs.LG
|
8 |
2 years ago |
| 1164 |
Causal-Story: Local Causal Attention Utilizing Parameter-Efficient Tuning For Visual Story Synthesis
Tianyi Song, Jiuxin Cao, ... (+3 more)
|
👻
Ghosted
|
cs.CV
|
8 |
2 years ago |
| 1165 |
Enhancing End-to-End Conversational Speech Translation Through Target Language Context Utilization
Amir Hussein, Brian Yan, ... (+3 more)
|
👻
Ghosted
|
cs.CL
|
8 |
2 years ago |
| 1166 |
Do self-supervised speech and language models extract similar representations as human brain?
Peili Chen, Linyang He, ... (+4 more)
|
👻
Ghosted
|
q-bio.NC
|
8 |
2 years ago |
| 1167 |
Fast Word Error Rate Estimation Using Self-Supervised Representations for Speech and Text
Chanho Park, Chengsong Lu, ... (+2 more)
|
👻
Ghosted
|
eess.AS
|
8 |
2 years ago |
| 1168 |
MERTech: Instrument Playing Technique Detection Using Self-Supervised Pretrained Model With Multi-Task Finetuning
Dichucheng Li, Yinghao Ma, ... (+7 more)
|
👻
Ghosted
|
cs.SD
|
8 |
2 years ago |
| 1169 |
Double Reverse Regularization Network Based on Self-Knowledge Distillation for SAR Object Classification
Bo Xu, Hao Zheng, ... (+3 more)
|
👻
Ghosted
|
cs.CV
|
8 |
2 years ago |
| 1170 |
Learning graphs and simplicial complexes from data
Andrei Buciulea, Elvin Isufi, ... (+2 more)
|
👻
Ghosted
|
eess.SP
|
8 |
2 years ago |
| 1171 |
Attention-Driven Multichannel Speech Enhancement in Moving Sound Source Scenarios
Yuzhu Wang, Archontis Politis, Tuomas Virtanen
|
👻
Ghosted
|
eess.AS
|
8 |
2 years ago |
| 1172 |
Stability of Graph Convolutional Neural Networks through the lens of small perturbation analysis
Lucia Testa, Claudio Battiloro, ... (+2 more)
|
👻
Ghosted
|
cs.LG
|
8 |
2 years ago |
| 1173 |
Near-Field Localization with $1$-bit Quantized Hybrid A/D Reception
Ioannis Gavras, Italo Atzeni, George C. Alexandropoulos
|
👻
Ghosted
|
cs.IT
|
8 |
2 years ago |
| 1174 |
Version age-based client scheduling policy for federated learning
Xinyi Hu, Nikolaos Pappas, Howard H. Yang
|
👻
Ghosted
|
cs.LG
|
8 |
2 years ago |
| 1175 |
Leveraging Pre-Trained Autoencoders for Interpretable Prototype Learning of Music Audio
Pablo Alonso-Jiménez, Leonardo Pepino, ... (+5 more)
|
👻
Ghosted
|
cs.SD
|
8 |
2 years ago |
| 1176 |
USV-AUV Collaboration Framework for Underwater Tasks under Extreme Sea Conditions
Jingzehua Xu, Guanwen Xie, ... (+3 more)
|
👻
Ghosted
|
cs.RO
|
8 |
1 year ago |
| 1177 |
Investigating the Effectiveness of Explainability Methods in Parkinson's Detection from Speech
Eleonora Mancini, Francesco Paissan, ... (+3 more)
|
👻
Ghosted
|
cs.SD
|
8 |
1 year ago |
| 1178 |
Follow-Your-MultiPose: Tuning-Free Multi-Character Text-to-Video Generation via Pose Guidance
Beiyuan Zhang, Yue Ma, ... (+4 more)
|
👻
Ghosted
|
cs.CV
|
8 |
1 year ago |
| 1179 |
Scalable Speech Enhancement with Dynamic Channel Pruning
Riccardo Miccini, Clement Laroche, ... (+2 more)
|
👻
Ghosted
|
eess.AS
|
8 |
1 year ago |
| 1180 |
DA-LIF: Dual Adaptive Leaky Integrate-and-Fire Model for Deep Spiking Neural Networks
Tianqing Zhang, Kairong Yu, ... (+2 more)
|
👻
Ghosted
|
cs.NE
|
8 |
1 year ago |
| 1181 |
Performance Analysis for Pilot-based 1-bit Channel Estimation with Unknown Quantization Threshold
Manuel Stein, Shahar Bar, ... (+2 more)
|
👻
Ghosted
|
cs.IT
|
7 |
10 years ago |
| 1182 |
Active Learning On Weighted Graphs Using Adaptive And Non-adaptive Approaches
Eyal En Gad, Akshay Gadde, ... (+2 more)
|
👻
Ghosted
|
cs.LG
|
7 |
10 years ago |
| 1183 |
Contour-based 3d tongue motion visualization using ultrasound image sequences
Kele Xu, Yin Yang, ... (+3 more)
|
👻
Ghosted
|
cs.CV
|
7 |
10 years ago |
| 1184 |
DOA estimation in structured phase-noisy environments: technical report
Angélique Drémeau, Cédric Herzet
|
👻
Ghosted
|
cs.IT
|
7 |
9 years ago |
| 1185 |
Learning conditional independence structure for high-dimensional uncorrelated vector processes
Nguyen Tran Quang, Alexander Jung
|
👻
Ghosted
|
stat.ML
|
7 |
9 years ago |
| 1186 |
Finite-State Channel Models for Signal Transduction in Neural Systems
Andrew W. Eckford, Kenneth A. Loparo, Peter J. Thomas
|
👻
Ghosted
|
q-bio.QM
|
7 |
9 years ago |
| 1187 |
Joint Bayesian Gaussian discriminant analysis for speaker verification
Yiyan Wang, Haotian Xu, Zhijian Ou
|
👻
Ghosted
|
cs.SD
|
7 |
9 years ago |
| 1188 |
Training Deep Neural Networks via Optimization Over Graphs
Guoqiang Zhang, W. Bastiaan Kleijn
|
👻
Ghosted
|
cs.LG
|
7 |
9 years ago |
| 1189 |
Jamming Resistant Receivers for Massive MIMO
Tan Tai Do, Emil Björnson, Erik G. Larsson
|
👻
Ghosted
|
cs.IT
|
7 |
9 years ago |
| 1190 |
Direct Ensemble Estimation of Density Functionals
Alan Wisler, Kevin Moon, Visar Berisha
|
👻
Ghosted
|
cs.IT
|
7 |
9 years ago |
| 1191 |
A Neural Network Approach for Mixing Language Models
Youssef Oualil, Dietrich Klakow
|
👻
Ghosted
|
cs.CL
|
7 |
8 years ago |
| 1192 |
A Supervised STDP-based Training Algorithm for Living Neural Networks
Yuan Zeng, Kevin Devincentis, ... (+5 more)
|
👻
Ghosted
|
cs.NE
|
7 |
8 years ago |
| 1193 |
Tracking of enriched dialog states for flexible conversational information access
Yinpei Dai, Zhijian Ou, ... (+2 more)
|
👻
Ghosted
|
cs.CL
|
7 |
8 years ago |
| 1194 |
Dynamically Context-Sensitive Time-Decay Attention for Dialogue Modeling
Shang-Yu Su, Pei-Chieh Yuan, Yun-Nung Chen
|
👻
Ghosted
|
cs.CL
|
7 |
7 years ago |
| 1195 |
Sparse Gaussian Process Audio Source Separation Using Spectrum Priors in the Time-Domain
Pablo A. Alvarado, Mauricio A. Álvarez, Dan Stowell
|
👻
Ghosted
|
eess.AS
|
7 |
7 years ago |
| 1196 |
A simple bound on the BER of the MAP decoder for massive MIMO systems
Christos Thrampoulidis, Ilias Zadik, Yury Polyanskiy
|
👻
Ghosted
|
cs.IT
|
7 |
7 years ago |
| 1197 |
Contextual Out-of-Domain Utterance Handling With Counterfeit Data Augmentation
Sungjin Lee, Igor Shalyminov
|
👻
Ghosted
|
cs.CL
|
7 |
7 years ago |
| 1198 |
Teach an all-rounder with experts in different domains
Zhao You, Dan Su, Dong Yu
|
👻
Ghosted
|
eess.AS
|
7 |
6 years ago |
| 1199 |
On Modeling ASR Word Confidence
Woojay Jeon, Maxwell Jordan, Mahesh Krishnamoorthy
|
👻
Ghosted
|
cs.CL
|
7 |
6 years ago |
| 1200 |
HumanGAN: generative adversarial network with human-based discriminator and its evaluation in speech perception modeling
Kazuki Fujii, Yuki Saito, ... (+3 more)
|
👻
Ghosted
|
cs.SD
|
7 |
6 years ago |