| 1301 | Fast and efficient speech enhancement with variational autoencoders<br>Mostafa Sadeghi, Romain Serizel | 👻 Ghosted | cs.SD | 6 | 3 years ago |
| 1302 | A Spectral Analysis of Graph Neural Networks on Dense and Sparse Graphs<br>Luana Ruiz, Ningyuan Huang, Soledad Villar | 👻 Ghosted | cs.SI | 6 | 3 years ago |
| 1303 | DepthFormer: Multimodal Positional Encodings and Cross-Input Attention for Transformer-Based Segmentation Networks<br>Francesco Barbato, Giulia Rizzoli, Pietro Zanuttigh | 👻 Ghosted | cs.CV | 6 | 3 years ago |
| 1304 | An online algorithm for contrastive Principal Component Analysis<br>Siavash Golkar, David Lipshutz, ... (+2 more) | 👻 Ghosted | stat.ML | 6 | 3 years ago |
| 1305 | E2E Segmentation in a Two-Pass Cascaded Encoder ASR Model<br>W. Ronny Huang, Shuo-Yiin Chang, ... (+8 more) | 👻 Ghosted | cs.CL | 6 | 3 years ago |
| 1306 | Mixer: DNN Watermarking using Image Mixup<br>Kassem Kallas, Teddy Furon | 👻 Ghosted | cs.CR | 6 | 3 years ago |
| 1307 | Interpretation of Neural Networks is Susceptible to Universal Adversarial Perturbations<br>Haniyeh Ehsani Oskouie, Farzan Farnia | 👻 Ghosted | cs.CV | 6 | 3 years ago |
| 1308 | Fully complex-valued deep learning model for visual perception<br>Aniruddh Sikdar, Sumanth Udupa, Suresh Sundaram | 👻 Ghosted | cs.CV | 6 | 3 years ago |
| 1309 | Investigating Sindy As a Tool For Causal Discovery In Time Series Signals<br>Andrew O'Brien, Rosina Weber, Edward Kim | 👻 Ghosted | cs.LG | 6 | 3 years ago |
| 1310 | Sparse Mixture Once-for-all Adversarial Training for Efficient In-Situ Trade-Off Between Accuracy and Robustness of DNNs<br>Souvik Kundu, Sairam Sundaresan, ... (+4 more) | 👻 Ghosted | cs.CV | 6 | 3 years ago |
| 1311 | Heterogeneous Graph Learning for Acoustic Event Classification<br>Amir Shirian, Mona Ahmadian, ... (+2 more) | 👻 Ghosted | cs.SD | 6 | 3 years ago |
| 1312 | Reliable Beamforming at Terahertz Bands: Are Causal Representations the Way Forward?<br>Christo Kurisummoottil Thomas, Walid Saad | 👻 Ghosted | cs.IT | 6 | 3 years ago |
| 1313 | Zero-shot personalized lip-to-speech synthesis with face image based voice control<br>Zheng-Yan Sheng, Yang Ai, Zhen-Hua Ling | 👻 Ghosted | cs.MM | 6 | 3 years ago |
| 1314 | Breaking Speaker Recognition with PaddingBack<br>Zhe Ye, Diqun Yan, ... (+2 more) | 👻 Ghosted | cs.CR | 6 | 2 years ago |
| 1315 | Parameter Efficient Audio Captioning With Faithful Guidance Using Audio-text Shared Latent Representation<br>Arvind Krishna Sridhar, Yinyi Guo, ... (+2 more) | 👻 Ghosted | cs.CL | 6 | 2 years ago |
| 1316 | Semantic reconstruction of continuous language from MEG signals<br>Bo Wang, Xiran Xu, ... (+4 more) | 👻 Ghosted | cs.HC | 6 | 2 years ago |
| 1317 | Improving Speech Recognition for African American English With Audio Classification<br>Shefali Garg, Zhouyuan Huo, ... (+12 more) | 👻 Ghosted | eess.AS | 6 | 2 years ago |
| 1318 | Corpus Synthesis for Zero-shot ASR domain Adaptation using Large Language Models<br>Hsuan Su, Ting-Yao Hu, ... (+6 more) | 👻 Ghosted | eess.AS | 6 | 2 years ago |
| 1319 | High-Fidelity Speech Synthesis with Minimal Supervision: All Using Diffusion Models<br>Chunyu Qiang, Hao Li, ... (+5 more) | 👻 Ghosted | cs.SD | 6 | 2 years ago |
| 1320 | Forgetting Private Textual Sequences in Language Models via Leave-One-Out Ensemble<br>Zhe Liu, Ozlem Kalinli | 👻 Ghosted | cs.CL | 6 | 2 years ago |
| 1321 | Correction Focused Language Model Training for Speech Recognition<br>Yingyi Ma, Zhe Liu, Ozlem Kalinli | 👻 Ghosted | cs.CL | 6 | 2 years ago |
| 1322 | On The Open Prompt Challenge In Conditional Audio Generation<br>Ernie Chang, Sidd Srinivasan, ... (+9 more) | 👻 Ghosted | cs.SD | 6 | 2 years ago |
| 1323 | Multiple Object Tracking based on Occlusion-Aware Embedding Consistency Learning<br>Yaoqi Hu, Axi Niu, ... (+4 more) | 👻 Ghosted | cs.CV | 6 | 2 years ago |
| 1324 | Correlated Attention in Transformers for Multivariate Time Series<br>Quang Minh Nguyen, Lam M. Nguyen, Subhro Das | 👻 Ghosted | cs.LG | 6 | 2 years ago |
| 1325 | A Parameterized Generative Adversarial Network Using Cyclic Projection for Explainable Medical Image Classification<br>Xiangyu Xiong, Yue Sun, ... (+7 more) | 👻 Ghosted | cs.CV | 6 | 2 years ago |
| 1326 | Multi-Rate Variable-Length CSI Compression for FDD Massive MIMO<br>Bumsu Park, Heedong Do, Namyoon Lee | 👻 Ghosted | cs.IT | 6 | 2 years ago |
| 1327 | Towards Controlled Table-to-Text Generation with Scientific Reasoning<br>Zhixin Guo, Jianping Zhou, ... (+7 more) | 👻 Ghosted | cs.CL | 6 | 2 years ago |
| 1328 | Comparable Demonstrations are Important in In-Context Learning: A Novel Perspective on Demonstration Selection<br>Caoyun Fan, Jidong Tian, ... (+3 more) | 👻 Ghosted | cs.CL | 6 | 2 years ago |
| 1329 | Can LLM find the green circle? Investigation and Human-guided tool manipulation for compositional generalization<br>Min Zhang, Jianfeng He, ... (+4 more) | 👻 Ghosted | cs.CL | 6 | 2 years ago |
| 1330 | AEGIS-Net: Attention-guided Multi-Level Feature Aggregation for Indoor Place Recognition<br>Yuhang Ming, Jian Ma, ... (+4 more) | 👻 Ghosted | cs.CV | 6 | 2 years ago |
| 1331 | Improving Cross-domain Few-shot Classification with Multilayer Perceptron<br>Shuanghao Bai, Wanqi Zhou, ... (+3 more) | 👻 Ghosted | cs.CV | 6 | 2 years ago |
| 1332 | A Soft Contrastive Learning-based Prompt Model for Few-shot Sentiment Analysis<br>Jingyi Zhou, Jie Zhou, ... (+6 more) | 👻 Ghosted | cs.CL | 6 | 2 years ago |
| 1333 | BLSTM-Based Confidence Estimation for End-to-End Speech Recognition<br>Atsunori Ogawa, Naohiro Tawara, ... (+2 more) | 👻 Ghosted | eess.AS | 6 | 2 years ago |
| 1334 | Learning Audio Concepts from Counterfactual Natural Language<br>Ali Vosoughi, Luca Bondi, ... (+2 more) | 👻 Ghosted | cs.MM | 6 | 2 years ago |
| 1335 | Enhancing Image-Text Matching with Adaptive Feature Aggregation<br>Zuhui Wang, Yunting Yin, I. V. Ramakrishnan | 👻 Ghosted | cs.IR | 6 | 2 years ago |
| 1336 | Encoding Time and Energy Model for SVT-AV1 based on Video Complexity<br>Lena Eichermüller, Gaurang Chaudhari, ... (+5 more) | 👻 Ghosted | eess.IV | 6 | 2 years ago |
| 1337 | Panoramic Image Inpainting With Gated Convolution And Contextual Reconstruction Loss<br>Li Yu, Yanjun Gao, ... (+2 more) | 👻 Ghosted | eess.IV | 6 | 2 years ago |
| 1338 | Enhancing Expressiveness in Dance Generation via Integrating Frequency and Music Style Information<br>Qiaochu Huang, Xu He, ... (+7 more) | 👻 Ghosted | cs.MM | 6 | 2 years ago |
| 1339 | High-Dimensional Confidence Regions in Sparse MRI<br>Frederik Hoppe, Felix Krahmer, ... (+3 more) | 👻 Ghosted | eess.SP | 6 | 1 year ago |
| 1340 | Rhythmic Foley: A Framework For Seamless Audio-Visual Alignment In Video-to-Audio Synthesis<br>Zhiqi Huang, Dan Luo, ... (+4 more) | 👻 Ghosted | cs.SD | 6 | 1 year ago |
| 1341 | GraFPrint: A GNN-Based Approach for Audio Identification<br>Aditya Bhattacharjee, Shubhr Singh, Emmanouil Benetos | 👻 Ghosted | cs.SD | 6 | 1 year ago |
| 1342 | Get Large Language Models Ready to Speak: A Late-fusion Approach for Speech Generation<br>Maohao Shen, Shun Zhang, ... (+6 more) | 👻 Ghosted | cs.CL | 6 | 1 year ago |
| 1343 | DiffSR: Learning Radar Reflectivity Synthesis via Diffusion Model from Satellite Observations<br>Xuming He, Zhiwang Zhou, ... (+5 more) | 👻 Ghosted | eess.IV | 6 | 1 year ago |
| 1344 | M3-CVC: Controllable Video Compression with Multimodal Generative Models<br>Rui Wan, Qi Zheng, Yibo Fan | 👻 Ghosted | eess.IV | 6 | 1 year ago |
| 1345 | EM-MIAs: Enhancing Membership Inference Attacks in Large Language Models through Ensemble Modeling<br>Zichen Song, Sitan Huang, Zhongfeng Kang | 👻 Ghosted | cs.RO | 6 | 1 year ago |
| 1346 | Semantic Residual for Multimodal Unified Discrete Representation<br>Hai Huang, Shulei Wang, Yan Xia | 👻 Ghosted | cs.CV | 6 | 1 year ago |
| 1347 | LipGen: Viseme-Guided Lip Video Generation for Enhancing Visual Speech Recognition<br>Bowen Hao, Dongliang Zhou, ... (+5 more) | 👻 Ghosted | cs.CV | 6 | 1 year ago |
| 1348 | Fast keypoint detection in video sequences<br>Luca Baroffio, Matteo Cesana, ... (+2 more) | 👻 Ghosted | cs.CV | 5 | 11 years ago |
| 1349 | Enhancing Automatically Discovered Multi-level Acoustic Patterns Considering Context Consistency With Applications in Spoken Term Detection<br>Cheng-Tao Chung, Wei-Ning Hsu, ... (+2 more) | 👻 Ghosted | cs.CL | 5 | 10 years ago |
| 1350 | Learning Data Triage: Linear Decoding Works for Compressive MRI<br>Yen-Huan Li, Volkan Cevher | 👻 Ghosted | cs.IT | 5 | 10 years ago |