| 501 |
Unsupervised vocal dereverberation with diffusion-based generative models
Koichi Saito, Naoki Murata, ... (+5 more)
|
👻
Ghosted
|
eess.AS
|
28 |
3 years ago |
| 502 |
Building Blocks for a Complex-Valued Transformer Architecture
Florian Eilers, Xiaoyi Jiang
|
👻
Ghosted
|
cs.LG
|
28 |
3 years ago |
| 503 |
VoxBlink: A Large Scale Speaker Verification Dataset on Camera
Yuke Lin, Xiaoyi Qin, ... (+5 more)
|
👻
Ghosted
|
eess.AS
|
28 |
2 years ago |
| 504 |
CommIN: Semantic Image Communications as an Inverse Problem with INN-Guided Diffusion Models
Jiakang Chen, Di You, ... (+2 more)
|
👻
Ghosted
|
eess.IV
|
28 |
2 years ago |
| 505 |
One-Shot Sensitivity-Aware Mixed Sparsity Pruning for Large Language Models
Hang Shao, Bei Liu, ... (+4 more)
|
👻
Ghosted
|
cs.CL
|
28 |
2 years ago |
| 506 |
Cross-Modality and Within-Modality Regularization for Audio-Visual DeepFake Detection
Heqing Zou, Meng Shen, ... (+4 more)
|
👻
Ghosted
|
cs.MM
|
28 |
2 years ago |
| 507 |
Partitioned Successive-Cancellation List Decoding of Polar Codes
Seyyed Ali Hashemi, Alexios Balatsoukas-Stimming, ... (+3 more)
|
👻
Ghosted
|
cs.AR
|
27 |
10 years ago |
| 508 |
Training Probabilistic Spiking Neural Networks with First-to-spike Decoding
Alireza Bagheri, Osvaldo Simeone, Bipin Rajendran
|
👻
Ghosted
|
stat.ML
|
27 |
8 years ago |
| 509 |
Sequence-to-Sequence ASR Optimization via Reinforcement Learning
Andros Tjandra, Sakriani Sakti, Satoshi Nakamura
|
👻
Ghosted
|
cs.CL
|
27 |
8 years ago |
| 510 |
MuSE-ing on the Impact of Utterance Ordering On Crowdsourced Emotion Annotations
Mimansa Jaiswal, Zakaria Aldeneh, ... (+5 more)
|
👻
Ghosted
|
cs.SD
|
27 |
7 years ago |
| 511 |
CT-CAPS: Feature Extraction-based Automated Framework for COVID-19 Disease Identification from Chest CT Scans using Capsule Networks
Shahin Heidarian, Parnian Afshar, ... (+5 more)
|
👻
Ghosted
|
eess.IV
|
27 |
5 years ago |
| 512 |
Structure-Aware Classification using Supervised Dictionary Learning
Yael Yankelevsky, Michael Elad
|
👻
Ghosted
|
cs.LG
|
26 |
9 years ago |
| 513 |
Performance Analysis for Time-of-Arrival Estimation with Oversampled Low-Complexity 1-bit A/D Conversion
Manuel S. Stein
|
👻
Ghosted
|
cs.IT
|
26 |
9 years ago |
| 514 |
Bi-Directional Lattice Recurrent Neural Networks for Confidence Estimation
Qiujia Li, Preben Ness, ... (+2 more)
|
👻
Ghosted
|
eess.AS
|
26 |
7 years ago |
| 515 |
Asynchronous Neighbor Discovery Using Coupled Compressive Sensing
Vamsi K. Amalladinne, Krishna R. Narayanan, ... (+2 more)
|
👻
Ghosted
|
eess.SP
|
26 |
7 years ago |
| 516 |
Gaussian-Constrained training for speaker verification
Lantian Li, Zhiyuan Tang, ... (+2 more)
|
👻
Ghosted
|
eess.AS
|
26 |
7 years ago |
| 517 |
Sequence-Level Knowledge Distillation for Model Compression of Attention-based Sequence-to-Sequence Speech Recognition
Raden Mu'az Mun'im, Nakamasa Inoue, Koichi Shinoda
|
👻
Ghosted
|
cs.CL
|
26 |
7 years ago |
| 518 |
Learning Affective Correspondence between Music and Image
Gaurav Verma, Eeshan Gunesh Dhekane, Tanaya Guha
|
👻
Ghosted
|
cs.MM
|
26 |
7 years ago |
| 519 |
Knowledge Distillation For Recurrent Neural Network Language Modeling With Trust Regularization
Yangyang Shi, Mei-Yuh Hwang, ... (+2 more)
|
👻
Ghosted
|
cs.CL
|
26 |
7 years ago |
| 520 |
Neural Percussive Synthesis Parameterised by High-Level Timbral Features
António Ramires, Pritish Chandna, ... (+3 more)
|
👻
Ghosted
|
eess.AS
|
26 |
6 years ago |
| 521 |
Orthros: Non-autoregressive End-to-end Speech Translation with Dual-decoder
Hirofumi Inaguma, Yosuke Higuchi, ... (+3 more)
|
👻
Ghosted
|
cs.CL
|
26 |
5 years ago |
| 522 |
Directional ASR: A New Paradigm for E2E Multi-Speaker Speech Recognition with Source Localization
Aswin Shanmugam Subramanian, Chao Weng, ... (+5 more)
|
👻
Ghosted
|
eess.AS
|
26 |
5 years ago |
| 523 |
BATT: Backdoor Attack with Transformation-based Triggers
Tong Xu, Yiming Li, ... (+2 more)
|
👻
Ghosted
|
cs.CR
|
26 |
3 years ago |
| 524 |
Spatially Selective Deep Non-linear Filters for Speaker Extraction
Kristina Tesch, Timo Gerkmann
|
👻
Ghosted
|
eess.AS
|
26 |
3 years ago |
| 525 |
Self-Supervised Learning for Speech Enhancement through Synthesis
Bryce Irvin, Marko Stamenovic, ... (+2 more)
|
👻
Ghosted
|
eess.AS
|
26 |
3 years ago |
| 526 |
A Dynamic Graph Interactive Framework with Label-Semantic Injection for Spoken Language Understanding
Zhihong Zhu, Weiyuan Xu, ... (+3 more)
|
👻
Ghosted
|
cs.CL
|
26 |
3 years ago |
| 527 |
The Multimodal Information based Speech Processing (MISP) 2022 Challenge: Audio-Visual Diarization and Recognition
Zhe Wang, Shilong Wu, ... (+13 more)
|
👻
Ghosted
|
cs.MM
|
26 |
3 years ago |
| 528 |
DiffVoice: Text-to-Speech with Latent Diffusion
Zhijun Liu, Yiwei Guo, Kai Yu
|
👻
Ghosted
|
eess.AS
|
26 |
3 years ago |
| 529 |
Boosting Large Language Model for Speech Synthesis: An Empirical Study
Hongkun Hao, Long Zhou, ... (+5 more)
|
👻
Ghosted
|
cs.CL
|
26 |
2 years ago |
| 530 |
MIMO Transmit Beampattern Matching Under Waveform Constraints
Ziping Zhao, Daniel P. Palomar
|
👻
Ghosted
|
math.OC
|
25 |
8 years ago |
| 531 |
STFT spectral loss for training a neural speech waveform model
Shinji Takaki, Toru Nakashika, ... (+2 more)
|
👻
Ghosted
|
eess.AS
|
25 |
7 years ago |
| 532 |
Matrix Completion With Variational Graph Autoencoders: Application in Hyperlocal Air Quality Inference
Tien Huu Do, Duc Minh Nguyen, ... (+6 more)
|
👻
Ghosted
|
cs.LG
|
25 |
7 years ago |
| 533 |
Multimodal Grounding for Sequence-to-Sequence Speech Recognition
Ozan Caglayan, Ramon Sanabria, ... (+3 more)
|
👻
Ghosted
|
cs.CL
|
25 |
7 years ago |
| 534 |
Detection of Backdoors in Trained Classifiers Without Access to the Training Set
Zhen Xiang, David J. Miller, George Kesidis
|
👻
Ghosted
|
cs.LG
|
25 |
6 years ago |
| 535 |
Kernel computations from large-scale random features obtained by Optical Processing Units
Ruben Ohana, Jonas Wacker, ... (+5 more)
|
👻
Ghosted
|
cs.ET
|
25 |
6 years ago |
| 536 |
Mixup-breakdown: a consistency training method for improving generalization of speech separation models
Max W. Y. Lam, Jun Wang, ... (+2 more)
|
👻
Ghosted
|
eess.AS
|
25 |
6 years ago |
| 537 |
Metric Learning with Background Noise Class for Few-shot Detection of Rare Sound Events
Kazuki Shimada, Yuichiro Koyama, Akira Inoue
|
👻
Ghosted
|
eess.AS
|
25 |
6 years ago |
| 538 |
Supervised online diarization with sample mean loss for multi-domain data
Enrico Fini, Alessio Brutti
|
👻
Ghosted
|
eess.AS
|
25 |
6 years ago |
| 539 |
Self-supervised Adversarial Training
Kejiang Chen, Hang Zhou, ... (+7 more)
|
👻
Ghosted
|
cs.LG
|
25 |
6 years ago |
| 540 |
Audio-attention discriminative language model for ASR rescoring
Ankur Gandhe, Ariya Rastrow
|
👻
Ghosted
|
eess.AS
|
25 |
6 years ago |
| 541 |
Learning to fool the speaker recognition
Jiguo Li, Xinfeng Zhang, ... (+5 more)
|
👻
Ghosted
|
eess.AS
|
25 |
6 years ago |
| 542 |
Two-Stage Adaptive Pooling with RT-qPCR for COVID-19 Screening
Anoosheh Heidarzadeh, Krishna R. Narayanan
|
👻
Ghosted
|
cs.IT
|
25 |
5 years ago |
| 543 |
An Efficient Algorithm for Device Detection and Channel Estimation in Asynchronous IoT Systems
Liang Liu, Ya-Feng Liu
|
👻
Ghosted
|
eess.SP
|
25 |
5 years ago |
| 544 |
Decentralized Deep Learning using Momentum-Accelerated Consensus
Aditya Balu, Zhanhong Jiang, ... (+4 more)
|
👻
Ghosted
|
cs.LG
|
25 |
5 years ago |
| 545 |
Unrolling of Deep Graph Total Variation for Image Denoising
Huy Vu, Gene Cheung, Yonina C. Eldar
|
👻
Ghosted
|
cs.CV
|
25 |
5 years ago |
| 546 |
Online Time-Varying Topology Identification via Prediction-Correction Algorithms
Alberto Natali, Mario Coutino, ... (+2 more)
|
👻
Ghosted
|
eess.SP
|
25 |
5 years ago |
| 547 |
Interpreting glottal flow dynamics for detecting COVID-19 from voice
Soham Deshmukh, Mahmoud Al Ismail, Rita Singh
|
👻
Ghosted
|
eess.AS
|
25 |
5 years ago |
| 548 |
DEAAN: Disentangled Embedding and Adversarial Adaptation Network for Robust Speaker Representation Learning
Mufan Sang, Wei Xia, John H. L. Hansen
|
👻
Ghosted
|
eess.AS
|
25 |
5 years ago |
| 549 |
Prosody-controllable spontaneous TTS with neural HMMs
Harm Lameris, Shivam Mehta, ... (+3 more)
|
👻
Ghosted
|
eess.AS
|
25 |
3 years ago |
| 550 |
A GMM-Based Stair Quality Model for Human Perceived JPEG Images
Sudeng Hu, Haiqiang Wang, C. -C. Jay Kuo
|
👻
Ghosted
|
cs.MM
|
24 |
10 years ago |