⚰️ Sound

R.I.P. 👻 Ghosted

Efficient Video to Audio Mapper with Visual Scene Detection

Mingjing Yi, Ming Li

cs.SD 🏛 APSIPA 📚 6 cites 1 year ago

R.I.P. 👻 Ghosted

Rhythmic Foley: A Framework For Seamless Audio-Visual Alignment In Video-to-Audio Synthesis

Zhiqi Huang, Dan Luo, ... (+4 more)

cs.SD 🏛 ICASSP 📚 6 cites 1 year ago

R.I.P. 👻 Ghosted

FoVNet: Configurable Field-of-View Speech Enhancement with Low Computation and Distortion for Smart Glasses

Zhongweiyang Xu, Ali Aroudi, ... (+5 more)

cs.SD 🏛 Interspeech 📚 6 cites 1 year ago

R.I.P. 💀 404 Not Found

SaMoye: Zero-shot Singing Voice Conversion Model Based on Feature Disentanglement and Enhancement

Zihao Wang, Le Ma, ... (+4 more)

cs.SD 🏛 arXiv 📚 6 cites 1 year ago

R.I.P. 👻 Ghosted

HybridVC: Efficient Voice Style Conversion with Text and Audio Prompts

Xinlei Niu, Jing Zhang, Charles Patrick Martin

cs.SD 🏛 Interspeech 📚 6 cites 2 years ago

R.I.P. 👻 Ghosted

Anchor-aware Deep Metric Learning for Audio-visual Retrieval

Donghuo Zeng, Yanan Wang, ... (+2 more)

cs.SD 🏛 ICMR 📚 6 cites 2 years ago

R.I.P. 👻 Ghosted

SongBsAb: A Dual Prevention Approach against Singing Voice Conversion based Illegal Song Covers

Guangke Chen, Yedi Zhang, ... (+4 more)

cs.SD 🏛 NDSS 📚 6 cites 2 years ago

📚 📚 The Cartographer

A review-based study on different Text-to-Speech technologies

Md. Jalal Uddin Chowdhury, Ashab Hussan

cs.SD 🏛 arXiv 📚 6 cites 2 years ago

🌅 💤 Eternal Rest

D4AM: A General Denoising Framework for Downstream Acoustic Models

Chi-Chang Lee, Yu Tsao, ... (+2 more)

cs.SD 🏛 ICLR 📚 6 cites 2 years ago

R.I.P. 👻 Ghosted

Seq2seq for Automatic Paraphasia Detection in Aphasic Speech

Matthew Perez, Duc Le, ... (+4 more)

cs.SD 🏛 arXiv 📚 6 cites 2 years ago

R.I.P. 👻 Ghosted

Audio-visual fine-tuning of audio-only ASR models

Avner May, Dmitriy Serdyuk, ... (+3 more)

cs.SD 🏛 arXiv 📚 6 cites 2 years ago

R.I.P. 👻 Ghosted

On The Open Prompt Challenge In Conditional Audio Generation

Ernie Chang, Sidd Srinivasan, ... (+9 more)

cs.SD 🏛 ICASSP 📚 6 cites 2 years ago

R.I.P. 👻 Ghosted

High-Fidelity Speech Synthesis with Minimal Supervision: All Using Diffusion Models

Chunyu Qiang, Hao Li, ... (+5 more)

cs.SD 🏛 ICASSP 📚 6 cites 2 years ago

R.I.P. 👻 Ghosted

Representation Learning for Audio Privacy Preservation using Source Separation and Robust Adversarial Learning

Diep Luong, Minh Tran, ... (+3 more)

cs.SD 🏛 WASPAA 📚 6 cites 2 years ago

R.I.P. 👻 Ghosted

ASTER: Automatic Speech Recognition System Accessibility Testing for Stutterers

Yi Liu, Yuekang Li, ... (+8 more)

cs.SD 🏛 ASE 📚 6 cites 2 years ago

R.I.P. 👻 Ghosted

Robust and lightweight audio fingerprint for Automatic Content Recognition

Anoubhav Agarwaal, Prabhat Kanaujia, ... (+2 more)

cs.SD 🏛 arXiv 📚 6 cites 3 years ago

R.I.P. 👻 Ghosted

Heterogeneous Graph Learning for Acoustic Event Classification

Amir Shirian, Mona Ahmadian, ... (+2 more)

cs.SD 🏛 ICASSP 📚 6 cites 3 years ago

R.I.P. 👻 Ghosted

TimbreCLIP: Connecting Timbre to Text and Images

Nicolas Jonason, Bob L. T. Sturm

cs.SD 🏛 arXiv 📚 6 cites 3 years ago

🌅 💤 Eternal Rest

Vis2Mus: Exploring Multimodal Representation Mapping for Controllable Music Generation

Runbang Zhang, Yixiao Zhang, ... (+3 more)

cs.SD 🏛 arXiv 📚 6 cites 3 years ago

R.I.P. 👻 Ghosted

Fast and efficient speech enhancement with variational autoencoders

Mostafa Sadeghi, Romain Serizel

cs.SD 🏛 ICASSP 📚 6 cites 3 years ago

R.I.P. 👻 Ghosted

Back-Translation-Style Data Augmentation for Mandarin Chinese Polyphone Disambiguation

Chunyu Qiang, Peng Yang, ... (+4 more)

cs.SD 🏛 APSIPA 📚 6 cites 3 years ago

R.I.P. 👻 Ghosted

Large-scale learning of generalised representations for speaker recognition

Jee-weon Jung, Hee-Soo Heo, ... (+6 more)

cs.SD 🏛 arXiv 📚 6 cites 3 years ago

R.I.P. 💀 404 Not Found

The Efficacy of Self-Supervised Speech Models for Audio Representations

Tung-Yu Wu, Chen-An Li, ... (+3 more)

cs.SD 🏛 NeurIPS 📚 6 cites 3 years ago

R.I.P. 👻 Ghosted

Faked Speech Detection with Zero Prior Knowledge

Sahar Al Ajmi, Khizar Hayat, ... (+4 more)

cs.SD 🏛 Discover Applied Sciences 📚 6 cites 3 years ago

🏛️ The Sound Crypt