⚰️ Sound

R.I.P. 👻 Ghosted

SonicRadiation: A Hybrid Numerical Solution for Sound Radiation without Ghost Cells

Xutong Jin, Guoping Wang, Sheng Li

cs.SD 🏛 arXiv 📚 0 cites 10 months ago

R.I.P. 👻 Ghosted

NAT: Neural Acoustic Transfer for Interactive Scenes in Real Time

Xutong Jin, Bo Pang, ... (+4 more)

cs.SD 🏛 IEEE TVCG 📚 0 cites 1 year ago

R.I.P. 👻 Ghosted

AsynFusion: Towards Asynchronous Latent Consistency Models for Decoupled Whole-Body Audio-Driven Avatars

Tianbao Zhang, Jian Zhao, ... (+6 more)

cs.SD 🏛 Pattern Recog. 📚 0 cites 1 year ago

R.I.P. 👻 Ghosted

LAV: Audio-Driven Dynamic Visual Generation with Neural Compression and StyleGAN2

Jongmin Jung, Dasaem Jeong

cs.SD 🏛 ISEA 📚 0 cites 1 year ago

R.I.P. 👻 Ghosted

Artificial intelligence in creating, representing or expressing an immersive soundscape

Rima Ayoubi, Laurent Lescop, Sang Bum Park

cs.SD 🏛 arXiv 📚 0 cites 1 year ago

R.I.P. 👻 Ghosted

Text-Driven Voice Conversion via Latent State-Space Modeling

Wen Li, Sofia Martinez, Priyanka Shah

cs.SD 🏛 arXiv 📚 0 cites 1 year ago

R.I.P. 👻 Ghosted

Seeing Sound: Assembling Sounds from Visuals for Audio-to-Image Generation

Darius Petermann, Mahdi M. Kalayeh

cs.SD 🏛 arXiv 📚 0 cites 1 year ago

R.I.P. 👻 Ghosted

Towards Practical Real-Time Low-Latency Music Source Separation

Junyu Wu, Jie Liu, ... (+3 more)

cs.SD 🏛 ICME 📚 0 cites 7 months ago

R.I.P. 👻 Ghosted

MusRec: Zero-Shot Text-to-Music Editing via Rectified Flow and Diffusion Transformers

Ali Boudaghi, Hadi Zare

cs.SD 🏛 arXiv 📚 0 cites 7 months ago

R.I.P. 👻 Ghosted

Audio-Visual Speech Enhancement In Complex Scenarios With Separation And Dereverberation Joint Modeling

Jiarong Du, Zhan Jin, ... (+5 more)

cs.SD 🏛 arXiv 📚 0 cites 7 months ago

R.I.P. 📜 Death by README

Model-Guided Dual-Role Alignment for High-Fidelity Open-Domain Video-to-Audio Generation

Kang Zhang, Trung X. Pham, ... (+4 more)

cs.SD 🏛 arXiv 📚 0 cites 7 months ago

R.I.P. 👻 Ghosted

Noise-Conditioned Mixture-of-Experts Framework for Robust Speaker Verification

Bin Gu, Haitao Zhao, Jibo Wei

cs.SD 🏛 arXiv 📚 0 cites 8 months ago

R.I.P. 👻 Ghosted

AWARE: Audio Watermarking with Adversarial Resistance to Edits

Kosta Pavlović, Lazar Stanarević, ... (+3 more)

cs.SD 🏛 arXiv 📚 0 cites 8 months ago

R.I.P. 👻 Ghosted

MotionBeat: Motion-Aligned Music Representation via Embodied Contrastive Learning and Bar-Equivariant Contact-Aware Encoding

Xuanchen Wang, Heng Wang, Weidong Cai

cs.SD 🏛 arXiv 📚 0 cites 8 months ago

R.I.P. 👻 Ghosted

Audio-Guided Visual Perception for Audio-Visual Navigation

Yi Wang, Yinfeng Yu, ... (+3 more)

cs.SD 🏛 arXiv 📚 0 cites 8 months ago

R.I.P. 👻 Ghosted

SeeingSounds: Learning Audio-to-Visual Alignment via Text

Simone Carnemolla, Matteo Pennisi, ... (+4 more)

cs.SD 🏛 ACM MM 📚 0 cites 8 months ago

R.I.P. 👻 Ghosted

Personality-Enhanced Multimodal Depression Detection in the Elderly

Honghong Wang, Jing Deng, Rong Zheng

cs.SD 🏛 ACM MM 📚 0 cites 8 months ago

R.I.P. 👻 Ghosted

AudioRole: An Audio Dataset for Character Role-Playing in Large Language Models

Wenyu Li, Xiaoqi Jiao, ... (+3 more)

cs.SD 🏛 arXiv 📚 0 cites 8 months ago

R.I.P. 👻 Ghosted

MusicWeaver: Composer-Style Structural Editing and Minute-Scale Coherent Music Generation

Xuanchen Wang, Heng Wang, Weidong Cai

cs.SD 🏛 arXiv 📚 0 cites 8 months ago

R.I.P. 👻 Ghosted

Efficient Speech Watermarking for Speech Synthesis via Progressive Knowledge Distillation

Yang Cui, Peter Pan, ... (+2 more)

cs.SD 🏛 arXiv 📚 0 cites 9 months ago

R.I.P. 👻 Ghosted

Speech-to-See: End-to-End Speech-Driven Open-Set Object Detection

Wenhuan Lu, Xinyue Song, ... (+4 more)

cs.SD 🏛 arXiv 📚 0 cites 9 months ago

R.I.P. 👻 Ghosted

On the de-duplication of the Lakh MIDI dataset

Eunjin Choi, Hyerin Kim, ... (+3 more)

cs.SD 🏛 ISMIR 📚 0 cites 9 months ago

R.I.P. 👻 Ghosted

Beyond Video-to-SFX: Video to Audio Synthesis with Environmentally Aware Speech

Xinlei Niu, Jianbo Ma, ... (+4 more)

cs.SD 🏛 arXiv 📚 0 cites 9 months ago

R.I.P. 👻 Ghosted

Two Web Toolkits for Multimodal Piano Performance Dataset Acquisition and Fingering Annotation

Junhyung Park, Yonghyun Kim, ... (+5 more)

cs.SD 🏛 arXiv 📚 0 cites 9 months ago

🏛️ The Sound Crypt