⚰️ Sound

R.I.P. 👻 Ghosted

Early Detection of Furniture-Infesting Wood-Boring Beetles Using CNN-LSTM Networks and MFCC-Based Acoustic Features

J. M. Chan Sri Manukalpa, H. S. Bopage, ... (+2 more)

cs.SD 🏛 arXiv 📚 1 cites 11 months ago

R.I.P. 👻 Ghosted

Reimagining Dance: Real-time Music Co-creation between Dancers and AI

Olga Vechtomova, Jeff Bos

cs.SD 🏛 arXiv 📚 1 cites 1 year ago

R.I.P. 👻 Ghosted

Insights on Harmonic Tones from a Generative Music Experiment

Emmanuel Deruty, Maarten Grachten

cs.SD 🏛 arXiv 📚 1 cites 1 year ago

R.I.P. 👻 Ghosted

UltrasonicSpheres: Localized, Multi-Channel Sound Spheres Using Off-the-Shelf Speakers and Earables

Michael Küttner, Valeria Zitz, ... (+3 more)

cs.SD 🏛 UbiComp Companion 📚 1 cites 1 year ago

R.I.P. 👻 Ghosted

FeatureSense: Protecting Speaker Attributes in Always-On Audio Sensing System

Bhawana Chhaglani, Sarmistha Sarna Gomasta, ... (+3 more)

cs.SD 🏛 arXiv 📚 1 cites 1 year ago

R.I.P. 👻 Ghosted

Do Captioning Metrics Reflect Music Semantic Alignment?

Jinwoo Lee, Kyogu Lee

cs.SD 🏛 arXiv 📚 1 cites 1 year ago

R.I.P. 👻 Ghosted

Towards Robust Transcription: Exploring Noise Injection Strategies for Training Data Augmentation

Yonghyun Kim, Alexander Lerch

cs.SD 🏛 arXiv 📚 1 cites 1 year ago

R.I.P. 👻 Ghosted

EAViT: External Attention Vision Transformer for Audio Classification

Aquib Iqbal, Abid Hasan Zim, ... (+4 more)

cs.SD 🏛 APSIPA 📚 1 cites 1 year ago

🌅 💤 Eternal Rest

Towards Training Music Taggers on Synthetic Data

Nadine Kroher, Steven Manangu, Aggelos Pikrakis

cs.SD 🏛 ICCMI 📚 1 cites 1 year ago

R.I.P. 👻 Ghosted

Lla-VAP: LSTM Ensemble of Llama and VAP for Turn-Taking Prediction

Hyunbae Jeon, Frederic Guintu, Rayvant Sahni

cs.SD 🏛 arXiv 📚 1 cites 1 year ago

R.I.P. 👻 Ghosted

Composers' Evaluations of an AI Music Tool: Insights for Human-Centred Design

Eleanor Row, György Fazekas

cs.SD 🏛 arXiv 📚 1 cites 1 year ago

R.I.P. 👻 Ghosted

AI TrackMate: Finally, Someone Who Will Give Your Music More Than Just "Sounds Great!"

Yi-Lin Jiang, Chia-Ho Hsiung, ... (+3 more)

cs.SD 🏛 arXiv 📚 1 cites 1 year ago

R.I.P. 👻 Ghosted

Embodied Exploration of Latent Spaces and Explainable AI

Elizabeth Wilson, Mika Satomi, ... (+3 more)

cs.SD 🏛 arXiv 📚 1 cites 1 year ago

R.I.P. 👻 Ghosted

M6(GPT)3: Generating Multitrack Modifiable Multi-Minute MIDI Music from Text using Genetic algorithms, Probabilistic methods and GPT Models in any Progression and Time Signature

Jakub Poćwiardowski, Mateusz Modrzejewski, Marek S. Tatara

cs.SD 🏛 ICME W 📚 1 cites 1 year ago

R.I.P. 👻 Ghosted

Optimizing Dysarthria Wake-Up Word Spotting: An End-to-End Approach for SLT 2024 LRDWWS Challenge

Shuiyun Liu, Yuxiang Kong, ... (+5 more)

cs.SD 🏛 SLT 📚 1 cites 1 year ago

R.I.P. 👻 Ghosted

MetaBGM: Dynamic Soundtrack Transformation For Continuous Multi-Scene Experiences With Ambient Awareness And Personalization

Haoxuan Liu, Zihao Wang, ... (+6 more)

cs.SD 🏛 arXiv 📚 1 cites 1 year ago

R.I.P. 👻 Ghosted

CoopASD: Cooperative Machine Anomalous Sound Detection with Privacy Concerns

Anbai Jiang, Yuchen Shi, ... (+3 more)

cs.SD 🏛 GLOBECOM 📚 1 cites 1 year ago

R.I.P. 👻 Ghosted

Information and motor constraints shape melodic diversity across cultures

John M McBride, Nahie Kim, ... (+4 more)

cs.SD 🏛 arXiv 📚 1 cites 1 year ago

R.I.P. 👻 Ghosted

MIDGET: Music Conditioned 3D Dance Generation

Jinwu Wang, Wei Mao, Miaomiao Liu

cs.SD 🏛 Applied Informatics 📚 1 cites 2 years ago

R.I.P. 👻 Ghosted

SP-MCQA: Evaluating Intelligibility of TTS Beyond the Word Level

Hitomi Jin Ling Tee, Chaoren Wang, ... (+2 more)

cs.SD 🏛 arXiv 📚 1 cites 7 months ago

R.I.P. 👻 Ghosted

U-Codec: Ultra Low Frame-rate Neural Speech Codec for Fast High-fidelity Speech Generation

Xusheng Yang, Long Zhou, ... (+7 more)

cs.SD 🏛 arXiv 📚 1 cites 8 months ago

R.I.P. ⚰️ The Empty Tomb

Diffusion-Link: Diffusion Probabilistic Model for Bridging the Audio-Text Modality Gap

KiHyun Nam, Jongmin Choi, ... (+3 more)

cs.SD 🏛 arXiv 📚 1 cites 8 months ago

R.I.P. 👻 Ghosted

XLSR-Kanformer: A KAN-Intergrated model for Synthetic Speech Detection

Phuong Tuan Dat, Tran Huy Dat

cs.SD 🏛 Advanced Video and Signal Based Surveillance 📚 1 cites 8 months ago

R.I.P. 👻 Ghosted

Novel Loss-Enhanced Universal Adversarial Patches for Sustainable Speaker Privacy

Elvir Karimov, Alexander Varlamov, ... (+3 more)

cs.SD 🏛 Interspeech 📚 1 cites 1 year ago

🏛️ The Sound Crypt