⚰️ Sound

R.I.P. 💀 404 Not Found

Two-Stream Joint-Training for Speaker Independent Acoustic-to-Articulatory Inversion

Jianrong Wang, Jinyu Liu, ... (+5 more)

cs.SD 🏛 ICASSP 📚 8 cites 3 years ago

R.I.P. 👻 Ghosted

A large-scale and PCR-referenced vocal audio dataset for COVID-19

Jobie Budd, Kieran Baker, ... (+24 more)

cs.SD 🏛 Scientific Data 📚 8 cites 3 years ago

R.I.P. 👻 Ghosted

Impact of annotation modality on label quality and model performance in the automatic assessment of laughter in-the-wild

Jose Vargas-Quiros, Laura Cabrera-Quiros, ... (+2 more)

cs.SD 🏛 IEEE TAC 📚 8 cites 3 years ago

R.I.P. 👻 Ghosted

Style-Label-Free: Cross-Speaker Style Transfer by Quantized VAE and Speaker-wise Normalization in Speech Synthesis

Chunyu Qiang, Peng Yang, ... (+3 more)

cs.SD 🏛 ICCSLP 📚 8 cites 3 years ago

R.I.P. 👻 Ghosted

Explicit Intensity Control for Accented Text-to-speech

Rui Liu, Haolin Zuo, ... (+3 more)

cs.SD 🏛 Interspeech 📚 8 cites 3 years ago

R.I.P. 👻 Ghosted

Knowledge Transfer For On-Device Speech Emotion Recognition with Neural Structured Learning

Yi Chang, Zhao Ren, ... (+3 more)

cs.SD 🏛 ICASSP 📚 8 cites 3 years ago

R.I.P. 👻 Ghosted

Learning Phone Recognition from Unpaired Audio and Phone Sequences Based on Generative Adversarial Network

Da-rong Liu, Po-chun Hsu, ... (+5 more)

cs.SD 🏛 IEEE/ACM TASLP 📚 8 cites 3 years ago

R.I.P. 👻 Ghosted

Synthesizing Personalized Non-speech Vocalization from Discrete Speech Representations

Chin-Cheng Hsu

cs.SD 🏛 arXiv 📚 8 cites 3 years ago

R.I.P. 👻 Ghosted

Acoustic Modeling for End-to-End Empathetic Dialogue Speech Synthesis Using Linguistic and Prosodic Contexts of Dialogue History

Yuto Nishimura, Yuki Saito, ... (+3 more)

cs.SD 🏛 Interspeech 📚 8 cites 4 years ago

R.I.P. 👻 Ghosted

From Environmental Sound Representation to Robustness of 2D CNN Models Against Adversarial Attacks

Mohammad Esmaeilpour, Patrick Cardinal, Alessandro Lameiras Koerich

cs.SD 🏛 Applied Acoustics 📚 8 cites 4 years ago

📚 📚 The Cartographer

Adversarial Attacks on Speech Recognition Systems for Mission-Critical Applications: A Survey

Ngoc Dung Huynh, Mohamed Reda Bouadjenek, ... (+5 more)

cs.SD 🏛 arXiv 📚 8 cites 4 years ago

R.I.P. 👻 Ghosted

GMM based multi-stage Wiener filtering for low SNR speech enhancement

Wageesha Manamperi, Prasanga N. Samarasinghe, ... (+2 more)

cs.SD 🏛 ICASE W 📚 8 cites 3 years ago

R.I.P. 👻 Ghosted

Musical Audio Similarity with Self-supervised Convolutional Neural Networks

Carl Thomé, Sebastian Piwell, Oscar Utterbäck

cs.SD 🏛 arXiv 📚 8 cites 4 years ago

R.I.P. 👻 Ghosted

The GigaMIDI Dataset with Features for Expressive Music Performance Detection

Keon Ju Maverick Lee, Jeff Ens, ... (+4 more)

cs.SD 🏛 Transactions of the International Society for Music Information Retrieval 📚 7 cites 1 year ago

R.I.P. 👻 Ghosted

Music Discovery Dialogue Generation Using Human Intent Analysis and Large Language Models

SeungHeon Doh, Keunwoo Choi, ... (+3 more)

cs.SD 🏛 ISMIR 📚 7 cites 1 year ago

R.I.P. 👻 Ghosted

DiffATR: Diffusion-based Generative Modeling for Audio-Text Retrieval

Yifei Xin, Xuxin Cheng, ... (+3 more)

cs.SD 🏛 Interspeech 📚 7 cites 1 year ago

R.I.P. 👻 Ghosted

MIDI-to-Tab: Guitar Tablature Inference via Masked Language Modeling

Drew Edwards, Xavier Riley, ... (+2 more)

cs.SD 🏛 ISMIR 📚 7 cites 1 year ago

R.I.P. 👻 Ghosted

Active Bird2Vec: Towards End-to-End Bird Sound Monitoring with Transformers

Lukas Rauch, Raphael Schwinger, ... (+4 more)

cs.SD 🏛 arXiv 📚 7 cites 2 years ago

R.I.P. 👻 Ghosted

EnchantDance: Unveiling the Potential of Music-Driven Dance Movement

Bo Han, Teng Zhang, ... (+4 more)

cs.SD 🏛 arXiv 📚 7 cites 2 years ago

R.I.P. 👻 Ghosted

Low-latency Speech Enhancement via Speech Token Generation

Huaying Xue, Xiulian Peng, Yan Lu

cs.SD 🏛 ICASSP 📚 7 cites 2 years ago

🌅 💤 Eternal Rest

Bridging High-Quality Audio and Video via Language for Sound Effects Retrieval from Visual Queries

Julia Wilkins, Justin Salamon, ... (+3 more)

cs.SD 🏛 ICASPAA W 📚 7 cites 2 years ago

R.I.P. 👻 Ghosted

MSAC: Multiple Speech Attribute Control Method for Reliable Speech Emotion Recognition

Yu Pan, Yuguang Yang, ... (+7 more)

cs.SD 🏛 arXiv 📚 7 cites 2 years ago

R.I.P. 👻 Ghosted

Transavs: End-To-End Audio-Visual Segmentation With Transformer

Yuhang Ling, Yuxi Li, ... (+4 more)

cs.SD 🏛 ICASSP 📚 7 cites 3 years ago

R.I.P. 👻 Ghosted

Multi-stream Convolutional Neural Network with Frequency Selection for Robust Speaker Verification

Wei Yao, Shen Chen, ... (+2 more)

cs.SD 🏛 Computing and informatics 📚 7 cites 5 years ago

🏛️ The Sound Crypt