⚰️ Sound

R.I.P. 👻 Ghosted

Parrot-Trained Adversarial Examples: Pushing the Practicality of Black-Box Audio Attacks against Speaker Recognition Models

Rui Duan, Zhe Qu, ... (+3 more)

cs.SD 🏛 arXiv 📚 1 cites 2 years ago

R.I.P. 👻 Ghosted

Weak Alignment Supervision from Hybrid Model Improves End-to-end ASR

Jintao Jiang, Yingbo Gao, Zoltan Tuske

cs.SD 🏛 arXiv 📚 1 cites 2 years ago

R.I.P. 👻 Ghosted

EmoDiarize: Speaker Diarization and Emotion Identification from Speech Signals using Convolutional Neural Networks

Hanan Hamza, Fiza Gafoor, ... (+3 more)

cs.SD 🏛 arXiv 📚 1 cites 2 years ago

R.I.P. 👻 Ghosted

Psychoacoustic Challenges Of Speech Enhancement On VoIP Platforms

Joseph Konan, Shikhar Agnihotri, ... (+5 more)

cs.SD 🏛 Synthetic Data’s Transformative Role in Foundational Speech Models 📚 1 cites 2 years ago

R.I.P. 👻 Ghosted

Comparative Analysis of Transfer Learning in Deep Learning Text-to-Speech Models on a Few-Shot, Low-Resource, Customized Dataset

Ze Liu

cs.SD 🏛 arXiv 📚 1 cites 2 years ago

R.I.P. 👻 Ghosted

On the Relation between Internal Language Model and Sequence Discriminative Training for Neural Transducers

Zijian Yang, Wei Zhou, ... (+2 more)

cs.SD 🏛 ICASSP 📚 1 cites 2 years ago

R.I.P. 👻 Ghosted

Audio Contrastive-based Fine-tuning: Decoupling Representation Learning and Classification

Yang Wang, Qibin Liang, ... (+4 more)

cs.SD 🏛 arXiv 📚 1 cites 2 years ago

🌅 💤 Eternal Rest

Differentially Private Adapters for Parameter Efficient Acoustic Modeling

Chun-Wei Ho, Chao-Han Huck Yang, Sabato Marco Siniscalchi

cs.SD 🏛 Interspeech 📚 1 cites 3 years ago

R.I.P. 👻 Ghosted

Barwise Music Structure Analysis with the Correlation Block-Matching Segmentation Algorithm

Axel Marmoret, Jérémy E. Cohen, Frédéric Bimbot

cs.SD 🏛 Transactions of the International Society for Music Information Retrieval 📚 1 cites 2 years ago

R.I.P. 👻 Ghosted

Music Augmentation and Denoising For Peak-Based Audio Fingerprinting

Kamil Akesbi, Dorian Desblancs, Benjamin Martin

cs.SD 🏛 arXiv 📚 1 cites 2 years ago

R.I.P. 👻 Ghosted

A Long-Tail Friendly Representation Framework for Artist and Music Similarity

Haoran Xiang, Junyu Dai, ... (+2 more)

cs.SD 🏛 arXiv 📚 1 cites 2 years ago

R.I.P. 👻 Ghosted

Transfer Learning with Semi-Supervised Dataset Annotation for Birdcall Classification

Anthony Miyaguchi, Nathan Zhong, ... (+2 more)

cs.SD 🏛 CLEF 📚 1 cites 2 years ago

R.I.P. 👻 Ghosted

An Order-Complexity Model for Aesthetic Quality Assessment of Homophony Music Performance

Xin Jin, Wu Zhou, ... (+4 more)

cs.SD 🏛 ICME W 📚 1 cites 3 years ago

R.I.P. 💀 404 Not Found

Approach to Learning Generalized Audio Representation Through Batch Embedding Covariance Regularization and Constant-Q Transforms

Ankit Shah, Shuyi Chen, ... (+3 more)

cs.SD 🏛 arXiv 📚 1 cites 3 years ago

R.I.P. 👻 Ghosted

Model Extraction Attack against Self-supervised Speech Models

Tsu-Yuan Hsu, Chen-An Li, ... (+2 more)

cs.SD 🏛 arXiv 📚 1 cites 3 years ago

R.I.P. 👻 Ghosted

Efficient Incremental Text-to-Speech on GPUs

Muyang Du, Chuan Liu, ... (+2 more)

cs.SD 🏛 APSIPA 📚 1 cites 3 years ago

R.I.P. 👻 Ghosted

Conditional variational autoencoder to improve neural audio synthesis for polyphonic music sound

Seokjin Lee, Minhan Kim, ... (+4 more)

cs.SD 🏛 arXiv 📚 1 cites 3 years ago

R.I.P. 👻 Ghosted

Exploiting Device and Audio Data to Tag Music with User-Aware Listening Contexts

Karim M. Ibrahim, Elena V. Epure, ... (+2 more)

cs.SD 🏛 ISMIR 📚 1 cites 3 years ago

R.I.P. 👻 Ghosted

"Seeing Sound": Audio Classification with the Wigner-Ville Distribution and Convolutional Neural Networks

Antonios Marios Christonasis, Stef van Eijndhoven, Peter Duin

cs.SD 🏛 Intelligent Systems with Applications 📚 1 cites 3 years ago

R.I.P. 👻 Ghosted

Predicting phoneme-level prosody latents using AR and flow-based Prior Networks for expressive speech synthesis

Konstantinos Klapsas, Karolos Nikitaras, ... (+6 more)

cs.SD 🏛 arXiv 📚 1 cites 3 years ago

R.I.P. 👻 Ghosted

Learning utterance-level representations through token-level acoustic latents prediction for Expressive Speech Synthesis

Karolos Nikitaras, Konstantinos Klapsas, ... (+7 more)

cs.SD 🏛 arXiv 📚 1 cites 3 years ago

R.I.P. 👻 Ghosted

Learning Invariant Representation and Risk Minimized for Unsupervised Accent Domain Adaptation

Chendong Zhao, Jianzong Wang, ... (+3 more)

cs.SD 🏛 SLT 📚 1 cites 3 years ago

R.I.P. 👻 Ghosted

Individualized Conditioning and Negative Distances for Speaker Separation

Tao Sun, Nidal Abuhajar, ... (+6 more)

cs.SD 🏛 ICMLA 📚 1 cites 3 years ago

R.I.P. 👻 Ghosted

THUEE system description for NIST 2020 SRE CTS challenge

Yu Zheng, Jinghan Peng, ... (+8 more)

cs.SD 🏛 arXiv 📚 1 cites 3 years ago

🏛️ The Sound Crypt