⚰️ Sound

R.I.P. 👻 Ghosted

Content Adaptive Front End For Audio Classification

Prateek Verma, Chris Chafe

cs.SD 🏛 arXiv 📚 0 cites 3 years ago

R.I.P. 👻 Ghosted

Time-frequency Network for Robust Speaker Recognition

Jiguo Li, Tianzi Zhang, ... (+2 more)

cs.SD 🏛 arXiv 📚 0 cites 3 years ago

R.I.P. 👻 Ghosted

Automated Arrangements of Multi-Part Music for Sets of Monophonic Instruments

Matthew Mccloskey, Gabrielle Curcio, ... (+3 more)

cs.SD 🏛 Computer Music Modeling and Retrieval 📚 0 cites 3 years ago

R.I.P. 👻 Ghosted

Audio Latent Space Cartography

Nicolas Jonason, Bob L. T. Sturm

cs.SD 🏛 arXiv 📚 0 cites 3 years ago

R.I.P. 👻 Ghosted

SIMD-size aware weight regularization for fast neural vocoding on CPU

Hiroki Kanagawa, Yusuke Ijima

cs.SD 🏛 SLT 📚 0 cites 3 years ago

R.I.P. 👻 Ghosted

Self-Supervised Hierarchical Metrical Structure Modeling

Junyan Jiang, Gus Xia

cs.SD 🏛 arXiv 📚 0 cites 3 years ago

R.I.P. 👻 Ghosted

Speech MOS multi-task learning and rater bias correction

Haleh Akrami, Hannes Gamper

cs.SD 🏛 ICASSP 📚 0 cites 3 years ago

R.I.P. 👻 Ghosted

Adaptive Speech Quality Aware Complex Neural Network for Acoustic Echo Cancellation with Supervised Contrastive Learning

Bozhong Liu, Xiaoxi Yu, Hantao Huang

cs.SD 🏛 arXiv 📚 0 cites 3 years ago

R.I.P. 👻 Ghosted

Relating Human Perception of Musicality to Prediction in a Predictive Coding Model

Nikolas McNeal, Jennifer Huang, ... (+5 more)

cs.SD 🏛 arXiv 📚 0 cites 3 years ago

R.I.P. 👻 Ghosted

Spectrograms Are Sequences of Patches

Leyi Zhao, Yi Li

cs.SD 🏛 arXiv 📚 0 cites 3 years ago

R.I.P. 👻 Ghosted

V-Cloak: Intelligibility-, Naturalness- & Timbre-Preserving Real-Time Voice Anonymization

Jiangyi Deng, Fei Teng, ... (+4 more)

cs.SD 🏛 arXiv 📚 0 cites 3 years ago

R.I.P. 👻 Ghosted

Adaptive re-calibration of channel-wise features for Adversarial Audio Classification

Vardhan Dongre, Abhinav Thimma Reddy, Nikhitha Reddeddy

cs.SD 🏛 arXiv 📚 0 cites 3 years ago

R.I.P. 👻 Ghosted

Speech Enhancement with Perceptually-motivated Optimization and Dual Transformations

Xucheng Wan, Kai Liu, ... (+2 more)

cs.SD 🏛 APSIPA 📚 0 cites 3 years ago

R.I.P. 👻 Ghosted

Dynamic Time-Alignment of Dimensional Annotations of Emotion using Recurrent Neural Networks

Sina Alisamir, Fabien Ringeval, Francois Portet

cs.SD 🏛 arXiv 📚 0 cites 3 years ago

R.I.P. 👻 Ghosted

SSCFormer: Push the Limit of Chunk-wise Conformer for Streaming ASR Using Sequentially Sampled Chunks and Chunked Causal Convolution

Fangyuan Wang, Bo Xu, Bo Xu

cs.SD 🏛 IEEE SPL 📚 0 cites 3 years ago

R.I.P. 👻 Ghosted

Dynamic Kernels and Channel Attention for Low Resource Speaker Verification

Anna Ollerenshaw, Md Asif Jalal, Thomas Hain

cs.SD 🏛 arXiv 📚 0 cites 3 years ago

R.I.P. 👻 Ghosted

Towards zero-shot Text-based voice editing using acoustic context conditioning, utterance embeddings, and reference encoders

Jason Fong, Yun Wang, ... (+5 more)

cs.SD 🏛 arXiv 📚 0 cites 3 years ago

R.I.P. 👻 Ghosted

Pronunciation Generation for Foreign Language Words in Intra-Sentential Code-Switching Speech Recognition

Wei Wang, Chao Zhang, Xiaopei Wu

cs.SD 🏛 arXiv 📚 0 cites 3 years ago

R.I.P. 👻 Ghosted

Speaker- and Age-Invariant Training for Child Acoustic Modeling Using Adversarial Multi-Task Learning

Mostafa Shahin, Beena Ahmed, Julien Epps

cs.SD 🏛 arXiv 📚 0 cites 3 years ago

R.I.P. 👻 Ghosted

Read it to me: An emotionally aware Speech Narration Application

Rishibha Bansal

cs.SD 🏛 arXiv 📚 0 cites 3 years ago

R.I.P. 👻 Ghosted

Bridging Music and Text with Crowdsourced Music Comments: A Sequence-to-Sequence Framework for Thematic Music Comments Generation

Peining Zhang, Junliang Guo, ... (+3 more)

cs.SD 🏛 arXiv 📚 0 cites 3 years ago

R.I.P. 👻 Ghosted

Chronological Self-Training for Real-Time Speaker Diarization

Dirk Padfield, Daniel J. Liebling

cs.SD 🏛 Interspeech 📚 0 cites 3 years ago

R.I.P. 👻 Ghosted

WOLONet: Wave Outlooker for Efficient and High Fidelity Speech Synthesis

Yi Wang, Yi Si

cs.SD 🏛 arXiv 📚 0 cites 4 years ago

R.I.P. 👻 Ghosted

SA: Sliding attack for synthetic speech detection with resistance to clipping and self-splicing

Deng JiaCheng, Dong Li, ... (+3 more)

cs.SD 🏛 arXiv 📚 0 cites 3 years ago

🏛️ The Sound Crypt