🏛️ The Sound Crypt
cs.SD: Where Sound papers rest without their code.
2574 Total Papers
1771 No Code
61 Twilight
742 Has Code
28.8% Survival Rate
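The page does not say how the survival rate is derived. As a rough sanity check, the sketch below assumes it is the share of papers that have code (Has Code divided by Total Papers), and that No Code, Twilight, and Has Code are mutually exclusive buckets covering every paper. Both are assumptions, though they agree with the figures shown. The variable names are illustrative only.

```python
# Counts copied from the stats header of the page.
total_papers = 2574
no_code = 1771
twilight = 61
has_code = 742

# Assumption: the three status buckets partition the full set of papers.
buckets_sum = no_code + twilight + has_code
print(buckets_sum == total_papers)

# Assumption: "Survival Rate" = papers with code / total papers.
survival_rate = has_code / total_papers
print(f"{survival_rate:.1%}")
```

Under these assumptions the buckets add up to the total, and the ratio rounds to the 28.8% shown in the header.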
R.I.P.
👻
Ghosted
Reimagining Dance: Real-time Music Co-creation between Dancers and AI
R.I.P.
👻
Ghosted
Insights on Harmonic Tones from a Generative Music Experiment
R.I.P.
👻
Ghosted
UltrasonicSpheres: Localized, Multi-Channel Sound Spheres Using Off-the-Shelf Speakers and Earables
R.I.P.
👻
Ghosted
FeatureSense: Protecting Speaker Attributes in Always-On Audio Sensing System
R.I.P.
👻
Ghosted
Do Captioning Metrics Reflect Music Semantic Alignment?
R.I.P.
👻
Ghosted
Towards Robust Transcription: Exploring Noise Injection Strategies for Training Data Augmentation
R.I.P.
👻
Ghosted
EAViT: External Attention Vision Transformer for Audio Classification
🌅
💤
Eternal Rest
Towards Training Music Taggers on Synthetic Data
R.I.P.
👻
Ghosted
Lla-VAP: LSTM Ensemble of Llama and VAP for Turn-Taking Prediction
R.I.P.
👻
Ghosted
Composers' Evaluations of an AI Music Tool: Insights for Human-Centred Design
R.I.P.
👻
Ghosted
AI TrackMate: Finally, Someone Who Will Give Your Music More Than Just "Sounds Great!"
R.I.P.
👻
Ghosted
Embodied Exploration of Latent Spaces and Explainable AI
R.I.P.
👻
Ghosted
M6(GPT)3: Generating Multitrack Modifiable Multi-Minute MIDI Music from Text using Genetic algorithms, Probabilistic methods and GPT Models in any Progression and Time Signature
R.I.P.
👻
Ghosted
Optimizing Dysarthria Wake-Up Word Spotting: An End-to-End Approach for SLT 2024 LRDWWS Challenge
R.I.P.
👻
Ghosted
MetaBGM: Dynamic Soundtrack Transformation For Continuous Multi-Scene Experiences With Ambient Awareness And Personalization
R.I.P.
👻
Ghosted
CoopASD: Cooperative Machine Anomalous Sound Detection with Privacy Concerns
R.I.P.
👻
Ghosted
Information and motor constraints shape melodic diversity across cultures
R.I.P.
👻
Ghosted
MIDGET: Music Conditioned 3D Dance Generation
R.I.P.
👻
Ghosted
SP-MCQA: Evaluating Intelligibility of TTS Beyond the Word Level
R.I.P.
👻
Ghosted
U-Codec: Ultra Low Frame-rate Neural Speech Codec for Fast High-fidelity Speech Generation
R.I.P.
⚰️
The Empty Tomb
Diffusion-Link: Diffusion Probabilistic Model for Bridging the Audio-Text Modality Gap
R.I.P.
👻
Ghosted
XLSR-Kanformer: A KAN-Intergrated model for Synthetic Speech Detection
R.I.P.
👻
Ghosted