R.I.P.
👻
Ghosted
JaCappella Corpus: A Japanese a Cappella Vocal Ensemble Corpus
November 29, 2022 · 🏛 IEEE International Conference on Acoustics, Speech, and Signal Processing
"No code URL or promise found in abstract"
"HuggingFace models found (backfill)"
Evidence collected by the PWNC Scanner
Authors
Tomohiko Nakamura, Shinnosuke Takamichi, Naoko Tanji, Satoru Fukayama, Hiroshi Saruwatari
arXiv ID
2211.16028
Category
eess.AS: Audio & Speech
Cross-listed
cs.LG,
cs.SD
Citations
11
Venue
IEEE International Conference on Acoustics, Speech, and Signal Processing
Repository
https://huggingface.co/jaCappella/MRDLA_jaCappella_VES_48k
Last Checked
12 days ago
Abstract
We construct a corpus of Japanese a cappella vocal ensembles (jaCappella corpus) for vocal ensemble separation and synthesis. It consists of 35 copyright-cleared vocal ensemble songs and their audio recordings of individual voice parts. These songs were arranged from out-of-copyright Japanese children's songs and have six voice parts (lead vocal, soprano, alto, tenor, bass, and vocal percussion). They are divided into seven subsets, each of which features typical characteristics of a music genre such as jazz and enka. The variety in genre and voice part match vocal ensembles recently widespread in social media services such as YouTube, although the main targets of conventional vocal ensemble datasets are choral singing made up of soprano, alto, tenor, and bass. Experimental evaluation demonstrates that our corpus is a challenging resource for vocal ensemble separation. Our corpus is available on our project page (https://tomohikonakamura.github.io/jaCappella_corpus/).
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
📜 Similar Papers
In the same crypt — Audio & Speech
R.I.P.
👻
Ghosted
LPCNet: Improving Neural Speech Synthesis Through Linear Prediction
R.I.P.
👻
Ghosted
VoiceFilter: Targeted Voice Separation by Speaker-Conditioned Spectrogram Masking
R.I.P.
👻
Ghosted
TERA: Self-Supervised Learning of Transformer Encoder Representation for Speech
R.I.P.
👻
Ghosted
Mockingjay: Unsupervised Speech Representation Learning with Deep Bidirectional Transformer Encoders
R.I.P.
👻
Ghosted