Improved Vocal Effort Transfer Vector Estimation for Vocal Effort-Robust Speaker Verification

May 03, 2023 Β· Declared Dead Β· πŸ› International Workshop on Machine Learning for Signal Processing

πŸ‘» CAUSE OF DEATH: Ghosted
No code link whatsoever

"No code URL or promise found in abstract"

Evidence collected by the PWNC Scanner

Authors IvΓ‘n LΓ³pez-Espejo, Santi Prieto, Alfonso Ortega, Eduardo Lleida arXiv ID 2305.02147 Category eess.AS: Audio & Speech Cross-listed cs.HC Citations 0 Venue International Workshop on Machine Learning for Signal Processing Last Checked 3 months ago
Abstract
Despite the maturity of modern speaker verification technology, its performance still significantly degrades when facing non-neutrally-phonated (e.g., shouted and whispered) speech. To address this issue, in this paper, we propose a new speaker embedding compensation method based on a minimum mean square error (MMSE) estimator. This method models the joint distribution of the vocal effort transfer vector and non-neutrally-phonated embedding spaces and operates in a principal component analysis domain to cope with non-neutrally-phonated speech data scarcity. Experiments are carried out using a cutting-edge speaker verification system integrating a powerful self-supervised pre-trained model for speech representation. In comparison with a state-of-the-art embedding compensation method, the proposed MMSE estimator yields superior and competitive equal error rate results when tackling shouted and whispered speech, respectively.
Community shame:
Not yet rated
Community Contributions

Found the code? Know the venue? Think something is wrong? Let us know!

πŸ“œ Similar Papers

In the same crypt β€” Audio & Speech

Died the same way β€” πŸ‘» Ghosted