SCDF: A Speaker Characteristics DeepFake Speech Dataset for Bias Analysis

August 11, 2025 ยท Declared Dead ยท ๐Ÿ› Biometrics and Electronic Signatures

๐Ÿ‘ป CAUSE OF DEATH: Ghosted
No code link whatsoever

"No code URL or promise found in abstract"

Evidence collected by the PWNC Scanner

Authors Vojtฤ›ch Stanฤ›k, Karel Srna, Anton Firc, Kamil Malinka arXiv ID 2508.07944 Category cs.SD: Sound Cross-listed cs.AI, cs.CR Citations 0 Venue Biometrics and Electronic Signatures Last Checked 4 months ago
Abstract
Despite growing attention to deepfake speech detection, the aspects of bias and fairness remain underexplored in the speech domain. To address this gap, we introduce the Speaker Characteristics Deepfake (SCDF) dataset: a novel, richly annotated resource enabling systematic evaluation of demographic biases in deepfake speech detection. SCDF contains over 237,000 utterances in a balanced representation of both male and female speakers spanning five languages and a wide age range. We evaluate several state-of-the-art detectors and show that speaker characteristics significantly influence detection performance, revealing disparities across sex, language, age, and synthesizer type. These findings highlight the need for bias-aware development and provide a foundation for building non-discriminatory deepfake detection systems aligned with ethical and regulatory standards.
Community shame:
Not yet rated
Community Contributions

Found the code? Know the venue? Think something is wrong? Let us know!

๐Ÿ“œ Similar Papers

In the same crypt โ€” Sound

Died the same way โ€” ๐Ÿ‘ป Ghosted