Analyzing the Impact of Splicing Artifacts in Partially Fake Speech Signals

August 25, 2024 ยท Declared Dead ยท ๐Ÿ› The Automatic Speaker Verification Spoofing Countermeasures Workshop (ASVspoof 2024)

๐Ÿ‘ป CAUSE OF DEATH: Ghosted
No code link whatsoever

"No code URL or promise found in abstract"

Evidence collected by the PWNC Scanner

Authors Viola Negroni, Davide Salvi, Paolo Bestagini, Stefano Tubaro arXiv ID 2408.13784 Category cs.SD: Sound Cross-listed cs.AI, cs.MM, eess.AS Citations 8 Venue The Automatic Speaker Verification Spoofing Countermeasures Workshop (ASVspoof 2024) Last Checked 3 months ago
Abstract
Speech deepfake detection has recently gained significant attention within the multimedia forensics community. Related issues have also been explored, such as the identification of partially fake signals, i.e., tracks that include both real and fake speech segments. However, generating high-quality spliced audio is not as straightforward as it may appear. Spliced signals are typically created through basic signal concatenation. This process could introduce noticeable artifacts that can make the generated data easier to detect. We analyze spliced audio tracks resulting from signal concatenation, investigate their artifacts and assess whether such artifacts introduce any bias in existing datasets. Our findings reveal that by analyzing splicing artifacts, we can achieve a detection EER of 6.16% and 7.36% on PartialSpoof and HAD datasets, respectively, without needing to train any detector. These results underscore the complexities of generating reliable spliced audio data and lead to discussions that can help improve future research in this area.
Community shame:
Not yet rated
Community Contributions

Found the code? Know the venue? Think something is wrong? Let us know!

๐Ÿ“œ Similar Papers

In the same crypt โ€” Sound

Died the same way โ€” ๐Ÿ‘ป Ghosted