Listening Between the Lines: Synthetic Speech Detection Disregarding Verbal Content

February 08, 2024 ยท Declared Dead ยท ๐Ÿ› 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops (ICASSPW)

๐Ÿ‘ป CAUSE OF DEATH: Ghosted
No code link whatsoever

"No code URL or promise found in abstract"

Evidence collected by the PWNC Scanner

Authors Davide Salvi, Temesgen Semu Balcha, Paolo Bestagini, Stefano Tubaro arXiv ID 2402.05567 Category cs.SD: Sound Cross-listed cs.MM, eess.AS Citations 9 Venue 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops (ICASSPW) Last Checked 3 months ago
Abstract
Recent advancements in synthetic speech generation have led to the creation of forged audio data that are almost indistinguishable from real speech. This phenomenon poses a new challenge for the multimedia forensics community, as the misuse of synthetic media can potentially cause adverse consequences. Several methods have been proposed in the literature to mitigate potential risks and detect synthetic speech, mainly focusing on the analysis of the speech itself. However, recent studies have revealed that the most crucial frequency bands for detection lie in the highest ranges (above 6000 Hz), which do not include any speech content. In this work, we extensively explore this aspect and investigate whether synthetic speech detection can be performed by focusing only on the background component of the signal while disregarding its verbal content. Our findings indicate that the speech component is not the predominant factor in performing synthetic speech detection. These insights provide valuable guidance for the development of new synthetic speech detectors and their interpretability, together with some considerations on the existing work in the audio forensics field.
Community shame:
Not yet rated
Community Contributions

Found the code? Know the venue? Think something is wrong? Let us know!

๐Ÿ“œ Similar Papers

In the same crypt โ€” Sound

Died the same way โ€” ๐Ÿ‘ป Ghosted