A Methodology for Controlling the Emotional Expressiveness in Synthetic Speech -- a Deep Learning approach

July 05, 2019 Β· Declared Dead Β· πŸ› 2019 8th International Conference on Affective Computing and Intelligent Interaction Workshops and Demos (ACIIW)

πŸ‘» CAUSE OF DEATH: Ghosted
No code link whatsoever

"No code URL or promise found in abstract"

Evidence collected by the PWNC Scanner

Authors NoΓ© Tits arXiv ID 1907.02784 Category eess.AS: Audio & Speech Cross-listed cs.CL, cs.LG, cs.SD Citations 10 Venue 2019 8th International Conference on Affective Computing and Intelligent Interaction Workshops and Demos (ACIIW) Last Checked 3 months ago
Abstract
In this project, we aim to build a Text-to-Speech system able to produce speech with a controllable emotional expressiveness. We propose a methodology for solving this problem in three main steps. The first is the collection of emotional speech data. We discuss the various formats of existing datasets and their usability in speech generation. The second step is the development of a system to automatically annotate data with emotion/expressiveness features. We compare several techniques using transfer learning to extract such a representation through other tasks and propose a method to visualize and interpret the correlation between vocal and emotional features. The third step is the development of a deep learning-based system taking text and emotion/expressiveness as input and producing speech as output. We study the impact of fine tuning from a neutral TTS towards an emotional TTS in terms of intelligibility and perception of the emotion.
Community shame:
Not yet rated
Community Contributions

Found the code? Know the venue? Think something is wrong? Let us know!

πŸ“œ Similar Papers

In the same crypt β€” Audio & Speech

Died the same way β€” πŸ‘» Ghosted