A review-based study on different Text-to-Speech technologies

December 17, 2023 · The Cartographer · 🏛 arXiv.org

"No code URL or promise found in abstract"
"Title-pattern auto-detect: A review-based study on different Text-to-Speech technologies"

Evidence collected by the PWNC Scanner

Authors Md. Jalal Uddin Chowdhury, Ashab Hussan arXiv ID 2312.11563 Category cs.SD: Sound Cross-listed cs.CL, cs.LG, eess.AS Citations 6 Venue arXiv.org Last Checked 3 days ago

Abstract

This research paper presents a comprehensive review-based study on various Text-to-Speech (TTS) technologies. TTS technology is an important aspect of human-computer interaction, enabling machines to convert written text into audible speech. The paper examines the different TTS technologies available, including concatenative TTS, formant synthesis TTS, and statistical parametric TTS. The study focuses on comparing the advantages and limitations of these technologies in terms of their naturalness of voice, the level of complexity of the system, and their suitability for different applications. In addition, the paper explores the latest advancements in TTS technology, including neural TTS and hybrid TTS. The findings of this research will provide valuable insights for researchers, developers, and users who want to understand the different TTS technologies and their suitability for specific applications.