SottoVoce: An Ultrasound Imaging-Based Silent Speech Interaction Using Deep Neural Networks

March 03, 2023 · Declared Dead · 🏛 International Conference on Human Factors in Computing Systems

👻 CAUSE OF DEATH: Ghosted
No code link whatsoever

"No code URL or promise found in abstract"

Evidence collected by the PWNC Scanner

Authors: Naoki Kimura, Michinari Kono, Jun Rekimoto
arXiv ID: 2303.01758
Category: cs.HC (Human-Computer Interaction)
Cross-listed: cs.LG, cs.SD, eess.AS, eess.IV
Citations: 128
Venue: International Conference on Human Factors in Computing Systems
Last Checked: 2 months ago

Abstract
The availability of digital devices operated by voice is expanding rapidly. However, the applications of voice interfaces are still restricted. For example, speaking in public places becomes an annoyance to the surrounding people, and secret information should not be uttered. Environmental noise may reduce the accuracy of speech recognition. To address these limitations, a system to detect a user's unvoiced utterance is proposed. From internal information observed by an ultrasonic imaging sensor attached to the underside of the jaw, our proposed system recognizes the utterance contents without the user's uttering voice. Our proposed deep neural network model is used to obtain acoustic features from a sequence of ultrasound images. We confirmed that audio signals generated by our system can control the existing smart speakers. We also observed that a user can adjust their oral movement to learn and improve the accuracy of their voice recognition.
