Robust Speech and Natural Language Processing Models for Depression Screening

December 26, 2024 Β· Declared Dead Β· πŸ› IEEE Signal Processing in Medicine and Biology Symposium

πŸ‘» CAUSE OF DEATH: Ghosted
No code link whatsoever

"No code URL or promise found in abstract"

Evidence collected by the PWNC Scanner

Authors Y. Lu, A. Harati, T. Rutowski, R. Oliveira, P. Chlebek, E. Shriberg arXiv ID 2412.19072 Category eess.AS: Audio & Speech Cross-listed cs.CL Citations 5 Venue IEEE Signal Processing in Medicine and Biology Symposium Last Checked 3 months ago
Abstract
Depression is a global health concern with a critical need for increased patient screening. Speech technology offers advantages for remote screening but must perform robustly across patients. We have described two deep learning models developed for this purpose. One model is based on acoustics; the other is based on natural language processing. Both models employ transfer learning. Data from a depression-labeled corpus in which 11,000 unique users interacted with a human-machine application using conversational speech is used. Results on binary depression classification have shown that both models perform at or above AUC=0.80 on unseen data with no speaker overlap. Performance is further analyzed as a function of test subset characteristics, finding that the models are generally robust over speaker and session variables. We conclude that models based on these approaches offer promise for generalized automated depression screening.
Community shame:
Not yet rated
Community Contributions

Found the code? Know the venue? Think something is wrong? Let us know!

πŸ“œ Similar Papers

In the same crypt β€” Audio & Speech

Died the same way β€” πŸ‘» Ghosted