A review of depression and suicide risk assessment using speech analysis

N Cummins, S Scherer, J Krajewski, S Schnieder… - Speech …, 2015 - Elsevier
This paper is the first review into the automatic analysis of speech for use as an objective
predictor of depression and suicidality. Both conditions are major public health concerns; …

Computerized analysis of speech and voice for Parkinson's disease: A systematic review

QC Ngo, MA Motin, ND Pah, P Drotár… - Computer Methods and …, 2022 - Elsevier
Background and objective Speech impairment is an early symptom of Parkinson's disease
(PD). This study has summarized the literature related to speech and voice in detecting PD …

Seamless: Multilingual Expressive and Streaming Speech Translation

L Barrault, YA Chung, MC Meglioli, D Dale… - arxiv preprint arxiv …, 2023 - arxiv.org
Large-scale automatic speech translation systems today lack key features that help machine-
mediated communication feel seamless when compared to human-to-human dialogue. In …

Emotional voice conversion: Theory, databases and esd

K Zhou, B Sisman, R Liu, H Li - Speech Communication, 2022 - Elsevier
In this paper, we first provide a review of the state-of-the-art emotional voice conversion
research, and the existing emotional speech databases. We then motivate the development …

Generative spoken dialogue language modeling

TA Nguyen, E Kharitonov, J Copet, Y Adi… - Transactions of the …, 2023 - direct.mit.edu
We introduce dGSLM, the first “textless” model able to generate audio samples of naturalistic
spoken dialogues. It uses recent work on unsupervised spoken unit discovery coupled with …

[PDF][PDF] The INTERSPEECH 2013 computational paralinguistics challenge: Social signals, conflict, emotion, autism

B Schuller, S Steidl, A Batliner… - … 2013, 14th Annual …, 2013 - mediatum.ub.tum.de
Abstract The INTERSPEECH 2013 Computational Paralinguistics Challenge provides for
the first time a unified test-bed for Social Signals such as laughter in speech. It further …

[KNIHA][B] Teaching and researching: Listening

M Rost - 2013 - taylorfrancis.com
Teaching and Researching Listening provides a focused, state-of-the-art treatment of the
linguistic, psycholinguistic and pragmatic processes that are involved in oral language use …

Towards emotionally aware AI smart classroom: Current issues and directions for engineering and education

Y Kim, T Soyata, RF Behnagh - Ieee Access, 2018 - ieeexplore.ieee.org
Future smart classrooms that we envision will significantly enhance learning experience and
seamless communication among students and teachers using real-time sensing and …

Multimodal fusion of bert-cnn and gated cnn representations for depression detection

M Rodrigues Makiuchi, T Warnita, K Uto… - Proceedings of the 9th …, 2019 - dl.acm.org
Depression is a common, but serious mental disorder that affects people all over the world.
Besides providing an easier way of diagnosing the disorder, a computer-aided automatic …

Behavioral signal processing: Deriving human behavioral informatics from speech and language

S Narayanan, PG Georgiou - Proceedings of the IEEE, 2013 - ieeexplore.ieee.org
The expression and experience of human behavior are complex and multimodal and
characterized by individual and contextual heterogeneity and variability. Speech and …