Speech analysis for health: Current state-of-the-art and the increasing impact of deep learning

N Cummins, A Baird, BW Schuller - Methods, 2018 - Elsevier
Due to the complex and intricate nature associated with their production, the acoustic-
prosodic properties of a speech signal are modulated with a range of health related effects …

Applying machine learning to facilitate autism diagnostics: pitfalls and promises

D Bone, MS Goodwin, MP Black, CC Lee… - Journal of autism and …, 2015 - Springer
Abstract Machine learning has immense potential to enhance diagnostic and intervention
research in the behavioral sciences, and may be especially useful in investigations involving …

Speech emotion recognition from 3D log-mel spectrograms with deep learning network

H Meng, T Yan, F Yuan, H Wei - IEEE access, 2019 - ieeexplore.ieee.org
Speech emotion recognition is a vital and challenging task that the feature extraction plays a
significant role in the SER performance. With the development of deep learning, we put our …

The Geneva minimalistic acoustic parameter set (GeMAPS) for voice research and affective computing

F Eyben, KR Scherer, BW Schuller… - IEEE transactions on …, 2015 - ieeexplore.ieee.org
Work on voice sciences over recent decades has led to a proliferation of acoustic
parameters that are used quite selectively and are not always extracted in a similar fashion …

Multimodal fusion of bert-cnn and gated cnn representations for depression detection

M Rodrigues Makiuchi, T Warnita, K Uto… - Proceedings of the 9th …, 2019 - dl.acm.org
Depression is a common, but serious mental disorder that affects people all over the world.
Besides providing an easier way of diagnosing the disorder, a computer-aided automatic …

Alzheimer's disease and automatic speech analysis: a review

MLB Pulido, JBA Hernández, MÁF Ballester… - Expert systems with …, 2020 - Elsevier
The objective of this paper is to present the state of-the-art relating to automatic speech and
voice analysis techniques as applied to the monitoring of patients suffering from Alzheimer's …

Paralinguistics in speech and language—state-of-the-art and the challenge

B Schuller, S Steidl, A Batliner, F Burkhardt… - Computer Speech & …, 2013 - Elsevier
Paralinguistic analysis is increasingly turning into a mainstream topic in speech and
language processing. This article aims to provide a broad overview of the constantly …

[KNIHA][B] Real-time speech and music classification by large audio feature space extraction

F Eyben - 2015 - books.google.com
This book reports on an outstanding thesis that has significantly advanced the state-of-the-
art in the automated analysis and classification of speech and music. It defines several …

LSTM-modeling of continuous emotions in an audiovisual affect recognition framework

M Wöllmer, M Kaiser, F Eyben, B Schuller… - Image and Vision …, 2013 - Elsevier
Automatically recognizing human emotions from spontaneous and non-prototypical real-life
data is currently one of the most challenging tasks in the field of affective computing. This …

Semisupervised autoencoders for speech emotion recognition

J Deng, X Xu, Z Zhang, S Frühholz… - … /ACM Transactions on …, 2017 - ieeexplore.ieee.org
Despite the widespread use of supervised learning methods for speech emotion recognition,
they are severely restricted due to the lack of sufficient amount of labelled speech data for …