Machine learning for stuttering identification: Review, challenges and future directions

SA Sheikh, M Sahidullah, F Hirsch, S Ouni - Neurocomputing, 2022 - Elsevier
Stuttering is a speech disorder during which the flow of speech is interrupted by involuntary
pauses and repetition of sounds. Stuttering identification is an interesting interdisciplinary …

An overview of the icassp special session on ai security and privacy in speech and audio processing

Z Ren, K Qian, T Schultz, BW Schuller - Proceedings of the 5th ACM …, 2023 - dl.acm.org
Perceiving and producing speech and audio signals are the basic ways for humans to
communicate with each other and know about the world. Benefiting from the advancement of …

Muse 2022 challenge: Multimodal humour, emotional reactions, and stress

S Amiriparian, L Christ, A König, EM Meßner… - Proceedings of the 30th …, 2022 - dl.acm.org
The 3rd Multimodal Sentiment Analysis Challenge (MuSe) focuses on multimodal affective
computing. The workshop is held in conjunction with ACM Multimedia'22. Three datasets are …

Wav2vec2-based paralinguistic systems to recognise vocalised emotions and stuttering

T Grósz, D Porjazovski, Y Getman, S Kadiri… - Proceedings of the 30th …, 2022 - dl.acm.org
With the rapid advancement in automatic speech recognition and natural language
understanding, a complementary field (paralinguistics) emerged, focusing on the non-verbal …

Advancing stuttering detection via data augmentation, class-balanced loss and multi-contextual deep learning

SA Sheikh, M Sahidullah, F Hirsch… - IEEE Journal of …, 2023 - ieeexplore.ieee.org
Stuttering is a neuro-developmental speech impairment characterized by uncontrolled
utterances (interjections) and core behaviors (blocks, repetitions, and prolongations), and is …

Viper: Video-based perceiver for emotion recognition

L Vaiani, M La Quatra, L Cagliero, P Garza - Proceedings of the 3rd …, 2022 - dl.acm.org
Recognizing human emotions from videos requires a deep understanding of the underlying
multimodal sources, including images, audio, and text. Since the input data sources are …

Classification of stuttering–The ComParE challenge and beyond

SP Bayerl, M Gerczuk, A Batliner, C Bergler… - Computer Speech & …, 2023 - Elsevier
Abstract The ACM Multimedia 2022 Computational Paralinguistics Challenge (ComParE)
featured a sub-challenge on the classification of stuttering in order to bring attention to this …

Detecting vocal fatigue with neural embeddings

SP Bayerl, D Wagner, I Baumann, T Bocklet… - Journal of Voice, 2023 - Elsevier
Vocal fatigue refers to the feeling of tiredness and weakness of voice due to extended
utilization. This paper investigates the effectiveness of neural embeddings for the detection …

Audio features from the Wav2Vec 2.0 embeddings for the ACM multimedia 2022 stuttering challenge

C Montacié, MJ Caraty, N Lackovic - Proceedings of the 30th ACM …, 2022 - dl.acm.org
The ACM Multimedia 2022 Stuttering Challenge is to determine the stuttering-related class
of a speech segment. There are seven stuttering-related classes and an eighth garbage …

Detecting Voice Fatigue With Artificial Intelligence

A Siripurapu, RT Sataloff - Journal of Voice, 2024 - Elsevier
Voice fatigue (VF) has many symptoms and can occur after extended or brief voice use,
depending on the presence or absence of voice pathology, and other factors. However …