Machine learning for stuttering identification: Review, challenges and future directions

SA Sheikh, M Sahidullah, F Hirsch, S Ouni - Neurocomputing, 2022 - Elsevier
Stuttering is a speech disorder during which the flow of speech is interrupted by involuntary
pauses and repetition of sounds. Stuttering identification is an interesting interdisciplinary …

Respiratory diseases diagnosis using audio analysis and artificial intelligence: a systematic review

P Kapetanidis, F Kalioras, C Tsakonas, P Tzamalis… - Sensors, 2024 - mdpi.com
Respiratory diseases represent a significant global burden, necessitating efficient diagnostic
methods for timely intervention. Digital biomarkers based on audio, acoustics, and sound …

Advancing stuttering detection via data augmentation, class-balanced loss and multi-contextual deep learning

SA Sheikh, M Sahidullah, F Hirsch… - IEEE Journal of …, 2023 - ieeexplore.ieee.org
Stuttering is a neuro-developmental speech impairment characterized by uncontrolled
utterances (interjections) and core behaviors (blocks, repetitions, and prolongations), and is …

Wav2vec2-based paralinguistic systems to recognise vocalised emotions and stuttering

T Grósz, D Porjazovski, Y Getman, S Kadiri… - Proceedings of the 30th …, 2022 - dl.acm.org
With the rapid advancement in automatic speech recognition and natural language
understanding, a complementary field (paralinguistics) emerged, focusing on the non-verbal …

Muse 2022 challenge: Multimodal humour, emotional reactions, and stress

S Amiriparian, L Christ, A König, EM Meßner… - Proceedings of the 30th …, 2022 - dl.acm.org
The 3rd Multimodal Sentiment Analysis Challenge (MuSe) focuses on multimodal affective
computing. The workshop is held in conjunction with ACM Multimedia'22. Three datasets are …

Classification of stuttering–The ComParE challenge and beyond

SP Bayerl, M Gerczuk, A Batliner, C Bergler… - Computer Speech & …, 2023 - Elsevier
Abstract The ACM Multimedia 2022 Computational Paralinguistics Challenge (ComParE)
featured a sub-challenge on the classification of stuttering in order to bring attention to this …

Detecting vocal fatigue with neural embeddings

SP Bayerl, D Wagner, I Baumann, T Bocklet… - Journal of Voice, 2023 - Elsevier
Vocal fatigue refers to the feeling of tiredness and weakness of voice due to extended
utilization. This paper investigates the effectiveness of neural embeddings for the detection …

Automated data augmentation for audio classification

Y Sun, K Xu, C Liu, Y Dou, H Wang… - … /ACM Transactions on …, 2024 - ieeexplore.ieee.org
Audio classification is a challenging task that requires categorizing audio data based on its
content or characteristics. Existing approaches for audio classification rely either on …

Viper: Video-based perceiver for emotion recognition

L Vaiani, M La Quatra, L Cagliero, P Garza - Proceedings of the 3rd …, 2022 - dl.acm.org
Recognizing human emotions from videos requires a deep understanding of the underlying
multimodal sources, including images, audio, and text. Since the input data sources are …

An overview of the icassp special session on ai security and privacy in speech and audio processing

Z Ren, K Qian, T Schultz, BW Schuller - Proceedings of the 5th ACM …, 2023 - dl.acm.org
Perceiving and producing speech and audio signals are the basic ways for humans to
communicate with each other and know about the world. Benefiting from the advancement of …