Machine learning for stuttering identification: Review, challenges and future directions
Stuttering is a speech disorder during which the flow of speech is interrupted by involuntary
pauses and repetition of sounds. Stuttering identification is an interesting interdisciplinary …
pauses and repetition of sounds. Stuttering identification is an interesting interdisciplinary …
An overview of the icassp special session on ai security and privacy in speech and audio processing
Perceiving and producing speech and audio signals are the basic ways for humans to
communicate with each other and know about the world. Benefiting from the advancement of …
communicate with each other and know about the world. Benefiting from the advancement of …
Muse 2022 challenge: Multimodal humour, emotional reactions, and stress
The 3rd Multimodal Sentiment Analysis Challenge (MuSe) focuses on multimodal affective
computing. The workshop is held in conjunction with ACM Multimedia'22. Three datasets are …
computing. The workshop is held in conjunction with ACM Multimedia'22. Three datasets are …
Wav2vec2-based paralinguistic systems to recognise vocalised emotions and stuttering
With the rapid advancement in automatic speech recognition and natural language
understanding, a complementary field (paralinguistics) emerged, focusing on the non-verbal …
understanding, a complementary field (paralinguistics) emerged, focusing on the non-verbal …
Advancing stuttering detection via data augmentation, class-balanced loss and multi-contextual deep learning
Stuttering is a neuro-developmental speech impairment characterized by uncontrolled
utterances (interjections) and core behaviors (blocks, repetitions, and prolongations), and is …
utterances (interjections) and core behaviors (blocks, repetitions, and prolongations), and is …
Viper: Video-based perceiver for emotion recognition
Recognizing human emotions from videos requires a deep understanding of the underlying
multimodal sources, including images, audio, and text. Since the input data sources are …
multimodal sources, including images, audio, and text. Since the input data sources are …
Classification of stuttering–The ComParE challenge and beyond
Abstract The ACM Multimedia 2022 Computational Paralinguistics Challenge (ComParE)
featured a sub-challenge on the classification of stuttering in order to bring attention to this …
featured a sub-challenge on the classification of stuttering in order to bring attention to this …
Detecting vocal fatigue with neural embeddings
Vocal fatigue refers to the feeling of tiredness and weakness of voice due to extended
utilization. This paper investigates the effectiveness of neural embeddings for the detection …
utilization. This paper investigates the effectiveness of neural embeddings for the detection …
Audio features from the Wav2Vec 2.0 embeddings for the ACM multimedia 2022 stuttering challenge
C Montacié, MJ Caraty, N Lackovic - Proceedings of the 30th ACM …, 2022 - dl.acm.org
The ACM Multimedia 2022 Stuttering Challenge is to determine the stuttering-related class
of a speech segment. There are seven stuttering-related classes and an eighth garbage …
of a speech segment. There are seven stuttering-related classes and an eighth garbage …
Detecting Voice Fatigue With Artificial Intelligence
A Siripurapu, RT Sataloff - Journal of Voice, 2024 - Elsevier
Voice fatigue (VF) has many symptoms and can occur after extended or brief voice use,
depending on the presence or absence of voice pathology, and other factors. However …
depending on the presence or absence of voice pathology, and other factors. However …