ALLIES: A Speech Corpus for Segmentation, Speaker Diarization, Speech Recognition and Speaker Change Detection

M Tahon, A Larcher, M Lebourdais… - Proceedings of the …, 2024 - aclanthology.org
This paper presents ALLIES, a meta corpus which gathers and extends existing French
corpora collected from radio and TV shows. The corpus contains 1048 audio files for about …

Lifelong Learning MOS Prediction for Synthetic Speech Quality Evaluation

F Saget, M Shamsi, M Tahon - Interspeech 2024, 2024 - hal.science
Mean Opinion Score (MOS) has been a long-standing standard for perceptive evaluation of
quality of speech synthesis models; however, this criterion is hardly reproducible, and costly …

Double Mixture: Towards Continual Event Detection from Speech

J Kang, T Wu, J Zhao, G Wang, Y Wei, H Yang… - arxiv preprint arxiv …, 2024 - arxiv.org
Speech event detection is crucial for multimedia retrieval, involving the tagging of both
semantic and acoustic events. Traditional ASR systems often overlook the interplay between …

Traitement automatique de la parole expressive: retour vers des systèmes interprétables?

M Tahon - 2023 - hal.science
La parole est un moyen de communication fondamental qui s' inscrit dans une interaction
entre le locuteur et ses auditeurs. En plus du contenu sémantique, le signal de parole nous …