Audio-Visual Speaker Diarization: Current Databases, Approaches and Challenges
Nowadays, the large amount of audio-visual content available has fostered the need to
develop new robust automatic speaker diarization systems to analyse and characterise it …
develop new robust automatic speaker diarization systems to analyse and characterise it …
Multimodal diarization systems by training enrollment models as identity representations
This paper describes a post-evaluation analysis of the system developed by ViVoLAB
research group for the IberSPEECH-RTVE 2020 Multimodal Diarization (MD) Challenge …
research group for the IberSPEECH-RTVE 2020 Multimodal Diarization (MD) Challenge …
Design of Intelligent models for Multimodal Socio-Affective Computing
C Luna Jiménez - 2023 - oa.upm.es
Dialog and human-machine communication systems have represented a revolution in
recent years. Nonetheless, users increasingly require more personalized and human-like …
recent years. Nonetheless, users increasingly require more personalized and human-like …
[PDF][PDF] Representation and metric learning advances for deep neural network face and speaker biometric systems
VM Bueno - 2022 - researchgate.net
The increasing use of technological devices and biometric recognition systems in people
daily lives has motivated a great deal of research interest in the development of effective and …
daily lives has motivated a great deal of research interest in the development of effective and …