Graph attention-based deep embedded clustering for speaker diarization
Y Wei, H Guo, Z Ge, Z Yang - Speech Communication, 2023 - Elsevier
Deep speaker embedding extraction models have recently served as the cornerstone for
modular speaker diarization systems. However, in current modular systems, the extracted …
modular speaker diarization systems. However, in current modular systems, the extracted …
Meta-learning with latent space clustering in generative adversarial network for speaker diarization
The performance of most speaker diarization systems with x-vector embeddings is both
vulnerable to noisy environments and lacks domain robustness. Earlier work on speaker …
vulnerable to noisy environments and lacks domain robustness. Earlier work on speaker …
Speaker diarization using latent space clustering in generative adversarial network
In this work, we propose deep latent space clustering for speaker diarization using
generative adversarial network (GAN) back-projection with the help of an encoder network …
generative adversarial network (GAN) back-projection with the help of an encoder network …
Linguistically aided speaker diarization using speaker role information
Speaker diarization relies on the assumption that speech segments corresponding to a
particular speaker are concentrated in a specific region of the speaker space; a region which …
particular speaker are concentrated in a specific region of the speaker space; a region which …
Multi-scale speaker diarization with neural affinity score fusion
Predicting the speaker's identity of short speech segments in human dialogue has been
considered one of the most challenging problems in speech signal processing. Speaker …
considered one of the most challenging problems in speech signal processing. Speaker …
Automatic prediction of suicidal risk in military couples using multimodal interaction cues from couples conversations
Suicide is a major societal challenge globally, with a wide range of risk factors, from
individual health, psychological and behavioral elements to socio-economic aspects …
individual health, psychological and behavioral elements to socio-economic aspects …
Look who's not talking
The objective of this work is speaker diarisation of speech recordingsin the wild'. The ability
to determine speech segments is a crucial part of diarisation systems, accounting for a large …
to determine speech segments is a crucial part of diarisation systems, accounting for a large …
[PDF][PDF] Extracting and Using Speaker Role Information in Speech Processing Applications
N Flemotomos - 2022 - nikosfl.github.io
Roles are one of the most important concepts in understanding and modeling human
behavior. According to social psychology, individuals assume distinct roles in different …
behavior. According to social psychology, individuals assume distinct roles in different …
[PDF][PDF] USC-SIPI Report# 449 Multimodal and Self-guided Clustering Approaches Toward Context Aware Speaker Diarization
TJ Park - 2021 - sipi.usc.edu
Speaker diarization has become an important field in recent years owing to the growing
demand for conversational artificial intelligence and interactive entertainment systems …
demand for conversational artificial intelligence and interactive entertainment systems …