Graph attention-based deep embedded clustering for speaker diarization

Y Wei, H Guo, Z Ge, Z Yang - Speech Communication, 2023 - Elsevier
Deep speaker embedding extraction models have recently served as the cornerstone for
modular speaker diarization systems. However, in current modular systems, the extracted …

Meta-learning with latent space clustering in generative adversarial network for speaker diarization

M Pal, M Kumar, R Peri, TJ Park, SH Kim… - … ACM transactions on …, 2021 - ieeexplore.ieee.org
The performance of most speaker diarization systems with x-vector embeddings is both
vulnerable to noisy environments and lacks domain robustness. Earlier work on speaker …

Speaker diarization using latent space clustering in generative adversarial network

M Pal, M Kumar, R Peri, TJ Park, SH Kim… - ICASSP 2020-2020 …, 2020 - ieeexplore.ieee.org
In this work, we propose deep latent space clustering for speaker diarization using
generative adversarial network (GAN) back-projection with the help of an encoder network …

Linguistically aided speaker diarization using speaker role information

N Flemotomos, P Georgiou, S Narayanan - arxiv preprint arxiv …, 2019 - arxiv.org
Speaker diarization relies on the assumption that speech segments corresponding to a
particular speaker are concentrated in a specific region of the speaker space; a region which …

Multi-scale speaker diarization with neural affinity score fusion

TJ Park, M Kumar, S Narayanan - ICASSP 2021-2021 IEEE …, 2021 - ieeexplore.ieee.org
Predicting the speaker's identity of short speech segments in human dialogue has been
considered one of the most challenging problems in speech signal processing. Speaker …

Automatic prediction of suicidal risk in military couples using multimodal interaction cues from couples conversations

SN Chakravarthula, M Nasir, SY Tseng… - ICASSP 2020-2020 …, 2020 - ieeexplore.ieee.org
Suicide is a major societal challenge globally, with a wide range of risk factors, from
individual health, psychological and behavioral elements to socio-economic aspects …

Look who's not talking

Y Kwon, HS Heo, J Huh, BJ Lee… - 2021 IEEE Spoken …, 2021 - ieeexplore.ieee.org
The objective of this work is speaker diarisation of speech recordingsin the wild'. The ability
to determine speech segments is a crucial part of diarisation systems, accounting for a large …

[PDF][PDF] Extracting and Using Speaker Role Information in Speech Processing Applications

N Flemotomos - 2022 - nikosfl.github.io
Roles are one of the most important concepts in understanding and modeling human
behavior. According to social psychology, individuals assume distinct roles in different …

[PDF][PDF] USC-SIPI Report# 449 Multimodal and Self-guided Clustering Approaches Toward Context Aware Speaker Diarization

TJ Park - 2021 - sipi.usc.edu
Speaker diarization has become an important field in recent years owing to the growing
demand for conversational artificial intelligence and interactive entertainment systems …

[引用][C] Speaker Diarization

M Kunešová - 2021 - Západočeská univerzita v Plzni