Sequence-to-Sequence Neural Diarization with Automatic Speaker Detection and Representation

M Cheng, Y Lin, M Li - arxiv preprint arxiv:2411.13849, 2024 - arxiv.org
This paper proposes a novel Sequence-to-Sequence Neural Diarization (SSND) framework
to perform online and offline speaker diarization. It is developed from the sequence-to …

Enhancing Low-Latency Speaker Diarization with Spatial Dictionary Learning

W Chen, TT Anh, X Zhong… - ICASSP 2024-2024 IEEE …, 2024 - ieeexplore.ieee.org
This study proposes a low-latency online speaker diarization framework. Specifically, we
design a spatial dictionary learning module shared across different frequency bands …

Online speaker diarization of meetings guided by speech separation

E Gruttadauria, M Fontaine… - ICASSP 2024-2024 IEEE …, 2024 - ieeexplore.ieee.org
Overlapped speech is notoriously problematic for speaker diarization systems.
Consequently, the use of speech separation has recently been proposed to improve their …

LS-EEND: Long-Form Streaming End-to-End Neural Diarization with Online Attractor Extraction

D Liang, X Li - arxiv preprint arxiv:2410.06670, 2024 - arxiv.org
This work proposes a frame-wise online/streaming end-to-end neural diarization (EEND)
method, which detects speaker activities in a frame-in-frame-out fashion. The proposed …

Winner Takes It All: An Efficient Overlap-Aware Hybrid Online Diarization with Partial Backtracking Mechanism

R Zhen, X Zhang, C Min, B Li - 2024 IEEE International …, 2024 - ieeexplore.ieee.org
In this paper, we propose technical enhancements to a recently introduced multi-stage
overlap-aware method for low-latency online speaker diarization that combines incremental …