DiaPer: End-to-end neural diarization with perceiver-based attractors

F Landini, T Stafylakis, L Burget - IEEE/ACM Transactions on …, 2024 - ieeexplore.ieee.org
Until recently, the field of speaker diarization was dominated by cascaded systems. Due to
their limitations, mainly regarding overlapped speech and cumbersome pipelines, end-to …

X-TF-GridNet: A time–frequency domain target speaker extraction network with adaptive speaker embedding fusion

F Hao, X Li, C Zheng - Information Fusion, 2024 - Elsevier
Target speaker extraction (TSE) which has the capability to directly extract desired speech
given enrollment utterances of the target speaker has attracted more and more attention for …

DiaCorrect: Error correction back-end for speaker diarization

J Han, F Landini, J Rohdin, M Diez… - ICASSP 2024-2024 …, 2024 - ieeexplore.ieee.org
In this work, we propose an error correction framework, named DiaCorrect, to refine the
output of a diarization system in a simple yet effective way. This method is inspired by error …

EEND-DEMUX: End-to-End Neural Speaker Diarization via Demultiplexed Speaker Embeddings

SH Mun, MH Han, C Moon, NS Kim - arXiv preprint arXiv:2312.06065, 2023 - arxiv.org
In recent years, there have been studies to further improve the end-to-end neural speaker
diarization (EEND) systems. This letter proposes the EEND-DEMUX model, a novel …

End-to-End Neural Speaker Diarization With Non-Autoregressive Attractors

M Rybicka, J Villalba, T Thebaud… - … on Audio, Speech …, 2024 - ieeexplore.ieee.org
Despite many recent developments in speaker diarization, it remains a challenge and an
active area of research to make diarization robust and effective in real-life scenarios. Well …

Sequence-to-Sequence Neural Diarization with Automatic Speaker Detection and Representation

M Cheng, Y Lin, M Li - arXiv preprint arXiv:2411.13849, 2024 - arxiv.org
This paper proposes a novel Sequence-to-Sequence Neural Diarization (SSND) framework
to perform online and offline speaker diarization. It is developed from the sequence-to …

Self-Conditioning via Intermediate Predictions for End-to-End Neural Speaker Diarization

Y Fujita, T Ogawa, T Kobayashi - IEEE Access, 2023 - ieeexplore.ieee.org
This paper presents a speaker diarization model that incorporates label dependency via
intermediate predictions. The proposed method is categorized as an end-to-end neural …