DiaPer: End-to-End Neural Diarization with Perceiver-Based Attractors
Until recently, the field of speaker diarization was dominated by cascaded systems. Due to
their limitations, mainly regarding overlapped speech and cumbersome pipelines, end-to …
X-TF-GridNet: A Time–Frequency Domain Target Speaker Extraction Network with Adaptive Speaker Embedding Fusion
Target speaker extraction (TSE), which has the capability to directly extract desired speech
given enrollment utterances of the target speaker, has attracted more and more attention for …
DiaCorrect: Error correction back-end for speaker diarization
In this work, we propose an error correction framework, named DiaCorrect, to refine the
output of a diarization system in a simple yet effective way. This method is inspired by error …
EEND-DEMUX: End-to-End Neural Speaker Diarization via Demultiplexed Speaker Embeddings
In recent years, there have been studies to further improve the end-to-end neural speaker
diarization (EEND) systems. This letter proposes the EEND-DEMUX model, a novel …
End-to-End Neural Speaker Diarization With Non-Autoregressive Attractors
Despite many recent developments in speaker diarization, it remains a challenge and an
active area of research to make diarization robust and effective in real-life scenarios. Well …
Sequence-to-Sequence Neural Diarization with Automatic Speaker Detection and Representation
This paper proposes a novel Sequence-to-Sequence Neural Diarization (SSND) framework
to perform online and offline speaker diarization. It is developed from the sequence-to …
Self-Conditioning via Intermediate Predictions for End-to-End Neural Speaker Diarization
This paper presents a speaker diarization model that incorporates label dependency via
intermediate predictions. The proposed method is categorized as an end-to-end neural …