Encoder-decoder based attractors for end-to-end neural diarization

S Horiguchi, Y Fujita, S Watanabe… - … /ACM Transactions on …, 2022 - ieeexplore.ieee.org
This paper investigates an end-to-end neural diarization (EEND) method for an unknown
number of speakers. In contrast to the conventional cascaded approach to speaker …

Online neural diarization of unlimited numbers of speakers using global and local attractors

S Horiguchi, S Watanabe, P García… - … on Audio, Speech …, 2022 - ieeexplore.ieee.org
A method to perform offline and online speaker diarization for an unlimited number of
speakers is described in this paper. End-to-end neural diarization (EEND) has achieved …

Overlap-aware low-latency online speaker diarization based on end-to-end local segmentation

JM Coria, H Bredin, S Ghannay… - 2021 IEEE Automatic …, 2021 - ieeexplore.ieee.org
We propose to address online speaker diarization as a combination of incremental
clustering and local diarization applied to a rolling buffer updated every 500ms. Every single …

Frame-wise and overlap-robust speaker embeddings for meeting diarization

T Cord-Landwehr, C Boeddeker… - ICASSP 2023-2023 …, 2023 - ieeexplore.ieee.org
Using a Teacher-Student training approach we developed a speaker embedding extraction
system that outputs embeddings at frame rate. Given this high temporal resolution and the …

Speaker overlap-aware neural diarization for multi-party meeting analysis

Z Du, S Zhang, S Zheng, Z Yan - arxiv preprint arxiv:2211.10243, 2022 - arxiv.org
Recently, hybrid systems of clustering and neural diarization models have been successfully
applied in multi-party meeting analysis. However, current models always treat overlapped …

Sortformer: Seamless integration of speaker diarization and asr by bridging timestamps and tokens

T Park, I Medennikov, K Dhawan, W Wang… - arxiv preprint arxiv …, 2024 - arxiv.org
We propose Sortformer, a novel neural model for speaker diarization, trained with
unconventional objectives compared to existing end-to-end diarization models. The …

End-to-end Online Speaker Diarization with Target Speaker Tracking

W Wang, M Li - arxiv preprint arxiv:2310.08696, 2023 - arxiv.org
This paper proposes an online target speaker voice activity detection system for speaker
diarization tasks, which does not require a priori knowledge from the clustering-based …

[PDF][PDF] Online Speaker Diarization with Core Samples Selection.

Y Yue, J Du, MK He, YT Yeung, R Wang - INTERSPEECH, 2022 - isca-archive.org
We propose a novel online speaker diarization approach based on the VBx algorithm which
works well on the offline speaker diarization tasks. To efficiently process long-time …

Online target speaker voice activity detection for speaker diarization

W Wang, Q Lin, M Li - arxiv preprint arxiv:2207.05920, 2022 - arxiv.org
This paper proposes an online target speaker voice activity detection system for speaker
diarization tasks, which does not require a priori knowledge from the clustering-based …

Adapting speaker embeddings for speaker diarisation

Y Kwon, J Jung, HS Heo, YJ Kim, BJ Lee… - arxiv preprint arxiv …, 2021 - arxiv.org
The goal of this paper is to adapt speaker embeddings for solving the problem of speaker
diarisation. The quality of speaker embeddings is paramount to the performance of speaker …