Online end-to-end neural diarization with speaker-tracing buffer

Y Xue, S Horiguchi, Y Fujita… - 2021 IEEE Spoken …, 2021 - ieeexplore.ieee.org
This paper proposes a novel online speaker diarization algorithm based on a fully
supervised self-attention mechanism (SA-EEND). Online diarization inherently presents a …

Supervised online diarization with sample mean loss for multi-domain data

E Fini, A Brutti - … 2020-2020 IEEE International Conference on …, 2020 - ieeexplore.ieee.org
Recently, a fully supervised speaker diarization approach was proposed (UIS-RNN) which
models speakers using multiple instances of a parameter-sharing recurrent neural network …

Speech processing system and method

C Breslin, MJF Gales, KK Chin, KM Knill - US Patent 8,612,224, 2013 - Google Patents
US8612224B2 - Speech processing system and method - Google Patents …

Online speaker diarization using adapted i-vector transforms

W Zhu, J Pelecanos - 2016 IEEE International Conference on …, 2016 - ieeexplore.ieee.org
Many speaker diarization systems operate in an off-line mode. Such systems typically find
homogeneous segments and then cluster these segments according to speaker. Such …

[PDF] Online Speaker Diarization with Core Samples Selection.

Y Yue, J Du, MK He, YT Yeung, R Wang - INTERSPEECH, 2022 - isca-archive.org
We propose a novel online speaker diarization approach based on the VBx algorithm which
works well on the offline speaker diarization tasks. To efficiently process long-time …

Online target speaker voice activity detection for speaker diarization

W Wang, Q Lin, M Li - arXiv preprint arXiv:2207.05920, 2022 - arxiv.org
This paper proposes an online target speaker voice activity detection system for speaker
diarization tasks, which does not require a priori knowledge from the clustering-based …

Low-latency online speaker diarization with graph-based label generation

Y Zhang, Q Lin, W Wang, L Yang, X Wang… - arXiv preprint arXiv …, 2021 - arxiv.org
This paper introduces an online speaker diarization system that can handle long-time audio
with low latency. We enable Agglomerative Hierarchy Clustering (AHC) to work in an online …

Adaptive and online speaker diarization for meeting data

G Soldi, C Beaugeant, N Evans - 2015 23rd European Signal …, 2015 - ieeexplore.ieee.org
Speaker diarization aims to determine 'who spoke when' in a given audio stream. Different
applications, such as document structuring or information retrieval have led to the …

Audio-video speaker diarization for unsupervised speaker and face model creation

P Campr, M Kunešová, J Vaněk, J Čech… - Text, Speech and …, 2014 - Springer
Our goal is to create speaker models in audio domain and face models in video domain from
a set of videos in an unsupervised manner. Such models can be used later for speaker …