Online end-to-end neural diarization with speaker-tracing buffer
This paper proposes a novel online speaker diarization algorithm based on a fully
supervised self-attention mechanism (SA-EEND). Online diarization inherently presents a …
Supervised online diarization with sample mean loss for multi-domain data
Recently, a fully supervised speaker diarization approach was proposed (UIS-RNN) which
models speakers using multiple instances of a parameter-sharing recurrent neural network …
Speech processing system and method
US8612224B2 - Speech processing system and method - Google Patents
Online speaker diarization using adapted i-vector transforms
W Zhu, J Pelecanos - 2016 IEEE International Conference on …, 2016 - ieeexplore.ieee.org
Many speaker diarization systems operate in an off-line mode. Such systems typically find
homogeneous segments and then cluster these segments according to speaker. Such …
Online Speaker Diarization with Core Samples Selection.
We propose a novel online speaker diarization approach based on the VBx algorithm which
works well on the offline speaker diarization tasks. To efficiently process long-time …
Online target speaker voice activity detection for speaker diarization
This paper proposes an online target speaker voice activity detection system for speaker
diarization tasks, which does not require a priori knowledge from the clustering-based …
Low-latency online speaker diarization with graph-based label generation
This paper introduces an online speaker diarization system that can handle long-time audio
with low latency. We enable Agglomerative Hierarchy Clustering (AHC) to work in an online …
Adaptive and online speaker diarization for meeting data
Speaker diarization aims to determine 'who spoke when' in a given audio stream. Different
applications, such as document structuring or information retrieval have led to the …
Audio-video speaker diarization for unsupervised speaker and face model creation
Our goal is to create speaker models in audio domain and face models in video domain from
a set of videos in an unsupervised manner. Such models can be used later for speaker …