A review of speaker diarization: Recent advances with deep learning

TJ Park, N Kanda, D Dimitriadis, KJ Han… - Computer Speech & …, 2022 - Elsevier
Speaker diarization is a task to label audio or video recordings with classes that correspond
to speaker identity, or in short, a task to identify “who spoke when”. In the early years …

Speaker diarization: A review of recent research

X Anguera, S Bozonnet, N Evans… - … on audio, speech …, 2012 - ieeexplore.ieee.org
Speaker diarization is the task of determining “who spoke when?” in an audio or video
recording that contains an unknown amount of speech and also an unknown number of …

An overview of automatic speaker diarization systems

SE Tranter, DA Reynolds - IEEE Transactions on audio, speech …, 2006 - ieeexplore.ieee.org
Audio diarization is the process of annotating an input audio channel with information that
attributes (possibly overlap**) temporal regions of signal energy to their specific sources …

Unsupervised methods for speaker diarization: An integrated and iterative approach

SH Shum, N Dehak, R Dehak… - IEEE Transactions on …, 2013 - ieeexplore.ieee.org
In speaker diarization, standard approaches typically perform speaker clustering on some
initial segmentation before refining the segment boundaries in a re-segmentation step to …

An open-source state-of-the-art toolbox for broadcast news diarization

M Rouvier, G Dupuy, P Gay, E Khoury, T Merlin… - Interspeech, 2013 - hal.science
This paper presents the LIUM open-source speaker diarization toolbox, mostly dedicated to
broadcast news. This tool includes both Hierarchical Agglomerative Clustering using well …

LIUM SpkDiarization: an open source toolkit for diarization

S Meignier, T Merlin - CMU SPUD Workshop, 2010 - hal.science
This paper presents an open-source diarization toolkit which is mostly dedicated to speaker
and developed by the LIUM. This toolkit includes hierarchical agglomerative clustering …

Speaker segmentation and clustering

M Kotti, V Moschou, C Kotropoulos - Signal processing, 2008 - Elsevier
This survey focuses on two challenging speech processing topics, namely: speaker
segmentation and speaker clustering. Speaker segmentation aims at finding speaker …

A review on speaker diarization systems and approaches

MH Moattar, MM Homayounpour - Speech Communication, 2012 - Elsevier
Speaker indexing or diarization is an important task in audio processing and retrieval.
Speaker diarization is the process of labeling a speech signal with labels corresponding to …

ALIZE/SpkDet: a state-of-the-art open source software for speaker recognition

JF Bonastre, N Scheffer, D Matrouf… - Odyssey 2008: The …, 2008 - hal.science
This paper presents the ALIZE/SpkDet open source software packages for text independent
speaker recognition. This software is based on the well-known UBM/GMM approach. It …

An Improved Speech Segmentation and Clustering Algorithm Based on SOM and K‐Means

N Jiang, T Liu - Mathematical Problems in Engineering, 2020 - Wiley Online Library
This paper studies the segmentation and clustering of speaker speech. In order to improve
the accuracy of speech endpoint detection, the traditional double‐threshold short‐time …