Speaker diarization: A review of recent research

X Anguera, S Bozonnet, N Evans… - … on audio, speech …, 2012 - ieeexplore.ieee.org
Speaker diarization is the task of determining “who spoke when?” in an audio or video
recording that contains an unknown amount of speech and also an unknown number of …

A review on speaker diarization systems and approaches

MH Moattar, MM Homayounpour - Speech Communication, 2012 - Elsevier
Speaker indexing or diarization is an important task in audio processing and retrieval.
Speaker diarization is the process of labeling a speech signal with labels corresponding to …

LIUM SpkDiarization: an open source toolkit for diarization

S Meignier, T Merlin - CMU SPUD Workshop, 2010 - hal.science
This paper presents an open-source diarization toolkit which is mostly dedicated to speaker
and developed by the LIUM. This toolkit includes hierarchical agglomerative clustering …

An open-source state-of-the-art toolbox for broadcast news diarization

M Rouvier, G Dupuy, P Gay, E Khoury, T Merlin… - Interspeech, 2013 - hal.science
This paper presents the LIUM open-source speaker diarization toolbox, mostly dedicated to
broadcast news. This tool includes both Hierarchical Agglomerative Clustering using well …

An information theoretic approach to speaker diarization of meeting data

D Vijayasenan, F Valente… - IEEE transactions on …, 2009 - ieeexplore.ieee.org
A speaker diarization system based on an information theoretic framework is described. The
problem is formulated according to the information bottleneck (IB) principle. Unlike other …

Acoustic classification and segmentation using modified spectral roll-off and variance-based features

M Kos, Z Kačič, D Vlaj - Digital Signal Processing, 2013 - Elsevier
This paper presents novel features and an architecture for an automatic on-line acoustic
classification and segmentation system. The system includes speech/non-speech …

Prosodic and other long-term features for speaker diarization

G Friedland, O Vinyals, Y Huang… - IEEE Transactions on …, 2009 - ieeexplore.ieee.org
Speaker diarization is defined as the task of determining “who spoke when” given
an audio track and no other prior knowledge of any kind. The following article shows how a …

Hybrid speech and text analysis methods for speaker change detection

OH Anidjar, I Lapidot, C Hajaj, A Dvir… - IEEE/ACM Transactions …, 2021 - ieeexplore.ieee.org
Speaker Change Detection (SCD) is the task of segmenting an input audio-recording
according to speaker interchanges. Nowadays, many applications, such as Speaker …

Multi-modal speaker diarization of real-world meetings using compressed-domain video features

G Friedland, H Hung, C Yeo - 2009 IEEE International …, 2009 - ieeexplore.ieee.org
Speaker diarization is originally defined as the task of determining “who spoke
when” given an audio track and no other prior knowledge of any kind. The following …

The use of long-term features for GMM- and i-vector-based speaker diarization systems

AW Zewoudie, J Luque, J Hernando - EURASIP Journal on Audio, Speech …, 2018 - Springer
Several factors contribute to the performance of speaker diarization systems. For instance,
the appropriate selection of speech features is one of the key aspects that affect speaker …