Speaker diarization: A review of recent research

X Anguera, S Bozonnet, N Evans… - … on audio, speech …, 2012 - ieeexplore.ieee.org
Speaker diarization is the task of determining “who spoke when?” in an audio or video
recording that contains an unknown amount of speech and also an unknown number of …

An overview of automatic speaker diarization systems

SE Tranter, DA Reynolds - IEEE Transactions on audio, speech …, 2006 - ieeexplore.ieee.org
Audio diarization is the process of annotating an input audio channel with information that
attributes (possibly overlap**) temporal regions of signal energy to their specific sources …

Acoustic beamforming for speaker diarization of meetings

X Anguera, C Wooters… - IEEE Transactions on …, 2007 - ieeexplore.ieee.org
When performing speaker diarization on recordings from meetings, multiple microphones of
different qualities are usually available and distributed around the meeting room. Although …

A review on speaker diarization systems and approaches

MH Moattar, MM Homayounpour - Speech Communication, 2012 - Elsevier
Speaker indexing or diarization is an important task in audio processing and retrieval.
Speaker diarization is the process of labeling a speech signal with labels corresponding to …

[PDF][PDF] A spectral clustering approach to speaker diarization.

H Ning, M Liu, H Tang, TS Huang - Interspeech, 2006 - Citeseer
In this paper, we present a spectral clustering approach to explore the possibility of
discovering structure from audio data. To apply the Ng-Jordan-Weiss (NJW) spectral …

[KNIHA][B] Robust speaker diarization for meetings

X Anguera Miró - 2006 - upcommons.upc.edu
This thesis shows research performed into the topic of speaker diarization for meeting
rooms. It looks into the algorithms and the implementation of an offline speaker …

Spoken content retrieval: A survey of techniques and technologies

M Larson, GJF Jones - Foundations and Trends® in …, 2012 - nowpublishers.com
Speech media, that is, digital audio and video containing spoken content, has blossomed in
recent years. Large collections are accruing on the Internet as well as in private and …

Prosodic and other long-term features for speaker diarization

G Friedland, O Vinyals, Y Huang… - IEEE Transactions on …, 2009 - ieeexplore.ieee.org
Speaker diarization is defined as the task of determining ldquowho spoke whenrdquo given
an audio track and no other prior knowledge of any kind. The following article shows how a …

Estimating dominance in multi-party meetings using speaker diarization

H Hung, Y Huang, G Friedland… - IEEE Transactions on …, 2010 - ieeexplore.ieee.org
With the increase in cheap commercially available sensors, recording meetings is becoming
an increasingly practical option. With this trend comes the need to summarize the recorded …

Using audio and video features to classify the most dominant person in a group meeting

H Hung, D Jayagopi, C Yeo, G Friedland, S Ba… - Proceedings of the 15th …, 2007 - dl.acm.org
The automated extraction of semantically meaningful information from multi-modal data is
becoming increasingly necessary due to the escalation of captured data for archival. A novel …