Who said that?: Audio-visual speaker diarisation of real-world meetings

JS Chung, BJ Lee, I Han - arxiv preprint arxiv:1906.10042, 2019 - arxiv.org
The goal of this work is to determine'who spoke when'in real-world meetings. The method
takes surround-view video and single or multi-channel audio as inputs, and generates …

Multimodal multi-channel on-line speaker diarization using sensor fusion through SVM

VP Minotto, CR Jung, B Lee - IEEE Transactions on Multimedia, 2015 - ieeexplore.ieee.org
Speaker diarization (SD) is the process of assigning speech segments of an audio stream to
its corresponding speakers, thus comprising the problem of voice activity detection (VAD) …

Online diarization of streaming audio-visual data for smart environments

J Schmalenstroeer… - IEEE Journal of Selected …, 2010 - ieeexplore.ieee.org
For an environment to be perceived as being smart, contextual information has to be
gathered to adapt the system's behavior and its interface towards the user. Being a rich …

[PDF][PDF] A hybrid approach to online speaker diarization.

C Vaquero, O Vinyals, G Friedland - InterSpeech, 2010 - researchgate.net
This article presents a low-latency speaker diarization system (“who is speaking now?”)
based on a hybrid approach that combines a traditional offline speaker diarization system …

Audio-visual Speaker Diarization: Improved Voice Activity Detection with CNN based Feature Extraction

K Fanaras, A Tragoudaras… - 2022 IEEE 65th …, 2022 - ieeexplore.ieee.org
Speaker diarization is a task to identify “who spoke when”. Moreover, nowadays, speakers'
audio clips usually are accompanied by visual information. Thus, in the latest works, speaker …

Multimodal speaker diarization for meetings using volume-evaluated SRP-PHAT and video analysis

P Cabañas-Molero, M Lucena, JM Fuertes… - Multimedia Tools and …, 2018 - Springer
Speaker diarization is traditionally defined as the problem of determining “who speaks
when” given an audio or video stream. This is an important task in many applications for …

Real-time implementation of speaker diarization system on raspberry PI3 using TLBO clustering algorithm

K Dabbabi, S Hajji, A Cherif - Circuits, Systems, and Signal Processing, 2020 - Springer
In the recent years, extensive researches have been performed on various possible
implementations of speaker diarization systems. These systems require efficient clustering …

Online speaker diarization for multimedia data retrieval on mobile devices

KM Park, JS Park, JH Bae, YH Oh - International Journal of Pattern …, 2012 - World Scientific
Speaker diarization detects speaker change points in spoken data and organizes speaker
clusters so that each cluster contains one speaker's segments. This study aims to develop …

[PDF][PDF] Akustische Szenenanalyse für die ambiente Kommunikation im vernetzten Haus

J Schmalenströer - 2010 - researchgate.net
“Ambient intelligence refers to the presence of a digital environment that is sensitive,
adaptive, and responsive to the presence of people. Within a home environment, ambient …

[PDF][PDF] Geometriekalibrierung akustischer Sensornetze

IR Häb-Umbach, P Schreier - core.ac.uk
Kurzfassung Die Aufnahme akustischer Signale durch mehrere Mikrofone bildet die
Grundlage für viele moderne Signalverarbeitungsalgorithmen. Mehrkanalige Aufnahmen …