Localizing speakers in multiple rooms by using deep neural networks

F Vesperini, P Vecchiotti, E Principi, S Squartini… - Computer Speech & …, 2018 - Elsevier
In the field of human speech capturing systems, a fundamental role is played by the source
localization algorithms. In this paper a Speaker Localization algorithm (SLOC) based on …

A deep neural network approach for voice activity detection in multi-room domestic scenarios

G Ferroni, R Bonfigli, E Principi… - … Joint Conference on …, 2015 - ieeexplore.ieee.org
This paper presents a Voice Activity Detector (VAD) for multi-room domestic scenarios. A
multi-room VAD (mVAD) simultaneously detects the time boundaries of a speech segment …

[PDF][PDF] Optimizing Voice Activity Detection for Noisy Conditions.

R Lin, C Costello, C Jankowski, V Mruthyunjaya - INTERSPEECH, 2019 - isca-archive.org
In this work, we focus our attention on how to improve Voice Activity Detection (VAD) in noisy
conditions. We propose a Convolutional Neural Network (CNN) based model, as well as a …

[PDF][PDF] Повышение робастности систем автоматического распознавания речи методами обработки сигналов

ОН Ладошко - 2016 - core.ac.uk
Стремительное развитие современных достижений в области цифровой обработки
сигналов способствует широкому распространению аппаратнопрограммных систем …