A review of blind source separation methods: two converging routes to ILRMA originating from ICA and NMF

H Sawada, N Ono, H Kameoka, D Kitamura… - … Transactions on Signal …, 2019 - cambridge.org
This paper describes several important methods for the blind source separation of audio
signals in an integrated manner. Two historically developed routes are featured. One started …

[HTML][HTML] A survey of sound source localization with deep learning methods

PA Grumiaux, S Kitić, L Girin, A Guérin - The Journal of the Acoustical …, 2022 - pubs.aip.org
This article is a survey of deep learning methods for single and multiple sound source
localization, with a focus on sound source localization in indoor environments, where …

[HTML][HTML] Machine learning in acoustics: Theory and applications

MJ Bianco, P Gerstoft, J Traer, E Ozanich… - The Journal of the …, 2019 - pubs.aip.org
Acoustic data provide scientific and engineering insights in fields ranging from biology and
communications to ocean and Earth science. We survey the recent advances and …

Past review, current progress, and challenges ahead on the cocktail party problem

Y Qian, C Weng, X Chang, S Wang, D Yu - Frontiers of Information …, 2018 - Springer
The cocktail party problem, ie, tracing and recognizing the speech of a specific speaker
when multiple speakers talk simultaneously, is one of the critical problems yet to be solved …

A consolidated perspective on multimicrophone speech enhancement and source separation

S Gannot, E Vincent… - … /ACM Transactions on …, 2017 - ieeexplore.ieee.org
Speech enhancement and separation are core problems in audio signal processing, with
commercial applications in devices as diverse as mobile phones, conference call systems …

An analysis of environment, microphone and data simulation mismatches in robust speech recognition

E Vincent, S Watanabe, AA Nugraha, J Barker… - Computer Speech & …, 2017 - Elsevier
Speech enhancement and automatic speech recognition (ASR) are most often evaluated in
matched (or multi-condition) settings where the acoustic conditions of the training data …

Multi-channel deep clustering: Discriminative spectral and spatial embeddings for speaker-independent speech separation

ZQ Wang, J Le Roux, JR Hershey - 2018 IEEE International …, 2018 - ieeexplore.ieee.org
The recently-proposed deep clustering algorithm represents a fundamental advance
towards solving the cocktail party problem in the single-channel case. When multiple …

SpatialNet: Extensively learning spatial information for multichannel joint speech separation, denoising and dereverberation

C Quan, X Li - IEEE/ACM Transactions on Audio, Speech, and …, 2024 - ieeexplore.ieee.org
This work proposes a neural network to extensively exploit spatial information for
multichannel joint speech separation, denoising and dereverberation, named SpatialNet. In …

The LOCATA challenge: Acoustic source localization and tracking

C Evers, HW Löllmann, H Mellmann… - … on Audio, Speech …, 2020 - ieeexplore.ieee.org
The ability to localize and track acoustic events is a fundamental prerequisite for equip**
machines with the ability to be aware of and engage with humans in their surrounding …

Memoryless techniques and wireless technologies for indoor localization with the internet of things

S Sadowski, P Spachos… - IEEE Internet of Things …, 2020 - ieeexplore.ieee.org
In recent years, the Internet of Things (IoT) has grown to include the tracking of devices
through the use of indoor positioning systems (IPSs) and location-based services (LBSs) …