Configurable DOA estimation using incremental learning

Y Xiao, RK Das - arXiv preprint arXiv:2407.03661, 2024 - arxiv.org
This study introduces a progressive neural network (PNN) model for direction of arrival
(DOA) estimation, DOA-PNN, addressing the challenge due to catastrophic forgetting in …

IPDnet: A universal direct-path IPD estimation network for sound source localization

Y Wang, B Yang, X Li - IEEE/ACM Transactions on Audio …, 2024 - ieeexplore.ieee.org
Extracting direct-path spatial feature is crucial for sound source localization in adverse
acoustic environments. This paper proposes IPDnet, a neural network that estimates direct …

Eliminating quantization errors in classification-based sound source localization

L Feng, XL Zhang, X Li - Neural Networks, 2025 - Elsevier
Sound Source Localization (SSL) involves estimating the Direction of Arrival (DOA)
of sound sources. Since the DOA estimation output space is continuous, regression might be …

Learning Multi-Dimensional Speaker Localization: Axis Partitioning, Unbiased Label Distribution, and Data Augmentation

L Feng, Y Gong, Z Liu, XL Zhang… - IEEE/ACM Transactions …, 2024 - ieeexplore.ieee.org
Multi-dimensional speaker localization (SL) aims to estimate the two- or three-dimensional
locations of speakers. A recent advancement in multi-dimensional SL is the end-to-end deep …

Mask-Weighted Spatial Likelihood Coding for Speaker-Independent Joint Localization and Mask Estimation

J Kienegger, A Mannanova, T Gerkmann - arXiv preprint arXiv:2410.19595, 2024 - arxiv.org
Due to their robustness and flexibility, neural-driven beamformers are a popular choice for
speech separation in challenging environments with a varying amount of simultaneous …

GLFER-Net: a polyphonic sound source localization and detection network based on global-local feature extraction and recalibration

M Ma, Y Hu, L He, H Huang - EURASIP Journal on Audio, Speech, and …, 2024 - Springer
Polyphonic sound source localization and detection (SSLD) task aims to recognize the
categories of sound events, identify their onset and offset times, and detect their …

Robust Spatial Filtering Network for Separating Speech in the Direction of Interest

D Liu, D Li, C Ma, X Jia - 2023 8th International Conference on …, 2023 - ieeexplore.ieee.org
In recent years, target speech separation has drawn a lot of attention with the development
of deep-learning methods. The target speech from the specific direction-of-interest (DOI) can …

Deeply supervised curriculum learning for deep neural network-based sound source localization

MS Baek, JY Yang, JH Chang - … of the Annual Conference of the …, 2023 - isca-archive.org
Deep neural network (DNN) has made impressive progress in sound source localization
(SSL) tasks with the hard n-hot labels that represent specific directions-of-arrivals (DOAs) …