Physics-informed neural network for volumetric sound field reconstruction of speech signals

M Olivieri, X Karakonstantis, M Pezzoli… - EURASIP Journal on …, 2024 - Springer
Recent developments in acoustic signal processing have seen the integration of deep
learning methodologies, alongside the continued prominence of classical wave expansion …

TaBE: Decoupling spatial and spectral processing with Taylor's unfolding method in the beamspace domain for multi-channel speech enhancement

A Li, G Yu, Z Xu, C Fan, X Li, C Zheng - Information Fusion, 2024 - Elsevier
In recent years, significant advancements have been made in neural beamforming,
leveraging spectral and spatial cues to enhance their performance in multi-channel speech …

Deep Kronecker Product Beamforming for Large-scale Microphone Arrays

W Meng, X Li, A Li, X Luo, S Yan, X Li… - … /ACM Transactions on …, 2024 - ieeexplore.ieee.org
Although deep learning based beamformers have achieved promising performance using
small microphone arrays, they suffer from performance degradation in very challenging …

HOMULA-RIR: A room impulse response dataset for teleconferencing and spatial audio applications acquired through higher-order microphones and uniform linear …

F Miotello, P Ostan, M Pezzoli… - … , Speech, and Signal …, 2024 - ieeexplore.ieee.org
In this paper, we present HOMULA-RIR, a dataset of room impulse responses (RIRs)
acquired using both higher-order microphones (HOMs) and a uniform linear array (ULA), in …

All neural Kronecker product beamforming for speech extraction with large-scale microphone arrays

W Meng, X Li, A Li, J Li, X Li… - ICASSP 2024-2024 IEEE …, 2024 - ieeexplore.ieee.org
Existing frame-wise neural beamformers for speech extraction can obtain promising
performance in relatively high signal-to-noise ratio (SNR) scenarios using small microphone …

Are you Really Alone? Detecting the use of Speech Separation Techniques on Audio Recordings

D Salvi, M Pezzoli, S Mandelli… - … and Security (WIFS), 2023 - ieeexplore.ieee.org
The pervasive influence of digital media has brought about new challenges in verifying the
authenticity and integrity of audio recordings. The ease of editing and altering audio has …

Inference-Adaptive Steering of Neural Networks for Real-Time Area-Based Sound Source Separation

M Strauss, W Mack, ML Valero… - IEEE Signal Processing …, 2025 - ieeexplore.ieee.org
We propose a novel adaptive steering technique that changes the target area of a spatial-
aware multi-microphone sound source separation algorithm during inference without the …

Complex-Valued Physics-Informed Neural Network for Near-Field Acoustic Holography

X Luan, M Olivieri, M Pezzoli… - 2024 32nd European …, 2024 - ieeexplore.ieee.org
We present a novel approach to Near-field Acoustic Holography (NAH) with the introduction
of the Complex-Valued Kirchhoff-Helmholtz Convolutional Neural Network (CV-KHCNN) …

TaBE: Decoupling Spatial and Spectral Processing with Taylor's Unfolding Method for Multi-Channel Speech Enhancement

A Li, G Yu, Z Xu, C Fan, X Li, C Zheng - Available at SSRN 4500736 - papers.ssrn.com
In recent years, significant advancements have been made in neural beamforming,
leveraging spectral and spatial cues to enhance their performance in multi-channel speech …

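Multichannel noise reduction method with neural-network-assisted estimation of the a priori speech presence probability

雷菁, 王劲夫, 杨飞然, 杨军 - 信号处理 (Journal of Signal Processing), 2024 - signal.ejournal.org.cn
Estimation of the noise power spectral density matrix is critical in beamforming. Methods that
estimate the noise power spectral density matrix based on the Multichannel Speech Presence Probability (MCSPP) …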