Augmented/mixed reality audio for hearables: Sensing, control, and rendering

R Gupta, J He, R Ranjan, WS Gan… - IEEE Signal …, 2022 - ieeexplore.ieee.org
Augmented or mixed reality (AR/MR) is emerging as one of the key technologies in the
future of computing. Audio cues are critical for maintaining a high degree of realism, social …

DeepMMSE: A deep learning approach to MMSE-based noise power spectral density estimation

Q Zhang, A Nicolson, M Wang… - … /ACM Transactions on …, 2020 - ieeexplore.ieee.org
An accurate noise power spectral density (PSD) tracker is an indispensable component of a
single-channel speech enhancement system. Bayesian-motivated minimum mean-square …

Insights into deep non-linear filters for improved multi-channel speech enhancement

K Tesch, T Gerkmann - IEEE/ACM Transactions on Audio …, 2022 - ieeexplore.ieee.org
The key advantage of using multiple microphones for speech enhancement is that spatial
filtering can be used to complement the tempo-spectral processing. In a traditional setting …

Speech recognition with a hearing-aid processing scheme combining beamforming with mask-informed speech enhancement

T Green, G Hilkhuysen, M Huckvale… - Trends in …, 2022 - journals.sagepub.com
A signal processing approach combining beamforming with mask-informed speech
enhancement was assessed by measuring sentence recognition in listeners with mild-to …

[HTML][HTML] Brain-informed speech separation (BISS) for enhancement of target speaker in multitalker speech perception

E Ceolini, J Hjortkjær, DDE Wong, J O'Sullivan… - NeuroImage, 2020 - Elsevier
Hearing-impaired people often struggle to follow the speech stream of an individual talker in
noisy environments. Recent studies show that the brain tracks attended speech and that the …

Advances in phase-aware signal processing in speech communication

P Mowlaee, R Saeidi, Y Stylianou - Speech communication, 2016 - Elsevier
During the past three decades, the issue of processing spectral phase has been largely
neglected in speech applications. There is no doubt that the interest of speech processing …

Non-intrusive speech intelligibility prediction for hearing-impaired users using intermediate asr features and human memory models

R Mogridge, G Close, R Sutherland… - ICASSP 2024-2024 …, 2024 - ieeexplore.ieee.org
Neural networks have been successfully used for non-intrusive speech intelligibility
prediction. Recently, the use of feature representations sourced from intermediate layers of …

Cognitive-driven binaural beamforming using EEG-based auditory attention decoding

A Aroudi, S Doclo - IEEE/ACM Transactions on Audio, Speech …, 2020 - ieeexplore.ieee.org
Identifying the target speaker in hearing aid applications is an essential ingredient to
improve speech intelligibility. Recently, a least-squares-based auditory attention decoding …

Spatially selective deep non-linear filters for speaker extraction

K Tesch, T Gerkmann - ICASSP 2023-2023 IEEE International …, 2023 - ieeexplore.ieee.org
In a scenario with multiple persons talking simultaneously, the spatial characteristics of the
signals are the most distinct feature for extracting the target signal. In this work, we develop a …

Non-intrusive speech quality prediction using modulation energies and LSTM-network

B Cauchi, K Siedenburg, JF Santos… - … ACM transactions on …, 2019 - ieeexplore.ieee.org
Many signal processing algorithms have been proposed to improve the quality of speech
recorded in the presence of noise and reverberation. Perceptual measures, ie, listening …