Spatial audio signal processing for binaural reproduction of recorded acoustic scenes – review and challenges

B Rafaely, V Tourbabin, E Habets… - Acta …, 2022 - acta-acustica.edpsciences.org
Spatial audio has been studied for several decades, but has seen much renewed interest
recently due to advances in both software and hardware for capture and playback, and the …

A metaverse: Taxonomy, components, applications, and open challenges

SM Park, YG Kim - IEEE Access, 2022 - ieeexplore.ieee.org
Unlike previous studies on the Metaverse based on Second Life, the current Metaverse is
based on the social value of Generation Z that online and offline selves are not different …

Rendering spatial sound for interoperable experiences in the audio metaverse

JM Jot, R Audfray, M Hertensteiner… - 2021 Immersive and …, 2021 - ieeexplore.ieee.org
Interactive audio spatialization technology previously developed for video game authoring
and rendering has evolved into an essential component of platforms enabling shared …

Assessing HRTF preprocessing methods for Ambisonics rendering through perceptual models

I Engel, DFM Goodman, L Picinali - Acta Acustica, 2022 - acta-acustica.edpsciences.org
Binaural rendering of Ambisonics signals is a common way to reproduce spatial audio
content. Processing Ambisonics signals at low spatial orders is desirable in order to reduce …

Cross-modal generative model for visual-guided binaural stereo generation

Z Li, B Zhao, Y Yuan - Knowledge-Based Systems, 2024 - Elsevier
Binaural stereo audio is recorded by imitating the way the human ear receives sound, which
provides people with an immersive listening experience. Existing approaches leverage …

SAQAM: Spatial audio quality assessment metric

P Manocha, A Kumar, B Xu, A Menon, ID Gebru… - arXiv preprint arXiv …, 2022 - arxiv.org
Audio quality assessment is critical for assessing the perceptual realism of sounds.
However, the time and expense of obtaining "gold standard" human judgments limit the …

Spatial upsampling of sparse spherical microphone array signals

T Lübeck, JM Arend… - IEEE/ACM Transactions …, 2023 - ieeexplore.ieee.org
We present a method for spatial upsampling of signals captured with spherical microphone
arrays with a limited number of microphones. The upsampling is performed by adding virtual …

Both Ears Wide Open: Towards Language-Driven Spatial Audio Generation

P Sun, S Cheng, X Li, Z Ye, H Liu, H Zhang… - arXiv preprint arXiv …, 2024 - arxiv.org
Recently, diffusion models have achieved great success in mono-channel audio generation.
However, when it comes to stereo audio generation, the soundscapes often have a complex …

End-to-end paired ambisonic-binaural audio rendering

Y Zhu, Q Kong, J Shi, S Liu, X Ye… - IEEE/CAA Journal of …, 2024 - ieeexplore.ieee.org
Binaural rendering is of great interest to virtual reality and immersive media. Although
humans can naturally use their two ears to perceive the spatial information contained in …

DPLM: A deep perceptual spatial-audio localization metric

P Manocha, A Kumar, B Xu, A Menon… - … IEEE Workshop on …, 2021 - ieeexplore.ieee.org
Subjective evaluations are critical for assessing the perceptual realism of sounds in
audio-synthesis driven technologies like augmented and virtual reality. However, they are …