An outlook into the future of egocentric vision

C Plizzari, G Goletto, A Furnari, S Bansal… - International Journal of …, 2024 - Springer
What will the future be? We wonder! In this survey, we explore the gap between current
research in egocentric vision and the ever-anticipated future, where wearable computing …

Learning spatial features from audio-visual correspondence in egocentric videos

S Majumder, Z Al-Halah… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
We propose a self-supervised method for learning representations based on spatial audio-
visual correspondences in egocentric videos. Our method uses a masked auto-encoding …

The Audio-Visual Conversational Graph: From an Egocentric-Exocentric Perspective

W Jia, M Liu, H Jiang, I Ananthabhotla… - Proceedings of the …, 2024 - openaccess.thecvf.com
In recent years the thriving development of research related to egocentric videos has
provided a unique perspective for the study of conversational interactions where both visual …

The Audio-Visual BatVision Dataset for Research on Sight and Sound

A Brunetto, S Hornauer, XY Stella… - 2023 IEEE/RSJ …, 2023 - ieeexplore.ieee.org
Vision research showed remarkable success in understanding our world, propelled by
datasets of images and videos. Sensor data from radar, LiDAR and cameras supports …

PANO-ECHO: PANOramic depth prediction enhancement with ECHO features

X Liu, A Brunetto, S Hornauer… - 2024 IEEE Conference …, 2024 - ieeexplore.ieee.org
Panoramic depth estimation gains importance with more 360° images being widely
available. However, traditional mono-to-depth approaches, optimized for a limited field of …

[HTML][HTML] Efficient learning-based sound propagation for virtual and real-world audio processing applications

AJ Ratnarajah - 2024 - search.proquest.com
Sound propagation is the process by which sound energy travels through a medium, such
as air, to the surrounding environment as sound waves. The room impulse response (RIR) …

[PDF][PDF] Egocentric video understanding across modalities and domains

C Plizzari - 2024 - tesidottorato.depositolegale.it
With the growing popularity of wearable cameras, egocentric vision has become an
increasingly researched area. This perspective offers a direct view from the wearer's …

Novel View Acoustic Parameter Estimation

RF Perez, R Gao, G Mückl, SVA Gari, I Ananthabhotla - openreview.net
The task of Novel View Acoustic Synthesis (NVAS)--generating Room Impulse Responses
(RIRs) for unseen source and receiver positions in a scene--has recently gained traction …