EgoExoLearn: A dataset for bridging asynchronous ego- and exo-centric view of procedural activities in real world

Y Huang, G Chen, J Xu, M Zhang… - Proceedings of the …, 2024 - openaccess.thecvf.com
Being able to map the activities of others into one's own point of view is one fundamental
human skill even from a very early age. Taking a step toward understanding this human …

On bringing robots home

NMM Shafiullah, A Rai, H Etukuru, Y Liu, I Misra… - arXiv preprint arXiv …, 2023 - arxiv.org
Throughout history, we have successfully integrated various machines into our homes.
Dishwashers, laundry machines, stand mixers, and robot vacuums are a few recent …

Aria digital twin: A new benchmark dataset for egocentric 3D machine perception

X Pan, N Charron, Y Yang, S Peters… - Proceedings of the …, 2023 - openaccess.thecvf.com
We introduce the Aria Digital Twin (ADT), an egocentric dataset captured using Aria
glasses with extensive object, environment, and human level ground truth. This ADT release …

SceneScript: Reconstructing Scenes with an Autoregressive Structured Language Model

A Avetisyan, C Xie, H Howard-Jenkins, TY Yang… - … on Computer Vision, 2024 - Springer
We introduce SceneScript, a method that directly produces full scene models as a sequence
of structured language commands using an autoregressive, token-based approach. Our …

EgoLifter: Open-World 3D Segmentation for Egocentric Perception

Q Gu, Z Lv, D Frost, S Green, J Straub… - European Conference on …, 2024 - Springer
In this paper we present EgoLifter, a novel system that can automatically segment scenes
captured from egocentric sensors into a complete decomposition of individual 3D objects …

Put myself in your shoes: Lifting the egocentric perspective from exocentric videos

M Luo, Z Xue, A Dimakis, K Grauman - European Conference on Computer …, 2024 - Springer
We investigate exocentric-to-egocentric cross-view translation, which aims to generate a first-
person (egocentric) view of an actor based on a video recording that captures the actor from …

General place recognition survey: Towards real-world autonomy

P Yin, J Jiao, S Zhao, L Xu, G Huang, H Choset… - arXiv preprint arXiv …, 2024 - arxiv.org
In the realm of robotics, the quest for achieving real-world autonomy, capable of executing
large-scale and long-term operations, has positioned place recognition (PR) as a …

Real-time simulated avatar from head-mounted sensors

Z Luo, J Cao, R Khirodkar, A Winkler… - Proceedings of the …, 2024 - openaccess.thecvf.com
We present SimXR, a method for controlling a simulated avatar from information (headset
pose and cameras) obtained from AR/VR headsets. Due to the challenging viewpoint of …

PANDALens: Towards AI-Assisted In-Context Writing on OHMD During Travels

R Cai, N Janaka, Y Chen, L Wang, S Zhao… - Proceedings of the 2024 …, 2024 - dl.acm.org
While effective for recording and sharing experiences, traditional in-context writing tools are
relatively passive and unintelligent, serving more like instruments rather than companions …

EgoGaussian: Dynamic scene understanding from egocentric video with 3D Gaussian splatting

D Zhang, G Li, J Li, M Bressieux, O Hilliges… - arXiv preprint arXiv …, 2024 - arxiv.org
Human activities are inherently complex, often involving numerous object interactions. To
better understand these activities, it is crucial to model their interactions with the …