Ai2-thor: An interactive 3d environment for visual ai
E Kolve, R Mottaghi, W Han, E VanderBilt… - ar** models and algorithms enabling embodied agents to navigate and interact …
Habitat 3.0: A co-habitat for humans, avatars and robots
We present Habitat 3.0: a simulation platform for studying collaborative human-robot tasks in
home environments. Habitat 3.0 offers contributions across three dimensions:(1) Accurate …
home environments. Habitat 3.0 offers contributions across three dimensions:(1) Accurate …
Retrospectives on the embodied ai workshop
We present a retrospective on the state of Embodied AI research. Our analysis focuses on
13 challenges presented at the Embodied AI Workshop at CVPR. These challenges are …
13 challenges presented at the Embodied AI Workshop at CVPR. These challenges are …
Auxiliary tasks and exploration enable objectgoal navigation
Abstract ObjectGoal Navigation (ObjectNav) is an embodied task wherein agents are to
navigate to an object instance in an unseen environment. Prior works have shown that end …
navigate to an object instance in an unseen environment. Prior works have shown that end …
Tidee: Tidying up novel rooms using visuo-semantic commonsense priors
We introduce TIDEE, an embodied agent that tidies up a disordered scene based on
learned commonsense object placement and room arrangement priors. TIDEE explores a …
learned commonsense object placement and room arrangement priors. TIDEE explores a …
Embodied Intelligence: A Synergy of Morphology, Action, Perception and Learning
Embodied intelligence emphasizes that the intelligence is affected by the tight coupling of
brain, body and environment. It is continuously and dynamically generated through the …
brain, body and environment. It is continuously and dynamically generated through the …
Selective visual representations improve convergence and generalization for embodied ai
Embodied AI models often employ off the shelf vision backbones like CLIP to encode their
visual observations. Although such general purpose representations encode rich syntactic …
visual observations. Although such general purpose representations encode rich syntactic …
Interpretation of emergent communication in heterogeneous collaborative embodied agents
Communication between embodied AI agents has received increasing attention in recent
years. Despite its use, it is still unclear whether the learned communication is interpretable …
years. Despite its use, it is still unclear whether the learned communication is interpretable …