A survey of embodied ai: From simulators to research tasks

J Duan, S Yu, HL Tan, H Zhu… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
There has been an emerging paradigm shift from the era of “internet AI” to “embodied AI,”
where AI algorithms and agents no longer learn from datasets of images, videos or text …

Aligning cyber space with physical world: A comprehensive survey on embodied ai

Y Liu, W Chen, Y Bai, X Liang, G Li, W Gao… - arxiv preprint arxiv …, 2024 - arxiv.org
Embodied Artificial Intelligence (Embodied AI) is crucial for achieving Artificial General
Intelligence (AGI) and serves as a foundation for various applications that bridge cyberspace …

Autogen: Enabling next-gen llm applications via multi-agent conversation framework

Q Wu, G Bansal, J Zhang, Y Wu, S Zhang, E Zhu… - arxiv preprint arxiv …, 2023 - arxiv.org
This technical report presents AutoGen, a new framework that enables development of LLM
applications using multiple agents that can converse with each other to solve tasks. AutoGen …

Agentbench: Evaluating llms as agents

X Liu, H Yu, H Zhang, Y Xu, X Lei, H Lai, Y Gu… - arxiv preprint arxiv …, 2023 - arxiv.org
Large Language Models (LLMs) are becoming increasingly smart and autonomous,
targeting real-world pragmatic missions beyond traditional NLP tasks. As a result, there has …

Pointodyssey: A large-scale synthetic dataset for long-term point tracking

Y Zheng, AW Harley, B Shen… - Proceedings of the …, 2023 - openaccess.thecvf.com
We introduce PointOdyssey, a large-scale synthetic dataset, and data generation framework,
for the training and evaluation of long-term fine-grained tracking algorithms. Our goal is to …

On the opportunities and risks of foundation models

R Bommasani, DA Hudson, E Adeli, R Altman… - arxiv preprint arxiv …, 2021 - arxiv.org
AI is undergoing a paradigm shift with the rise of models (eg, BERT, DALL-E, GPT-3) that are
trained on broad data at scale and are adaptable to a wide range of downstream tasks. We …

A survey of zero-shot generalisation in deep reinforcement learning

R Kirk, A Zhang, E Grefenstette, T Rocktäschel - Journal of Artificial …, 2023 - jair.org
The study of zero-shot generalisation (ZSG) in deep Reinforcement Learning (RL) aims to
produce RL algorithms whose policies generalise well to novel unseen situations at …

Habitat-matterport 3d semantics dataset

K Yadav, R Ramrakhya… - Proceedings of the …, 2023 - openaccess.thecvf.com
Abstract We present the Habitat-Matterport 3D Semantics (HM3DSEM) dataset. HM3DSEM
is the largest dataset of 3D real-world spaces with densely annotated semantics that is …

Stochastic scene-aware motion prediction

M Hassan, D Ceylan, R Villegas… - Proceedings of the …, 2021 - openaccess.thecvf.com
A long-standing goal in computer vision is to capture, model, and realistically synthesize
human behavior. Specifically, by learning from data, our goal is to enable virtual humans to …

Housekeep: Tidying virtual households using commonsense reasoning

Y Kant, A Ramachandran, S Yenamandra… - … on Computer Vision, 2022 - Springer
We introduce Housekeep, a benchmark to evaluate commonsense reasoning in the home
for embodied AI. In Housekeep, an embodied agent must tidy a house by rearranging …