Aligning cyber space with physical world: A comprehensive survey on embodied ai
Embodied Artificial Intelligence (Embodied AI) is crucial for achieving Artificial General
Intelligence (AGI) and serves as a foundation for various applications that bridge cyberspace …
A survey on integration of large language models with intelligent robots
In recent years, the integration of large language models (LLMs) has revolutionized the field
of robotics, enabling robots to communicate, understand, and reason with human-like …
View selection for 3d captioning via diffusion ranking
Scalable annotation approaches are crucial for constructing extensive 3D-text datasets,
facilitating a broader range of applications. However, existing methods sometimes lead to …
Manipulate-anything: Automating real-world robots using vision-language models
Large-scale endeavors like and widespread community efforts such as Open-X-Embodiment
have contributed to growing the scale of robot demonstration data. However, there is still an …
Towards efficient llm grounding for embodied multi-agent collaboration
Grounding the reasoning ability of large language models (LLMs) for embodied tasks is
challenging due to the complexity of the physical world. Especially, LLM planning for multi …
Rl-vlm-f: Reinforcement learning from vision language foundation model feedback
Reward engineering has long been a challenge in Reinforcement Learning (RL) research,
as it often requires extensive human effort and iterative processes of trial-and-error to design …
Agentgen: Enhancing planning abilities for large language model based agent via environment and task generation
Large Language Model (LLM) based agents have garnered significant attention and are
becoming increasingly popular. Furthermore, planning ability is a crucial component of an …
Grutopia: Dream general robots in a city at scale
Recent works have been exploring the scaling laws in the field of Embodied AI. Given the
prohibitive costs of collecting real-world data, we believe the Simulation-to-Real (Sim2Real) …
What foundation models can bring for robot learning in manipulation: A survey
The realization of universal robots is an ultimate goal of researchers. However, a key hurdle
in achieving this goal lies in the robots' ability to manipulate objects in their unstructured …
Robot learning in the era of foundation models: A survey
The proliferation of Large Language Models (LLMs) has fueled a shift in robot learning
from automation towards general embodied Artificial Intelligence (AI). Adopting foundation …