Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
Look before you leap: Unveiling the power of gpt-4v in robotic vision-language planning
In this study, we are interested in imbuing robots with the capability of physically-grounded
task planning. Recent advancements have shown that large language models (LLMs) …
task planning. Recent advancements have shown that large language models (LLMs) …
Manigaussian: Dynamic gaussian splatting for multi-task robotic manipulation
Performing language-conditioned robotic manipulation tasks in unstructured environments
is highly demanded for general intelligent robots. Conventional robotic manipulation …
is highly demanded for general intelligent robots. Conventional robotic manipulation …
Copa: General robotic manipulation through spatial constraints of parts with foundation models
Foundation models pre-trained on web-scale data are shown to encapsulate extensive
world knowledge beneficial for robotic manipulation in the form of task planning. However …
world knowledge beneficial for robotic manipulation in the form of task planning. However …
From 3D point‐cloud data to explainable geometric deep learning: State‐of‐the‐art and future challenges
A Saranti, B Pfeifer, C Gollob… - … : Data Mining and …, 2024 - Wiley Online Library
We present an exciting journey from 3D point‐cloud data (PCD) to the state of the art in
graph neural networks (GNNs) and their evolution with explainable artificial intelligence …
graph neural networks (GNNs) and their evolution with explainable artificial intelligence …
Sugar: Pre-training 3d visual representations for robotics
Learning generalizable visual representations from Internet data has yielded promising
results for robotics. Yet prevailing approaches focus on pre-training 2D representations …
results for robotics. Yet prevailing approaches focus on pre-training 2D representations …
Rise: 3d perception makes real-world robot imitation simple and effective
Precise robot manipulations require rich spatial information in imitation learning. Image-
based policies model object positions from fixed cameras, which are sensitive to camera …
based policies model object positions from fixed cameras, which are sensitive to camera …
SAM-E: leveraging visual foundation model with sequence imitation for embodied manipulation
Acquiring a multi-task imitation policy in 3D manipulation poses challenges in terms of
scene understanding and action prediction. Current methods employ both 3D representation …
scene understanding and action prediction. Current methods employ both 3D representation …
Cage: Causal attention enables data-efficient generalizable robotic manipulation
Generalization in robotic manipulation remains a critical challenge, particularly when scaling
to new environments with limited demonstrations. This paper introduces CAGE, a novel …
to new environments with limited demonstrations. This paper introduces CAGE, a novel …
Visual grounding for object-level generalization in reinforcement learning
Generalization is a pivotal challenge for agents following natural language instructions. To
approach this goal, we leverage a vision-language model (VLM) for visual grounding and …
approach this goal, we leverage a vision-language model (VLM) for visual grounding and …
Leveraging locality to boost sample efficiency in robotic manipulation
Given the high cost of collecting robotic data in the real world, sample efficiency is a
consistently compelling pursuit in robotics. In this paper, we introduce SGRv2, an imitation …
consistently compelling pursuit in robotics. In this paper, we introduce SGRv2, an imitation …