Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
Look before you leap: Unveiling the power of gpt-4v in robotic vision-language planning
In this study, we are interested in imbuing robots with the capability of physically-grounded
task planning. Recent advancements have shown that large language models (LLMs) …
task planning. Recent advancements have shown that large language models (LLMs) …
Bridgedata v2: A dataset for robot learning at scale
We introduce BridgeData V2, a large and diverse dataset of robotic manipulation behaviors
designed to facilitate research in scalable robot learning. BridgeData V2 contains 53,896 …
designed to facilitate research in scalable robot learning. BridgeData V2 contains 53,896 …
Vip: Towards universal visual reward and representation via value-implicit pre-training
Reward and representation learning are two long-standing challenges for learning an
expanding set of robot manipulation skills from sensory observations. Given the inherent …
expanding set of robot manipulation skills from sensory observations. Given the inherent …
Td-mpc2: Scalable, robust world models for continuous control
TD-MPC is a model-based reinforcement learning (RL) algorithm that performs local
trajectory optimization in the latent space of a learned implicit (decoder-free) world model. In …
trajectory optimization in the latent space of a learned implicit (decoder-free) world model. In …
Goal-conditioned imitation learning using score-based diffusion policies
We propose a new policy representation based on score-based diffusion models (SDMs).
We apply our new policy representation in the domain of Goal-Conditioned Imitation …
We apply our new policy representation in the domain of Goal-Conditioned Imitation …
Hiql: Offline goal-conditioned rl with latent states as actions
Unsupervised pre-training has recently become the bedrock for computer vision and natural
language processing. In reinforcement learning (RL), goal-conditioned RL can potentially …
language processing. In reinforcement learning (RL), goal-conditioned RL can potentially …
Factorized contrastive learning: Going beyond multi-view redundancy
In a wide range of multimodal tasks, contrastive learning has become a particularly
appealing approach since it can successfully learn representations from abundant …
appealing approach since it can successfully learn representations from abundant …
: Temporal Latent Action-Driven Contrastive Loss for Visual Reinforcement Learning
Despite recent progress in reinforcement learning (RL) from raw pixel data, sample
inefficiency continues to present a substantial obstacle. Prior works have attempted to …
inefficiency continues to present a substantial obstacle. Prior works have attempted to …
Inference via interpolation: Contrastive representations provably enable planning and inference
Given time series data, how can we answer questions like what will happen in the
future?''and how did we get here?''These sorts of probabilistic inference questions are …
future?''and how did we get here?''These sorts of probabilistic inference questions are …
Reinforcement learning from passive data via latent intentions
Passive observational data, such as human videos, is abundant and rich in information, yet
remains largely untapped by current RL methods. Perhaps surprisingly, we show that …
remains largely untapped by current RL methods. Perhaps surprisingly, we show that …