Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
A survey on offline reinforcement learning: Taxonomy, review, and open problems
RF Prudencio, MROA Maximo… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
With the widespread adoption of deep learning, reinforcement learning (RL) has
experienced a dramatic increase in popularity, scaling to previously intractable problems …
experienced a dramatic increase in popularity, scaling to previously intractable problems …
Minigrid & miniworld: Modular & customizable reinforcement learning environments for goal-oriented tasks
We present the Minigrid and Miniworld libraries which provide a suite of goal-oriented 2D
and 3D environments. The libraries were explicitly created with a minimalistic design …
and 3D environments. The libraries were explicitly created with a minimalistic design …
Learning agile soccer skills for a bipedal robot with deep reinforcement learning
We investigated whether deep reinforcement learning (deep RL) is able to synthesize
sophisticated and safe movement skills for a low-cost, miniature humanoid robot that can be …
sophisticated and safe movement skills for a low-cost, miniature humanoid robot that can be …
A generalist agent
Inspired by progress in large-scale language modeling, we apply a similar approach
towards building a single generalist agent beyond the realm of text outputs. The agent …
towards building a single generalist agent beyond the realm of text outputs. The agent …
Roco: Dialectic multi-robot collaboration with large language models
We propose a novel approach to multi-robot collaboration that harnesses the power of pre-
trained large language models (LLMs) for both high-level communication and low-level path …
trained large language models (LLMs) for both high-level communication and low-level path …
The primacy bias in deep reinforcement learning
This work identifies a common flaw of deep reinforcement learning (RL) algorithms: a
tendency to rely on early interactions and ignore useful evidence encountered later …
tendency to rely on early interactions and ignore useful evidence encountered later …
Safety gymnasium: A unified safe reinforcement learning benchmark
Artificial intelligence (AI) systems possess significant potential to drive societal progress.
However, their deployment often faces obstacles due to substantial safety concerns. Safe …
However, their deployment often faces obstacles due to substantial safety concerns. Safe …
Masked world models for visual control
Visual model-based reinforcement learning (RL) has the potential to enable sample-efficient
robot learning from visual observations. Yet the current approaches typically train a single …
robot learning from visual observations. Yet the current approaches typically train a single …
Synthetic experience replay
A key theme in the past decade has been that when large neural networks and large
datasets combine they can produce remarkable results. In deep reinforcement learning (RL) …
datasets combine they can produce remarkable results. In deep reinforcement learning (RL) …
robosuite: A modular simulation framework and benchmark for robot learning
robosuite is a simulation framework for robot learning powered by the MuJoCo physics
engine. It offers a modular design for creating robotic tasks as well as a suite of benchmark …
engine. It offers a modular design for creating robotic tasks as well as a suite of benchmark …