Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
Human-robot teaming: grand challenges
Abstract Purpose of Review Current real-world interaction between humans and robots is
extremely limited. We present challenges that, if addressed, will enable humans and robots …
extremely limited. We present challenges that, if addressed, will enable humans and robots …
Mimicgen: A data generation system for scalable robot learning using human demonstrations
Imitation learning from a large set of human demonstrations has proved to be an effective
paradigm for building capable robot agents. However, the demonstrations can be extremely …
paradigm for building capable robot agents. However, the demonstrations can be extremely …
Few-shot preference learning for human-in-the-loop rl
While reinforcement learning (RL) has become a more popular approach for robotics,
designing sufficiently informative reward functions for complex tasks has proven to be …
designing sufficiently informative reward functions for complex tasks has proven to be …
Data quality in imitation learning
In supervised learning, the question of data quality and curation has been sidelined in
recent years in favor of increasingly more powerful and expressive models that can ingest …
recent years in favor of increasingly more powerful and expressive models that can ingest …
i-sim2real: Reinforcement learning of robotic policies in tight human-robot interaction loops
Sim-to-real transfer is a powerful paradigm for robotic reinforcement learning. The ability to
train policies in simulation enables safe exploration and large-scale data collection quickly …
train policies in simulation enables safe exploration and large-scale data collection quickly …
Imitation learning by estimating expertise of demonstrators
Many existing imitation learning datasets are collected from multiple demonstrators, each
with different expertise at different parts of the environment. Yet, standard imitation learning …
with different expertise at different parts of the environment. Yet, standard imitation learning …
Learning reward functions from diverse sources of human feedback: Optimally integrating demonstrations and preferences
Reward functions are a common way to specify the objective of a robot. As designing reward
functions can be extremely challenging, a more promising approach is to directly learn …
functions can be extremely challenging, a more promising approach is to directly learn …
Confidence-aware imitation learning from demonstrations with varying optimality
Most existing imitation learning approaches assume the demonstrations are drawn from
experts who are optimal, but relaxing this assumption enables us to use a wider range of …
experts who are optimal, but relaxing this assumption enables us to use a wider range of …
Efficient preference-based reinforcement learning using learned dynamics models
Y Liu, G Datta, E Novoseller… - 2023 IEEE International …, 2023 - ieeexplore.ieee.org
Preference-based reinforcement learning (PbRL) can enable robots to learn to perform tasks
based on an individual's preferences without requiring a hand-crafted re-ward function …
based on an individual's preferences without requiring a hand-crafted re-ward function …
Robotic table tennis: A case study into a high speed learning system
We present a deep-dive into a real-world robotic learning system that, in previous work, was
shown to be capable of hundreds of table tennis rallies with a human and has the ability to …
shown to be capable of hundreds of table tennis rallies with a human and has the ability to …