Google Scholar

Human-robot teaming: grand challenges

M Natarajan, E Seraj, B Altundas, R Paleja, S Ye… - Current Robotics …, 2023 - Springer

Purpose of Review: Current real-world interaction between humans and robots is
extremely limited. We present challenges that, if addressed, will enable humans and robots …

Cited by 39

MimicGen: A data generation system for scalable robot learning using human demonstrations

A Mandlekar, S Nasiriany, B Wen, I Akinola… - arXiv preprint arXiv …, 2023 - arxiv.org

Imitation learning from a large set of human demonstrations has proved to be an effective
paradigm for building capable robot agents. However, the demonstrations can be extremely …

Cited by 79

Few-shot preference learning for human-in-the-loop RL

DJ Hejna III, D Sadigh - Conference on Robot Learning, 2023 - proceedings.mlr.press

While reinforcement learning (RL) has become a more popular approach for robotics,
designing sufficiently informative reward functions for complex tasks has proven to be …

Cited by 93

Data quality in imitation learning

S Belkhale, Y Cui, D Sadigh - Advances in neural …, 2023 - proceedings.neurips.cc

In supervised learning, the question of data quality and curation has been sidelined in
recent years in favor of increasingly more powerful and expressive models that can ingest …

Cited by 41

i-Sim2Real: Reinforcement learning of robotic policies in tight human-robot interaction loops

SW Abeyruwan, L Graesser… - … on Robot Learning, 2023 - proceedings.mlr.press

Sim-to-real transfer is a powerful paradigm for robotic reinforcement learning. The ability to
train policies in simulation enables safe exploration and large-scale data collection quickly …

Cited by 57

Imitation learning by estimating expertise of demonstrators

M Beliaev, A Shih, S Ermon, D Sadigh… - International …, 2022 - proceedings.mlr.press

Many existing imitation learning datasets are collected from multiple demonstrators, each
with different expertise at different parts of the environment. Yet, standard imitation learning …

Cited by 55

Learning reward functions from diverse sources of human feedback: Optimally integrating demonstrations and preferences

E Bıyık, DP Losey, M Palan… - … Journal of Robotics …, 2022 - journals.sagepub.com

Reward functions are a common way to specify the objective of a robot. As designing reward
functions can be extremely challenging, a more promising approach is to directly learn …

Cited by 133

Confidence-aware imitation learning from demonstrations with varying optimality

S Zhang, Z Cao, D Sadigh… - Advances in Neural …, 2021 - proceedings.neurips.cc

Most existing imitation learning approaches assume the demonstrations are drawn from
experts who are optimal, but relaxing this assumption enables us to use a wider range of …

Cited by 56

Efficient preference-based reinforcement learning using learned dynamics models

Y Liu, G Datta, E Novoseller… - 2023 IEEE International …, 2023 - ieeexplore.ieee.org

Preference-based reinforcement learning (PbRL) can enable robots to learn to perform tasks
based on an individual's preferences without requiring a hand-crafted reward function …

Cited by 22

Robotic table tennis: A case study into a high speed learning system

DB D'Ambrosio, J Abelian, S Abeyruwan, M Ahn… - arXiv preprint arXiv …, 2023 - arxiv.org

We present a deep-dive into a real-world robotic learning system that, in previous work, was
shown to be capable of hundreds of table tennis rallies with a human and has the ability to …

Cited by 13

Learning from suboptimal demonstration via self-supervised reward regression

Human-robot teaming: grand challenges
