Google 학술 검색

Explore, exploit or listen: Combining human feedback and policy model to speed up deep reinforcem...

[HTML][HTML] Integrating machine learning with human knowledge

C Deng, X Ji, C Rainey, J Zhang, W Lu - Iscience, 2020 - cell.com

Machine learning has been heavily researched and widely used in many disciplines.
However, achieving high accuracy requires a large amount of data that is sometimes …

저장 인용 177회 인용 관련 학술자료 전체 12개의 버전

[Free GPT-4]

[PDF] neurips.cc

Reward learning from human preferences and demonstrations in atari

B Ibarz, J Leike, T Pohlen, G Irving… - Advances in neural …, 2018 - proceedings.neurips.cc

To solve complex real-world problems with reinforcement learning, we cannot rely on
manually specified reward functions. Instead, we need humans to communicate an objective …

저장 인용 457회 인용 관련 학술자료 전체 7개의 버전 HTML 버전

[Free GPT-4]

[PDF] arxiv.org

Shared autonomy via deep reinforcement learning

S Reddy, AD Dragan, S Levine - arxiv preprint arxiv:1802.01744, 2018 - arxiv.org

In shared autonomy, user input is combined with semi-autonomous control to achieve a
common goal. The goal is often unknown ex-ante, so prior work enables agents to infer the …

Reinforcement learning with predefined and inferred reward machines in stochastic games

J Hu, Y Paliwal, H Kim, Y Wang, Z Xu - Neurocomputing, 2024 - Elsevier

This paper focuses on Multi-Agent Reinforcement Learning (MARL) in non-cooperative
stochastic games, particularly addressing the challenge of task completion characterized by …

저장 인용 3회 인용 관련 학술자료 전체 3개의 버전

[Free GPT-4]

[PDF] aaai.org

Improving deep reinforcement learning in minecraft with action advice

S Frazier, M Riedl - Proceedings of the AAAI conference on artificial …, 2019 - ojs.aaai.org

Training deep reinforcement learning agents complex behaviors in 3D virtual environments
requires significant computational resources. This is especially true in environments with …

저장 인용 39회 인용 관련 학술자료 전체 9개의 버전 HTML 버전

[Free GPT-4]

[PDF] arxiv.org

A framework for learning from demonstration with minimal human effort

M Rigter, B Lacerda, N Hawes - IEEE Robotics and Automation …, 2020 - ieeexplore.ieee.org

We consider robot learning in the context of shared autonomy, where control of the system
can switch between a human teleoperator and autonomous control. In this setting we …

저장 인용 37회 인용 관련 학술자료 전체 9개의 버전

[Free GPT-4]

[PDF] arxiv.org

Training value-aligned reinforcement learning agents using a normative prior

MS Al Nahian, S Frazier, M Riedl… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org

Value alignment is a property of intelligent agents wherein they solely pursue non-harmful
behaviors or human-beneficial goals. We introduce an approach to value-aligned …

저장 인용 22회 인용 관련 학술자료 전체 3개의 버전

[Free GPT-4]

[PDF] ieee.org

A Multifaceted Approach to Stock Market Trading Using Reinforcement Learning

Y Ansari, S Gillani, M Bukhari, B Lee, M Maqsood… - IEEE …, 2024 - ieeexplore.ieee.org

In the recent past, algorithmic stock market trading for financial markets has undergone
significant growth and played a major role in investment decisions. Several methods have …

저장 인용 1회 인용 관련 학술자료

[Free GPT-4]

[PDF] utexas.edu

Interactive reinforcement learning with inaccurate feedback

TAK Faulkner, ES Short… - 2020 IEEE International …, 2020 - ieeexplore.ieee.org

Interactive Reinforcement Learning (RL) enables agents to learn from two sources: rewards
taken from observations of the environment, and feedback or advice from a secondary critic …

저장 인용 27회 인용 관련 학술자료 전체 5개의 버전

[Free GPT-4]

[PDF] acm.org

Interactive reinforcement learning from imperfect teachers

TA Kessler Faulkner, A Thomaz - Companion of the 2021 ACM/IEEE …, 2021 - dl.acm.org

Robots can use information from people to improve learning speed or quality. However,
people can have short attention spans and misunderstand tasks. Our work addresses these …

저장 인용 18회 인용 관련 학술자료

알림 만들기

인용

고급 검색

라이브러리에 저장됨

Explore, exploit or listen: Combining human feedback and policy model to speed up deep reinforcem...

[HTML][HTML] Integrating machine learning with human knowledge

Reward learning from human preferences and demonstrations in atari

Shared autonomy via deep reinforcement learning

Reinforcement learning with predefined and inferred reward machines in stochastic games

Improving deep reinforcement learning in minecraft with action advice

A framework for learning from demonstration with minimal human effort

Training value-aligned reinforcement learning agents using a normative prior

A Multifaceted Approach to Stock Market Trading Using Reinforcement Learning

Interactive reinforcement learning with inaccurate feedback

Interactive reinforcement learning from imperfect teachers