[HTML][HTML] Integrating machine learning with human knowledge

C Deng, X Ji, C Rainey, J Zhang, W Lu - Iscience, 2020 - cell.com
Machine learning has been heavily researched and widely used in many disciplines.
However, achieving high accuracy requires a large amount of data that is sometimes …

Reward learning from human preferences and demonstrations in atari

B Ibarz, J Leike, T Pohlen, G Irving… - Advances in neural …, 2018 - proceedings.neurips.cc
To solve complex real-world problems with reinforcement learning, we cannot rely on
manually specified reward functions. Instead, we need humans to communicate an objective …

Shared autonomy via deep reinforcement learning

S Reddy, AD Dragan, S Levine - arxiv preprint arxiv:1802.01744, 2018 - arxiv.org
In shared autonomy, user input is combined with semi-autonomous control to achieve a
common goal. The goal is often unknown ex-ante, so prior work enables agents to infer the …

Reinforcement learning with predefined and inferred reward machines in stochastic games

J Hu, Y Paliwal, H Kim, Y Wang, Z Xu - Neurocomputing, 2024 - Elsevier
This paper focuses on Multi-Agent Reinforcement Learning (MARL) in non-cooperative
stochastic games, particularly addressing the challenge of task completion characterized by …

Improving deep reinforcement learning in minecraft with action advice

S Frazier, M Riedl - Proceedings of the AAAI conference on artificial …, 2019 - ojs.aaai.org
Training deep reinforcement learning agents complex behaviors in 3D virtual environments
requires significant computational resources. This is especially true in environments with …

A framework for learning from demonstration with minimal human effort

M Rigter, B Lacerda, N Hawes - IEEE Robotics and Automation …, 2020 - ieeexplore.ieee.org
We consider robot learning in the context of shared autonomy, where control of the system
can switch between a human teleoperator and autonomous control. In this setting we …

Training value-aligned reinforcement learning agents using a normative prior

MS Al Nahian, S Frazier, M Riedl… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Value alignment is a property of intelligent agents wherein they solely pursue non-harmful
behaviors or human-beneficial goals. We introduce an approach to value-aligned …

A Multifaceted Approach to Stock Market Trading Using Reinforcement Learning

Y Ansari, S Gillani, M Bukhari, B Lee, M Maqsood… - IEEE …, 2024 - ieeexplore.ieee.org
In the recent past, algorithmic stock market trading for financial markets has undergone
significant growth and played a major role in investment decisions. Several methods have …

Interactive reinforcement learning with inaccurate feedback

TAK Faulkner, ES Short… - 2020 IEEE International …, 2020 - ieeexplore.ieee.org
Interactive Reinforcement Learning (RL) enables agents to learn from two sources: rewards
taken from observations of the environment, and feedback or advice from a secondary critic …

Interactive reinforcement learning from imperfect teachers

TA Kessler Faulkner, A Thomaz - Companion of the 2021 ACM/IEEE …, 2021 - dl.acm.org
Robots can use information from people to improve learning speed or quality. However,
people can have short attention spans and misunderstand tasks. Our work addresses these …