[HTML][HTML] Integrating machine learning with human knowledge
Machine learning has been heavily researched and widely used in many disciplines.
However, achieving high accuracy requires a large amount of data that is sometimes …
However, achieving high accuracy requires a large amount of data that is sometimes …
Reward learning from human preferences and demonstrations in atari
To solve complex real-world problems with reinforcement learning, we cannot rely on
manually specified reward functions. Instead, we need humans to communicate an objective …
manually specified reward functions. Instead, we need humans to communicate an objective …
Shared autonomy via deep reinforcement learning
In shared autonomy, user input is combined with semi-autonomous control to achieve a
common goal. The goal is often unknown ex-ante, so prior work enables agents to infer the …
common goal. The goal is often unknown ex-ante, so prior work enables agents to infer the …
Reinforcement learning with predefined and inferred reward machines in stochastic games
This paper focuses on Multi-Agent Reinforcement Learning (MARL) in non-cooperative
stochastic games, particularly addressing the challenge of task completion characterized by …
stochastic games, particularly addressing the challenge of task completion characterized by …
Improving deep reinforcement learning in minecraft with action advice
Training deep reinforcement learning agents complex behaviors in 3D virtual environments
requires significant computational resources. This is especially true in environments with …
requires significant computational resources. This is especially true in environments with …
A framework for learning from demonstration with minimal human effort
We consider robot learning in the context of shared autonomy, where control of the system
can switch between a human teleoperator and autonomous control. In this setting we …
can switch between a human teleoperator and autonomous control. In this setting we …
Training value-aligned reinforcement learning agents using a normative prior
Value alignment is a property of intelligent agents wherein they solely pursue non-harmful
behaviors or human-beneficial goals. We introduce an approach to value-aligned …
behaviors or human-beneficial goals. We introduce an approach to value-aligned …
A Multifaceted Approach to Stock Market Trading Using Reinforcement Learning
In the recent past, algorithmic stock market trading for financial markets has undergone
significant growth and played a major role in investment decisions. Several methods have …
significant growth and played a major role in investment decisions. Several methods have …
Interactive reinforcement learning with inaccurate feedback
Interactive Reinforcement Learning (RL) enables agents to learn from two sources: rewards
taken from observations of the environment, and feedback or advice from a secondary critic …
taken from observations of the environment, and feedback or advice from a secondary critic …
Interactive reinforcement learning from imperfect teachers
Robots can use information from people to improve learning speed or quality. However,
people can have short attention spans and misunderstand tasks. Our work addresses these …
people can have short attention spans and misunderstand tasks. Our work addresses these …