A survey of inverse reinforcement learning: Challenges, methods and progress

S Arora, P Doshi - Artificial Intelligence, 2021 - Elsevier
Inverse reinforcement learning (IRL) is the problem of inferring the reward function of an
agent, given its policy or observed behavior. Analogous to RL, IRL is perceived both as a …

Markov decision processes with applications in wireless sensor networks: A survey

MA Alsheikh, DT Hoang, D Niyato… - … Surveys & Tutorials, 2015 - ieeexplore.ieee.org
Wireless sensor networks (WSNs) consist of autonomous and resource-limited devices. The
devices cooperate to monitor one or more physical phenomena within an area of interest …

Cooperative inverse reinforcement learning

D Hadfield-Menell, SJ Russell… - Advances in neural …, 2016 - proceedings.neurips.cc
For an autonomous system to be helpful to humans and to pose no unwarranted risks, it
needs to align its values with those of the humans in its environment in such a way that its …

Actor-critic deep reinforcement learning for solving job shop scheduling problems

CL Liu, CC Chang, CJ Tseng - Ieee Access, 2020 - ieeexplore.ieee.org
In the past decades, many optimization methods have been devised and applied to job shop
scheduling problem (JSSP) to find the optimal solution. Many methods assumed that the …

DeepMAG: Deep reinforcement learning with multi-agent graphs for flexible job shop scheduling

JD Zhang, Z He, WH Chan, CY Chow - Knowledge-Based Systems, 2023 - Elsevier
The flexible job shop scheduling (FJSS) is important in real-world factories due to the wide
applicability. FJSS schedules the operations of jobs to be executed by specific machines at …

Decision making in multiagent systems: A survey

Y Rizk, M Awad, EW Tunstel - IEEE Transactions on Cognitive …, 2018 - ieeexplore.ieee.org
Intelligent transport systems, efficient electric grids, and sensor networks for data collection
and analysis are some examples of the multiagent systems (MAS) that cooperate to achieve …

Independent reinforcement learners in cooperative markov games: a survey regarding coordination problems

L Matignon, GJ Laurent, N Le Fort-Piat - The Knowledge …, 2012 - cambridge.org
In the framework of fully cooperative multi-agent systems, independent (non-communicative)
agents that learn by reinforcement must overcome several difficulties to manage to …

[PDF][PDF] Nash Q-learning for general-sum stochastic games

J Hu, MP Wellman - Journal of machine learning research, 2003 - jmlr.org
We extend Q-learning to a noncooperative multiagent context, using the framework of
generalsum stochastic games. A learning agent maintains Q-functions over joint actions …

[PDF][PDF] Dynamic programming for partially observable stochastic games

EA Hansen, DS Bernstein, S Zilberstein - AAAI, 2004 - cdn.aaai.org
We develop an exact dynamic programming algorithm for partially observable stochastic
games (POSGs). The algorithm is a synthesis of dynamic programming for partially …

A benchmark for the comparison of 3-d motion segmentation algorithms

R Tron, R Vidal - 2007 IEEE conference on computer vision …, 2007 - ieeexplore.ieee.org
Over the past few years, several methods for segmenting a scene containing multiple rigidly
moving objects have been proposed. However, most existing methods have been tested on …