- Academic Search

S Arora, P Doshi - Artificial Intelligence, 2021 - Elsevier

Inverse reinforcement learning (IRL) is the problem of inferring the reward function of an
agent, given its policy or observed behavior. Analogous to RL, IRL is perceived both as a …

Opslaan Citeren Geciteerd door 821 Verwante artikelen Alle 6 versies

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Markov decision processes with applications in wireless sensor networks: A survey

MA Alsheikh, DT Hoang, D Niyato… - … Surveys & Tutorials, 2015 - ieeexplore.ieee.org

Wireless sensor networks (WSNs) consist of autonomous and resource-limited devices. The
devices cooperate to monitor one or more physical phenomena within an area of interest …

Opslaan Citeren Geciteerd door 260 Verwante artikelen Alle 16 versies

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Cooperative inverse reinforcement learning

D Hadfield-Menell, SJ Russell… - Advances in neural …, 2016 - proceedings.neurips.cc

For an autonomous system to be helpful to humans and to pose no unwarranted risks, it
needs to align its values with those of the humans in its environment in such a way that its …

Opslaan Citeren Geciteerd door 848 Verwante artikelen Alle 19 versies HTML-versie

[Free GPT-4]
[DeepSeek]

[PDF] ieee.org

Actor-critic deep reinforcement learning for solving job shop scheduling problems

CL Liu, CC Chang, CJ Tseng - Ieee Access, 2020 - ieeexplore.ieee.org

In the past decades, many optimization methods have been devised and applied to job shop
scheduling problem (JSSP) to find the optimal solution. Many methods assumed that the …

Opslaan Citeren Geciteerd door 271 Verwante artikelen Alle 5 versies

DeepMAG: Deep reinforcement learning with multi-agent graphs for flexible job shop scheduling

JD Zhang, Z He, WH Chan, CY Chow - Knowledge-Based Systems, 2023 - Elsevier

The flexible job shop scheduling (FJSS) is important in real-world factories due to the wide
applicability. FJSS schedules the operations of jobs to be executed by specific machines at …

Opslaan Citeren Geciteerd door 60 Verwante artikelen Alle 2 versies

[Free GPT-4]
[DeepSeek]

[PDF] cuni.cz

Decision making in multiagent systems: A survey

Y Rizk, M Awad, EW Tunstel - IEEE Transactions on Cognitive …, 2018 - ieeexplore.ieee.org

Intelligent transport systems, efficient electric grids, and sensor networks for data collection
and analysis are some examples of the multiagent systems (MAS) that cooperate to achieve …

Opslaan Citeren Geciteerd door 260 Verwante artikelen Alle 2 versies

[Free GPT-4]
[DeepSeek]

[PDF] hal.science

Independent reinforcement learners in cooperative markov games: a survey regarding coordination problems

L Matignon, GJ Laurent, N Le Fort-Piat - The Knowledge …, 2012 - cambridge.org

In the framework of fully cooperative multi-agent systems, independent (non-communicative)
agents that learn by reinforcement must overcome several difficulties to manage to …

Opslaan Citeren Geciteerd door 583 Verwante artikelen Alle 13 versies

[Free GPT-4]
[DeepSeek]

[PDF] jmlr.org

[PDF][PDF] Nash Q-learning for general-sum stochastic games

J Hu, MP Wellman - Journal of machine learning research, 2003 - jmlr.org

We extend Q-learning to a noncooperative multiagent context, using the framework of
generalsum stochastic games. A learning agent maintains Q-functions over joint actions …

Opslaan Citeren Geciteerd door 1764 Verwante artikelen Alle 22 versies HTML-versie

[Free GPT-4]
[DeepSeek]

[PDF] aaai.org

[PDF][PDF] Dynamic programming for partially observable stochastic games

EA Hansen, DS Bernstein, S Zilberstein - AAAI, 2004 - cdn.aaai.org

We develop an exact dynamic programming algorithm for partially observable stochastic
games (POSGs). The algorithm is a synthesis of dynamic programming for partially …

Opslaan Citeren Geciteerd door 779 Verwante artikelen Alle 13 versies HTML-versie

[Free GPT-4]
[DeepSeek]

[PDF] jhu.edu

A benchmark for the comparison of 3-d motion segmentation algorithms

R Tron, R Vidal - 2007 IEEE conference on computer vision …, 2007 - ieeexplore.ieee.org

Over the past few years, several methods for segmenting a scene containing multiple rigidly
moving objects have been proposed. However, most existing methods have been tested on …

Opslaan Citeren Geciteerd door 848 Verwante artikelen Alle 14 versies

Melding maken

Citeren

Geavanceerd zoeken

Opgeslagen in Mijn bibliotheek

Sequential optimality and coordination in multiagent systems

A survey of inverse reinforcement learning: Challenges, methods and progress

Markov decision processes with applications in wireless sensor networks: A survey

Cooperative inverse reinforcement learning

Actor-critic deep reinforcement learning for solving job shop scheduling problems

DeepMAG: Deep reinforcement learning with multi-agent graphs for flexible job shop scheduling

Decision making in multiagent systems: A survey

Independent reinforcement learners in cooperative markov games: a survey regarding coordination problems

[PDF][PDF] Nash Q-learning for general-sum stochastic games

[PDF][PDF] Dynamic programming for partially observable stochastic games

A benchmark for the comparison of 3-d motion segmentation algorithms