[Књига][B] A concise introduction to decentralized POMDPs

FA Oliehoek, C Amato - 2016 - Springer
This book presents an overview of formal decision making methods for decentralized
cooperative systems. It is aimed at graduate students and researchers in the fields of …

Efficient multi-robot search for a moving target

G Hollinger, S Singh, J Djugash… - … International Journal of …, 2009 - journals.sagepub.com
This paper examines the problem of locating a mobile, non-adversarial target in an indoor
environment using multiple robotic searchers. One way to formulate this problem is to …

Cognitive radio network for the smart grid: Experimental system architecture, control algorithms, security, and microgrid testbed

RC Qiu, Z Hu, Z Chen, N Guo… - … on Smart Grid, 2011 - ieeexplore.ieee.org
This paper systematically investigates the novel idea of applying the next generation
wireless technology, cognitive radio network, for the smart grid. In particular, system …

[PDF][PDF] Inverse reinforcement learning in partially observable environments

JD Choi, KE Kim - Journal of Machine Learning Research, 2011 - jmlr.org
Inverse reinforcement learning (IRL) is the problem of recovering the underlying reward
function from the behavior of an expert. Most of the existing IRL algorithms assume that the …

Optimally solving Dec-POMDPs as continuous-state MDPs

JS Dibangoye, C Amato, O Buffet, F Charpillet - Journal of Artificial …, 2016 - jair.org
Decentralized partially observable Markov decision processes (Dec-POMDPs) provide a
general model for decision-making under uncertainty in decentralized settings, but are …

An experimental design perspective on model-based reinforcement learning

V Mehta, B Paria, J Schneider, S Ermon… - arxiv preprint arxiv …, 2021 - arxiv.org
In many practical applications of RL, it is expensive to observe state transitions from the
environment. For example, in the problem of plasma control for nuclear fusion, computing …

POMDP and MOMDP solutions for structural life-cycle cost minimization under partial and mixed observability

KG Papakonstantinou, CP Andriotis… - Structure and …, 2018 - Taylor & Francis
Scheduling of inspection and maintenance policies during the life-cycle of operating
infrastructure necessitates optimization of long-term objectives in stochastic environments …

Processos de Decisão de Markov: um tutorial

J Pellegrini, J Wainer - Revista de Informática Teórica e Aplicada, 2007 - seer.ufrgs.br
Há situações em que decisões devem ser tomadas em seqüência, e o resultado de cada
decisão não é claro para o tomador de decisões. Estas situações podem ser formuladas …

Multi-modal active perception for information gathering in science missions

A Arora, PM Furlong, R Fitch, S Sukkarieh, T Fong - Autonomous Robots, 2019 - Springer
Robotic science missions in remote environments, such as deep ocean and outer space,
can involve studying phenomena that cannot directly be observed using on-board sensors …

[Књига][B] Cognitive radio communication and networking: Principles and practice

RC Qiu, Z Hu, H Li, MC Wicks - 2012 - books.google.com
The author presents a unified treatment of this highly interdisciplinary topic to help define the
notion of cognitive radio. The book begins with addressing issues such as the fundamental …