Planning with theory of mind

MK Ho, R Saxe, F Cushman - Trends in Cognitive Sciences, 2022 - cell.com
Understanding Theory of Mind should begin with an analysis of the problems it solves. The
traditional answer is that Theory of Mind is used for predicting others' thoughts and actions …

An overview of multi-agent reinforcement learning from game theoretical perspective

Y Yang, J Wang - arxiv preprint arxiv:2011.00583, 2020 - arxiv.org
Following the remarkable success of the AlphaGO series, 2019 was a booming year that
witnessed significant advances in multi-agent reinforcement learning (MARL) techniques …

[BUKU][B] Reinforcement Learning and Stochastic Optimization: A Unified Framework for Sequential Decisions: by Warren B. Powell (ed.), Wiley (2022). Hardback. ISBN …

I Halperin - 2022 - Taylor & Francis
What is reinforcement learning? How is reinforcement learning different from stochastic
optimization? And finally, can it be used for applications to quantitative finance for my current …

[BUKU][B] A concise introduction to decentralized POMDPs

FA Oliehoek, C Amato - 2016 - Springer
This book presents an overview of formal decision making methods for decentralized
cooperative systems. It is aimed at graduate students and researchers in the fields of …

Learning to drive from a world on rails

D Chen, V Koltun, P Krähenbühl - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com
We learn an interactive vision-based driving policy from pre-recorded driving logs via a
model-based approach. A forward model of the world supervises a driving policy that …

A survey of multi-objective sequential decision-making

DM Roijers, P Vamplew, S Whiteson… - Journal of Artificial …, 2013 - jair.org
Sequential decision-making problems with multiple objectives arise naturally in practice and
pose unique challenges for research in decision-theoretic planning and learning, which has …

[BUKU][B] Probabilistic graphical models: principles and techniques

D Koller, N Friedman - 2009 - books.google.com
A general framework for constructing and using probabilistic models of complex systems that
would enable a computer to use available information for making decisions. Most tasks …

On the convergence of projective-simulation–based reinforcement learning in Markov decision processes

WL Boyajian, J Clausen, LM Trenkwalder… - Quantum machine …, 2020 - Springer
In recent years, the interest in leveraging quantum effects for enhancing machine learning
tasks has significantly increased. Many algorithms speeding up supervised and …

[HTML][HTML] Deliberation for autonomous robots: A survey

F Ingrand, M Ghallab - Artificial Intelligence, 2017 - Elsevier
Autonomous robots facing a diversity of open environments and performing a variety of tasks
and interactions need explicit deliberation in order to fulfill their missions. Deliberation is …

[BUKU][B] Approximate Dynamic Programming: Solving the curses of dimensionality

WB Powell - 2007 - books.google.com
A complete and accessible introduction to the real-world applications of approximate
dynamic programming With the growing levels of sophistication in modern-day operations, it …