- Academic Search

Planning with theory of mind

MK Ho, R Saxe, F Cushman - Trends in Cognitive Sciences, 2022 - cell.com

Understanding Theory of Mind should begin with an analysis of the problems it solves. The
traditional answer is that Theory of Mind is used for predicting others' thoughts and actions …

Cited by 122 · 13 versions

[PDF] arxiv.org

An overview of multi-agent reinforcement learning from game theoretical perspective

Y Yang, J Wang - arXiv preprint arXiv:2011.00583, 2020 - arxiv.org

Following the remarkable success of the AlphaGO series, 2019 was a booming year that
witnessed significant advances in multi-agent reinforcement learning (MARL) techniques …

Cited by 351 · 2 versions

[PDF] tandfonline.com

[BOOK][B] Reinforcement Learning and Stochastic Optimization: A Unified Framework for Sequential Decisions: by Warren B. Powell (ed.), Wiley (2022). Hardback. ISBN …

I Halperin - 2022 - Taylor & Francis

What is reinforcement learning? How is reinforcement learning different from stochastic
optimization? And finally, can it be used for applications to quantitative finance for my current …

Cited by 220 · 11 versions

[PDF] neu.edu

[BOOK][B] A concise introduction to decentralized POMDPs

FA Oliehoek, C Amato - 2016 - Springer

This book presents an overview of formal decision making methods for decentralized
cooperative systems. It is aimed at graduate students and researchers in the fields of …

Cited by 1393 · 14 versions

[PDF] thecvf.com

Learning to drive from a world on rails

D Chen, V Koltun, P Krähenbühl - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com

We learn an interactive vision-based driving policy from pre-recorded driving logs via a
model-based approach. A forward model of the world supervises a driving policy that …

Cited by 126 · 11 versions

[PDF] jair.org

A survey of multi-objective sequential decision-making

DM Roijers, P Vamplew, S Whiteson… - Journal of Artificial …, 2013 - jair.org

Sequential decision-making problems with multiple objectives arise naturally in practice and
pose unique challenges for research in decision-theoretic planning and learning, which has …

Cited by 821 · 21 versions

[PDF] psl.eu

[BOOK][B] Probabilistic graphical models: principles and techniques

D Koller, N Friedman - 2009 - books.google.com

A general framework for constructing and using probabilistic models of complex systems that
would enable a computer to use available information for making decisions. Most tasks …

Cited by 11699 · 14 versions

[PDF] springer.com

On the convergence of projective-simulation–based reinforcement learning in Markov decision processes

WL Boyajian, J Clausen, LM Trenkwalder… - Quantum machine …, 2020 - Springer

In recent years, the interest in leveraging quantum effects for enhancing machine learning
tasks has significantly increased. Many algorithms speeding up supervised and …

Cited by 869 · 18 versions

[HTML] sciencedirect.com

[HTML] Deliberation for autonomous robots: A survey

F Ingrand, M Ghallab - Artificial Intelligence, 2017 - Elsevier

Autonomous robots facing a diversity of open environments and performing a variety of tasks
and interactions need explicit deliberation in order to fulfill their missions. Deliberation is …

Cited by 419 · 6 versions

[PDF] princeton.edu

[BOOK][B] Approximate Dynamic Programming: Solving the curses of dimensionality

WB Powell - 2007 - books.google.com

A complete and accessible introduction to the real-world applications of approximate
dynamic programming. With the growing levels of sophistication in modern-day operations, it …

Cited by 5026 · 12 versions
