- Academic Search

K Zhang, Z Yang, T Başar - Handbook of reinforcement learning and …, 2021 - Springer

Recent years have witnessed significant advances in reinforcement learning (RL), which
has registered tremendous success in solving various sequential decision-making problems …

Cited by 1719 · 8 versions

[PDF] arxiv.org

An overview of multi-agent reinforcement learning from game theoretical perspective

Y Yang, J Wang - arXiv preprint arXiv:2011.00583, 2020 - arxiv.org

Following the remarkable success of the AlphaGO series, 2019 was a booming year that
witnessed significant advances in multi-agent reinforcement learning (MARL) techniques …

Cited by 352 · 2 versions

[PDF] jmlr.org

GFlowNet foundations

Y Bengio, S Lahlou, T Deleu, EJ Hu, M Tiwari… - The Journal of Machine …, 2023 - dl.acm.org

Generative Flow Networks (GFlowNets) have been introduced as a method to sample a
diverse set of candidates in an active learning context, with a training objective that makes …

Cited by 227 · 4 versions

[PDF] arxiv.org

The statistical complexity of interactive decision making

DJ Foster, SM Kakade, J Qian, A Rakhlin - arXiv preprint arXiv:2112.13487, 2021 - arxiv.org

A fundamental challenge in interactive learning and decision making, ranging from bandit
problems to reinforcement learning, is to provide sample-efficient, adaptive learning …

Cited by 210 · 6 versions

[PDF] arxiv.org

Dive into deep learning

A Zhang, ZC Lipton, M Li, AJ Smola - arXiv preprint arXiv:2106.11342, 2021 - arxiv.org

This open-source book represents our attempt to make deep learning approachable,
teaching readers the concepts, the context, and the code. The entire book is drafted in …

Cited by 1228 · 9 versions

[PDF] neurips.cc

Hierarchical graph transformer with adaptive node sampling

Z Zhang, Q Liu, Q Hu, CK Lee - Advances in Neural …, 2022 - proceedings.neurips.cc

The Transformer architecture has achieved remarkable success in a number of domains
including natural language processing and computer vision. However, when it comes to …

Cited by 92 · 6 versions

[PDF] tor-lattimore.com

[BOOK][B] Bandit algorithms

T Lattimore, C Szepesvári - 2020 - books.google.com

Decision-making in the face of uncertainty is a significant challenge in machine learning,
and the multi-armed bandit model is a commonly used framework to address it. This …

Cited by 3313 · 9 versions

[PDF] arxiv.org

A modern introduction to online learning

F Orabona - arXiv preprint arXiv:1912.13213, 2019 - arxiv.org

In this monograph, I introduce the basic concepts of Online Learning through a modern view
of Online Convex Optimization. Here, online learning refers to the framework of regret …

Cited by 425 · 3 versions

[PDF] arxiv.org

Derivative-free optimization methods

J Larson, M Menickelly, SM Wild - Acta Numerica, 2019 - cambridge.org

In many optimization problems arising from scientific, engineering and artificial intelligence
applications, objective and constraint functions are available only as the output of a black …

Cited by 514 · 9 versions

[PDF] arxiv.org

Online learning: A comprehensive survey

SCH Hoi, D Sahoo, J Lu, P Zhao - Neurocomputing, 2021 - Elsevier

Online learning represents a family of machine learning methods, where a learner attempts
to tackle some predictive (or any type of decision-making) task by learning from a sequence …

Cited by 901 · 6 versions

The nonstochastic multiarmed bandit problem

Multi-agent reinforcement learning: A selective overview of theories and algorithms