- Academic Search

Social physics

M Jusup, P Holme, K Kanazawa, M Takayasu, I Romić… - Physics Reports, 2022 - Elsevier

Recent decades have seen a rise in the use of physics methods to study different societal
phenomena. This development has been due to physicists venturing outside of their …

Cited by 446

[PDF] arxiv.org

An overview of multi-agent reinforcement learning from game theoretical perspective

Y Yang, J Wang - arXiv preprint arXiv:2011.00583, 2020 - arxiv.org

Following the remarkable success of the AlphaGO series, 2019 was a booming year that
witnessed significant advances in multi-agent reinforcement learning (MARL) techniques …

Cited by 351

[PDF] arxiv.org

Foundational challenges in assuring alignment and safety of large language models

U Anwar, A Saparov, J Rando, D Paleka… - arXiv preprint arXiv …, 2024 - arxiv.org

This work identifies 18 foundational challenges in assuring the alignment and safety of large
language models (LLMs). These challenges are organized into three different categories …

Cited by 116

[PDF] neurips.cc

The surprising effectiveness of PPO in cooperative multi-agent games

C Yu, A Velu, E Vinitsky, J Gao… - Advances in …, 2022 - proceedings.neurips.cc

Proximal Policy Optimization (PPO) is a ubiquitous on-policy reinforcement learning
algorithm but is significantly less utilized than off-policy learning algorithms in multi-agent …

Cited by 1531

[PDF] springer.com

Multi-agent deep reinforcement learning: a survey

S Gronauer, K Diepold - Artificial Intelligence Review, 2022 - Springer

The advances in reinforcement learning have recorded sublime success in various domains.
Although the multi-agent domain has been overshadowed by its single-agent counterpart …

Cited by 699

[PDF] arxiv.org

A survey of meta-reinforcement learning

J Beck, R Vuorio, EZ Liu, Z Xiong, L Zintgraf… - arXiv preprint arXiv …, 2023 - arxiv.org

While deep reinforcement learning (RL) has fueled multiple high-profile successes in
machine learning, it is held back from more widespread adoption by its often poor data …

Cited by 169

[PDF] neurips.cc

Theseus: A library for differentiable nonlinear optimization

L Pineda, T Fan, M Monge… - Advances in …, 2022 - proceedings.neurips.cc

We present Theseus, an efficient application-agnostic open source library for differentiable
nonlinear least squares (DNLS) optimization built on PyTorch, providing a common …

Cited by 95

[PDF] jair.org

Towards continual reinforcement learning: A review and perspectives

K Khetarpal, M Riemer, I Rish, D Precup - Journal of Artificial Intelligence …, 2022 - jair.org

In this article, we aim to provide a literature review of different formulations and approaches
to continual reinforcement learning (RL), also known as lifelong or non-stationary RL. We …

Cited by 336

[PDF] nowpublishers.com

An introduction to deep reinforcement learning

V François-Lavet, P Henderson, R Islam… - … and Trends® in …, 2018 - nowpublishers.com

Deep reinforcement learning is the combination of reinforcement learning (RL) and deep
learning. This field of research has been able to solve a wide range of complex …

Cited by 1959

[PDF] arxiv.org

Deep reinforcement learning for multiagent systems: A review of challenges, solutions, and applications

TT Nguyen, ND Nguyen… - IEEE transactions on …, 2020 - ieeexplore.ieee.org

Reinforcement learning (RL) algorithms have been around for decades and employed to
solve various sequential decision-making problems. These algorithms, however, have faced …

Cited by 1187

Saved in My library

Learning with opponent-learning awareness

Social physics

An overview of multi-agent reinforcement learning from game theoretical perspective

Foundational challenges in assuring alignment and safety of large language models

The surprising effectiveness of PPO in cooperative multi-agent games

Multi-agent deep reinforcement learning: a survey

A survey of meta-reinforcement learning

Theseus: A library for differentiable nonlinear optimization

Towards continual reinforcement learning: A review and perspectives

An introduction to deep reinforcement learning

Deep reinforcement learning for multiagent systems: A review of challenges, solutions, and applications