- Academic Search

First return, then explore

A Ecoffet, J Huizinga, J Lehman, KO Stanley, J Clune - Nature, 2021 - nature.com

Reinforcement learning promises to solve complex sequential-decision problems
autonomously by specifying a high-level reward function only. However, reinforcement …

Cited by 422 · 10 versions

[PDF] arxiv.org

Go-explore: a new approach for hard-exploration problems

A Ecoffet, J Huizinga, J Lehman, KO Stanley… - arXiv preprint arXiv …, 2019 - arxiv.org

A grand challenge in reinforcement learning is intelligent exploration, especially when
rewards are sparse or deceptive. Two Atari games serve as benchmarks for such hard …

Cited by 467 · 2 versions

[PDF] jair.org

Revisiting the arcade learning environment: Evaluation protocols and open problems for general agents

MC Machado, MG Bellemare, E Talvitie… - Journal of Artificial …, 2018 - jair.org

The Arcade Learning Environment (ALE) is an evaluation platform that poses the challenge
of building AI agents with general competency across dozens of Atari 2600 games. It …

Cited by 681 · 14 versions

[PDF] gameaibook.org

[BOOK][B] Artificial intelligence and games

GN Yannakakis, J Togelius - 2018 - Springer

Cited by 746 · 9 versions

[PDF] jair.org

A survey of algorithms for black-box safety validation of cyber-physical systems

A Corso, R Moss, M Koren, R Lee… - Journal of Artificial …, 2021 - jair.org

Autonomous cyber-physical systems (CPS) can improve safety and efficiency for
safety-critical applications, but require rigorous testing before deployment. The complexity of these …

Cited by 215 · 9 versions

[PDF] arxiv.org

The benchmark lottery

M Dehghani, Y Tay, AA Gritsenko, Z Zhao… - arXiv preprint arXiv …, 2021 - arxiv.org

The world of empirical machine learning (ML) strongly relies on benchmarks in order to
determine the relative effectiveness of different algorithms and methods. This paper …

Cited by 89 · 4 versions

[PDF] enseeiht.fr

[BOOK][B] Distributional reinforcement learning

MG Bellemare, W Dabney, M Rowland - 2023 - books.google.com

The first comprehensive guide to distributional reinforcement learning, providing a new
mathematical formalism for thinking about decisions from a probabilistic perspective …

Cited by 173 · 9 versions

[PDF] arxiv.org

State of the art control of atari games using shallow reinforcement learning

Y Liang, MC Machado, E Talvitie, M Bowling - arXiv preprint arXiv …, 2015 - arxiv.org

The recently introduced Deep Q-Networks (DQN) algorithm has gained attention as one of
the first successful combinations of deep neural networks and reinforcement learning. Its …

Cited by 148 · 10 versions

[PDF] aaai.org

Best-first width search: Exploration and exploitation in classical planning

N Lipovetzky, H Geffner - Proceedings of the AAAI Conference on …, 2017 - ojs.aaai.org

It has been shown recently that the performance of greedy best-first search (GBFS) for
computing plans that are not necessarily optimal can be improved by adding forms of …

Cited by 134 · 8 versions

[PDF] arxiv.org

Model-free, model-based, and general intelligence

H Geffner - arXiv preprint arXiv:1806.02308, 2018 - arxiv.org

During the 60s and 70s, AI researchers explored intuitions about intelligence by writing
programs that displayed intelligent behavior. Many good ideas came out from this work but …

Cited by 83 · 8 versions
