The rise and potential of large language model based agents: A survey
For a long time, researchers have sought artificial intelligence (AI) that matches or exceeds
human intelligence. AI agents, which are artificial entities capable of sensing the …
human intelligence. AI agents, which are artificial entities capable of sensing the …
Towards continual reinforcement learning: A review and perspectives
In this article, we aim to provide a literature review of different formulations and approaches
to continual reinforcement learning (RL), also known as lifelong or non-stationary RL. We …
to continual reinforcement learning (RL), also known as lifelong or non-stationary RL. We …
Leveraging procedural generation to benchmark reinforcement learning
Abstract We introduce Procgen Benchmark, a suite of 16 procedurally generated game-like
environments designed to benchmark both sample efficiency and generalization in …
environments designed to benchmark both sample efficiency and generalization in …
Unity: A general platform for intelligent agents
Recent advances in artificial intelligence have been driven by the presence of increasingly
realistic and complex simulated environments. However, many of the existing environments …
realistic and complex simulated environments. However, many of the existing environments …
[PDF][PDF] On the measure of intelligence
F Chollet - arxiv preprint arxiv:1911.01547, 2019 - juanmirod.github.io
To make deliberate progress towards more intelligent and more human-like artificial
systems, we need to be following an appropriate feedback signal: we need to be able to …
systems, we need to be following an appropriate feedback signal: we need to be able to …
Quantifying generalization in reinforcement learning
In this paper, we investigate the problem of overfitting in deep reinforcement learning.
Among the most common benchmarks in RL, it is customary to use the same environments …
Among the most common benchmarks in RL, it is customary to use the same environments …
Maximum entropy RL (provably) solves some robust RL problems
Many potential applications of reinforcement learning (RL) require guarantees that the agent
will perform well in the face of disturbances to the dynamics or reward function. In this paper …
will perform well in the face of disturbances to the dynamics or reward function. In this paper …
Contrastive behavioral similarity embeddings for generalization in reinforcement learning
Reinforcement learning methods trained on few environments rarely learn policies that
generalize to unseen environments. To improve generalization, we incorporate the inherent …
generalize to unseen environments. To improve generalization, we incorporate the inherent …
Recurrent model-free rl can be a strong baseline for many pomdps
Many problems in RL, such as meta-RL, robust RL, generalization in RL, and temporal credit
assignment, can be cast as POMDPs. In theory, simply augmenting model-free RL with …
assignment, can be cast as POMDPs. In theory, simply augmenting model-free RL with …
The nethack learning environment
Abstract Progress in Reinforcement Learning (RL) algorithms goes hand-in-hand with the
development of challenging environments that test the limits of current methods. While …
development of challenging environments that test the limits of current methods. While …