Towards continual reinforcement learning: A review and perspectives

K Khetarpal, M Riemer, I Rish, D Precup - Journal of Artificial Intelligence …, 2022 - jair.org
In this article, we aim to provide a literature review of different formulations and approaches
to continual reinforcement learning (RL), also known as lifelong or non-stationary RL. We …

Webshop: Towards scalable real-world web interaction with grounded language agents

S Yao, H Chen, J Yang… - Advances in Neural …, 2022 - proceedings.neurips.cc
Most existing benchmarks for grounding language in interactive environments either lack
realistic linguistic elements, or prove difficult to scale up due to substantial human …

When do transformers shine in rl? decoupling memory from credit assignment

T Ni, M Ma, B Eysenbach… - Advances in Neural …, 2023 - proceedings.neurips.cc
Reinforcement learning (RL) algorithms face two distinct challenges: learning effective
representations of past and present observations, and determining how actions influence …

Facing off world model backbones: Rnns, transformers, and s4

F Deng, J Park, S Ahn - Advances in Neural Information …, 2023 - proceedings.neurips.cc
World models are a fundamental component in model-based reinforcement learning
(MBRL). To perform temporally extended and consistent simulations of the future in partially …

CrowdTransfer: Enabling Crowd Knowledge Transfer in AIoT Community

Y Liu, B Guo, N Li, Y Ding, Z Zhang… - … Surveys & Tutorials, 2024 - ieeexplore.ieee.org
Artificial Intelligence of Things (AIoT) is an emerging frontier based on the deep fusion of
Internet of Things (IoT) and Artificial Intelligence (AI) technologies. The fundamental goal of …

On the link between conscious function and general intelligence in humans and machines

A Juliani, K Arulkumaran, S Sasai, R Kanai - arxiv preprint arxiv …, 2022 - arxiv.org
In popular media, there is often a connection drawn between the advent of awareness in
artificial agents and those same agents simultaneously achieving human or superhuman …

Large-scale retrieval for reinforcement learning

P Humphreys, A Guez, O Tieleman… - Advances in …, 2022 - proceedings.neurips.cc
Effective decision making involves flexibly relating past experiences and relevant contextual
information to a novel situation. In deep reinforcement learning (RL), the dominant paradigm …

Memory gym: Partially observable challenges to memory-based agents

M Pleines, M Pallasch, F Zimmer… - … conference on learning …, 2023 - openreview.net
Memory Gym is a novel benchmark for challenging Deep Reinforcement Learning agents to
memorize events across long sequences, be robust to noise, and generalize. It consists of …

Large language models can segment narrative events similarly to humans

S Michelmann, M Kumar, KA Norman… - Behavior Research …, 2025 - Springer
Humans perceive discrete events such as “restaurant visits” and “train rides” in their
continuous experience. One important prerequisite for studying human event perception is …

Planning irregular object packing via hierarchical reinforcement learning

S Huang, Z Wang, J Zhou, J Lu - IEEE robotics and automation …, 2022 - ieeexplore.ieee.org
Object packing by autonomous robots is an important challenge in warehouses and logistics
industry. Most conventional data-driven packing planning approaches focus on regular …