Exploration-exploitation in multi-agent learning: Catastrophe theory meets game theory

S Leonardos, G Piliouras - Artificial Intelligence, 2022 - Elsevier
Exploration-exploitation is a powerful and practical tool in multi-agent learning (MAL);
however, its effects are far from understood. To make progress in this direction, we study a …

Interferobot: aligning an optical interferometer by a reinforcement learning agent

D Sorokin, A Ulanov, E Sazhina… - Advances in Neural …, 2020 - proceedings.neurips.cc
Limitations in acquiring training data restrict potential applications of deep reinforcement
learning (RL) methods to the training of real-world robots. Here we train an RL agent to align …

Ghost In the Grid: Challenges for Reinforcement Learning in Grid World Environments

C Bamford - 2023 - qmro.qmul.ac.uk
The current state-of-the-art deep reinforcement learning techniques require agents to gather
large amounts of diverse experiences to train effective and general models. In addition …

Contrastive introspection (ConSpec) to rapidly identify invariant steps for success

C Sun, W Yang, B Alsbury-Nealy, Y Bengio… - 2022 - openreview.net
Reinforcement learning (RL) algorithms have achieved notable success in recent years, but
still struggle with fundamental issues in long-term credit assignment. It remains difficult to …

[PDF][PDF] Value-Based Reinforcement Learning for Sequence-to-Sequence Models

F Retkowski, A Waibel - ala2021.vub.ac.be
Seq2seq models offer great promise for sequence generation problems such as machine
translation, text summarization, or dialogue generation. Nevertheless, fully supervised …