Alfworld: Aligning text and embodied environments for interactive learning

M Shridhar, X Yuan, MA Côté, Y Bisk… - arxiv preprint arxiv …, 2020 - arxiv.org
Given a simple request like Put a washed apple in the kitchen fridge, humans can reason in
purely abstract terms by imagining action sequences and scoring their likelihood of success …

Semantic exploration from language abstractions and pretrained representations

A Tam, N Rabinowitz, A Lampinen… - Advances in neural …, 2022 - proceedings.neurips.cc
Effective exploration is a challenge in reinforcement learning (RL). Novelty-based
exploration methods can suffer in high-dimensional state spaces, such as continuous …

Improving intrinsic exploration with language abstractions

J Mu, V Zhong, R Raileanu, M Jiang… - Advances in …, 2022 - proceedings.neurips.cc
Reinforcement learning (RL) agents are particularly hard to train when rewards are sparse.
One common solution is to use intrinsic rewards to encourage agents to explore their …

Representation-driven reinforcement learning

O Nabati, G Tennenholtz… - … Conference on Machine …, 2023 - proceedings.mlr.press
We present a representation-driven framework for reinforcement learning. By representing
policies as estimates of their expected values, we leverage techniques from contextual …

Learning to play chess from textbooks (LEAP): a corpus for evaluating chess moves based on sentiment analysis

H Alrdahi, R Batista-Navarro - arxiv preprint arxiv:2310.20260, 2023 - arxiv.org
Learning chess strategies has been investigated widely, with most studies focussing on
learning from previous games using search algorithms. Chess textbooks encapsulate …

LanGWM: Language Grounded World Model

RPK Poudel, H Pandya, C Zhang, R Cipolla - arxiv preprint arxiv …, 2023 - arxiv.org
Recent advances in deep reinforcement learning have showcased its potential in tackling
complex tasks. However, experiments on visual control tasks have revealed that state-of-the …

Multi-world Model in Continual Reinforcement Learning

K Shen - Proceedings of the AAAI Conference on Artificial …, 2024 - ojs.aaai.org
World Models are made of generative networks that can predict future states of a single
environment which it was trained on. This research proposes a Multi-world Model, a …

Natural Language-based State Representation in Deep Reinforcement Learning

MM Rahman, Y Xue - Findings of the Association for …, 2024 - aclanthology.org
This paper investigates the potential of using natural language descriptions as an alternative
to direct image-based observations for learning policies in reinforcement learning. Due to …

Review of Metrics to Measure the Stability, Robustness and Resilience of Reinforcement Learning

LL Pullum - arxiv preprint arxiv:2203.12048, 2022 - arxiv.org
Reinforcement learning has received significant interest in recent years, due primarily to the
successes of deep reinforcement learning at solving many challenging tasks such as …

Subgoal Proposition Using a Vision-Language Model

J Su, Q Zhang - CoRL 2023 Workshop on Learning Effective …, 2023 - openreview.net
Recent advances in large language models (LLMs) have inspired research on their potential
for robots in real-world tasks. This study investigates whether the architecture of the vision …