Data augmentation using llms: Data perspectives, learning paradigms and challenges

B Ding, C Qin, R Zhao, T Luo, X Li… - Findings of the …, 2024 - aclanthology.org
In the rapidly evolving field of large language models (LLMs), data augmentation (DA) has
emerged as a pivotal technique for enhancing model performance by diversifying training …

Large language models and video games: A preliminary sco** review

P Sweetser - Proceedings of the 6th ACM Conference on …, 2024 - dl.acm.org
Large language models (LLMs) hold interesting potential for the design, development, and
research of video games. Building on the decades of prior research on generative AI in …

A survey of reinforcement learning from human feedback

T Kaufmann, P Weng, V Bengs… - arxiv preprint arxiv …, 2023 - arxiv.org
Reinforcement learning from human feedback (RLHF) is a variant of reinforcement learning
(RL) that learns from human feedback instead of relying on an engineered reward function …

Reinforcement Learning: An Overview

K Murphy - arxiv preprint arxiv:2412.05265, 2024 - arxiv.org
This manuscript gives a big-picture, up-to-date overview of the field of (deep) reinforcement
learning and sequential decision making, covering value-based RL, policy-gradient …

GAVEL: Generating games via evolution and language models

G Todd, A Padula, M Stephenson, É Piette… - arxiv preprint arxiv …, 2024 - arxiv.org
Automatically generating novel and interesting games is a complex task. Challenges include
representing game rules in a computationally workable form, searching through the large …

[PDF][PDF] Structure in reinforcement learning: A survey and open problems

A Mohan, A Zhang, M Lindauer - arxiv preprint arxiv:2306.16021, 2023 - academia.edu
Reinforcement Learning (RL), bolstered by the expressive capabilities of Deep Neural
Networks (DNNs) for function approximation, has demonstrated considerable success in …

LLM-empowered state representation for reinforcement learning

B Wang, Y Qu, Y Jiang, J Shao, C Liu, W Yang… - arxiv preprint arxiv …, 2024 - arxiv.org
Conventional state representations in reinforcement learning often omit critical task-related
details, presenting a significant challenge for value networks in establishing accurate …

Balrog: Benchmarking agentic llm and vlm reasoning on games

D Paglieri, B Cupiał, S Coward, U Piterbarg… - arxiv preprint arxiv …, 2024 - arxiv.org
Large Language Models (LLMs) and Vision Language Models (VLMs) possess extensive
knowledge and exhibit promising reasoning abilities; however, they still struggle to perform …

A survey on large language model-based game agents

S Hu, T Huang, F Ilhan, S Tekin, G Liu… - arxiv preprint arxiv …, 2024 - arxiv.org
The development of game agents holds a critical role in advancing towards Artificial General
Intelligence (AGI). The progress of LLMs and their multimodal counterparts (MLLMs) offers …

diff History for Long-Context Language Agents

U Piterbarg, L Pinto, R Fergus - arxiv preprint arxiv:2312.07540, 2023 - arxiv.org
Language Models (LMs) offer an exciting solution for general-purpose embodied control.
However, a key technical issue arises when using an LM-based controller: environment …