- Academic Search

B Ding, C Qin, R Zhao, T Luo, X Li… - Findings of the …, 2024 - aclanthology.org

In the rapidly evolving field of large language models (LLMs), data augmentation (DA) has
emerged as a pivotal technique for enhancing model performance by diversifying training …

Save Cite Cited by 64 Related articles All 3 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] arxiv.org

Large language models and video games: A preliminary sco** review

P Sweetser - Proceedings of the 6th ACM Conference on …, 2024 - dl.acm.org

Large language models (LLMs) hold interesting potential for the design, development, and
research of video games. Building on the decades of prior research on generative AI in …

Save Cite Cited by 15 Related articles All 2 versions Free GPT-4

[Free GPT-4]

[PDF] arxiv.org

A survey of reinforcement learning from human feedback

T Kaufmann, P Weng, V Bengs… - arxiv preprint arxiv …, 2023 - arxiv.org

Reinforcement learning from human feedback (RLHF) is a variant of reinforcement learning
(RL) that learns from human feedback instead of relying on an engineered reward function …

Save Cite Cited by 115 Related articles All 4 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] arxiv.org

Reinforcement Learning: An Overview

K Murphy - arxiv preprint arxiv:2412.05265, 2024 - arxiv.org

This manuscript gives a big-picture, up-to-date overview of the field of (deep) reinforcement
learning and sequential decision making, covering value-based RL, policy-gradient …

Save Cite Cited by 1 Related articles View as HTML

[Free GPT-4]

[PDF] arxiv.org

GAVEL: Generating games via evolution and language models

G Todd, A Padula, M Stephenson, É Piette… - arxiv preprint arxiv …, 2024 - arxiv.org

Automatically generating novel and interesting games is a complex task. Challenges include
representing game rules in a computationally workable form, searching through the large …

Save Cite Cited by 8 Related articles All 9 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] academia.edu

[PDF][PDF] Structure in reinforcement learning: A survey and open problems

A Mohan, A Zhang, M Lindauer - arxiv preprint arxiv:2306.16021, 2023 - academia.edu

Reinforcement Learning (RL), bolstered by the expressive capabilities of Deep Neural
Networks (DNNs) for function approximation, has demonstrated considerable success in …

Save Cite Cited by 17 Related articles All 2 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] arxiv.org

LLM-empowered state representation for reinforcement learning

B Wang, Y Qu, Y Jiang, J Shao, C Liu, W Yang… - arxiv preprint arxiv …, 2024 - arxiv.org

Conventional state representations in reinforcement learning often omit critical task-related
details, presenting a significant challenge for value networks in establishing accurate …

Save Cite Cited by 5 Related articles View as HTML

[Free GPT-4]

[PDF] arxiv.org

Balrog: Benchmarking agentic llm and vlm reasoning on games

D Paglieri, B Cupiał, S Coward, U Piterbarg… - arxiv preprint arxiv …, 2024 - arxiv.org

Large Language Models (LLMs) and Vision Language Models (VLMs) possess extensive
knowledge and exhibit promising reasoning abilities; however, they still struggle to perform …

Save Cite Cited by 3 Related articles All 2 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] arxiv.org

A survey on large language model-based game agents

S Hu, T Huang, F Ilhan, S Tekin, G Liu… - arxiv preprint arxiv …, 2024 - arxiv.org

The development of game agents holds a critical role in advancing towards Artificial General
Intelligence (AGI). The progress of LLMs and their multimodal counterparts (MLLMs) offers …

Save Cite Cited by 34 Related articles All 2 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] arxiv.org

diff History for Long-Context Language Agents

U Piterbarg, L Pinto, R Fergus - arxiv preprint arxiv:2312.07540, 2023 - arxiv.org

Language Models (LMs) offer an exciting solution for general-purpose embodied control.
However, a key technical issue arises when using an LM-based controller: environment …

Save Cite Cited by 1 Related articles All 2 versions Free GPT-4 View as HTML

Create alert

Cite

Advanced search

Saved to My library

Motif: Intrinsic motivation from artificial intelligence feedback

Data augmentation using llms: Data perspectives, learning paradigms and challenges

Large language models and video games: A preliminary sco** review

A survey of reinforcement learning from human feedback

Reinforcement Learning: An Overview

GAVEL: Generating games via evolution and language models

[PDF][PDF] Structure in reinforcement learning: A survey and open problems

LLM-empowered state representation for reinforcement learning

Balrog: Benchmarking agentic llm and vlm reasoning on games

A survey on large language model-based game agents

diff History for Long-Context Language Agents