Survey on large language model-enhanced reinforcement learning: Concept, taxonomy, and methods

Y Cao, H Zhao, Y Cheng, T Shu, Y Chen… - … on Neural Networks …, 2024 - ieeexplore.ieee.org
With extensive pretrained knowledge and high-level general capabilities, large language
models (LLMs) emerge as a promising avenue to augment reinforcement learning (RL) in …

Vision-language models as a source of rewards

K Baumli, S Baveja, F Behbahani, H Chan… - arxiv preprint arxiv …, 2023 - arxiv.org
Building generalist agents that can accomplish many goals in rich open-ended
environments is one of the research frontiers for reinforcement learning. A key limiting factor …

Generative ai for self-adaptive systems: State of the art and research roadmap

J Li, M Zhang, N Li, D Weyns, Z **, K Tei - ACM Transactions on …, 2024 - dl.acm.org
Self-adaptive systems (SASs) are designed to handle changes and uncertainties through a
feedback loop with four core functionalities: monitoring, analyzing, planning, and execution …

Rl-vlm-f: Reinforcement learning from vision language foundation model feedback

Y Wang, Z Sun, J Zhang, Z **an, E Biyik, D Held… - arxiv preprint arxiv …, 2024 - arxiv.org
Reward engineering has long been a challenge in Reinforcement Learning (RL) research,
as it often requires extensive human effort and iterative processes of trial-and-error to design …

Curricullm: Automatic task curricula design for learning complex robot skills using large language models

K Ryu, Q Liao, Z Li, K Sreenath, N Mehr - arxiv preprint arxiv:2409.18382, 2024 - arxiv.org
Curriculum learning is a training mechanism in reinforcement learning (RL) that facilitates
the achievement of complex policies by progressively increasing the task difficulty during …

FreeMotion: MoCap-Free Human Motion Synthesis with Multimodal Large Language Models

Z Zhang, Y Li, H Huang, M Lin, L Yi - European Conference on Computer …, 2024 - Springer
Human motion synthesis is a fundamental task in computer animation. Despite recent
progress in this field utilizing deep learning and motion capture data, existing methods are …