Vision-language models are zero-shot reward models for reinforcement learning

J Rocamonde, V Montesinos, E Nava, E Perez… - arXiv preprint arXiv …, 2023 - arxiv.org
Reinforcement learning (RL) requires either manually specifying a reward function, which is
often infeasible, or learning a reward model from a large amount of human feedback, which …

RAM: Retrieval-based affordance transfer for generalizable zero-shot robotic manipulation

Y Kuang, J Ye, H Geng, J Mao, C Deng… - arXiv preprint arXiv …, 2024 - arxiv.org
This work proposes a retrieve-and-transfer framework for zero-shot robotic manipulation,
dubbed RAM, featuring generalizability across various objects, environments, and …

Active preference-based Gaussian process regression for reward learning and optimization

E Bıyık, N Huynh, MJ Kochenderfer… - … Journal of Robotics …, 2024 - journals.sagepub.com
Designing reward functions is a difficult task in AI and robotics. The complex task of directly
specifying all the desirable behaviors a robot needs to optimize often proves challenging for …

Vision-language models as a source of rewards

K Baumli, S Baveja, F Behbahani, H Chan… - arXiv preprint arXiv …, 2023 - arxiv.org
Building generalist agents that can accomplish many goals in rich open-ended
environments is one of the research frontiers for reinforcement learning. A key limiting factor …

RL-VLM-F: Reinforcement learning from vision language foundation model feedback

Y Wang, Z Sun, J Zhang, Z Xian, E Biyik, D Held… - arXiv preprint arXiv …, 2024 - arxiv.org
Reward engineering has long been a challenge in Reinforcement Learning (RL) research,
as it often requires extensive human effort and iterative processes of trial-and-error to design …

Integrating reinforcement learning with foundation models for autonomous robotics: Methods and perspectives

A Moroncelli, V Soni, AA Shahid, M Maccarini… - arXiv preprint arXiv …, 2024 - arxiv.org
Foundation models (FMs), large deep learning models pre-trained on vast, unlabeled
datasets, exhibit powerful capabilities in understanding complex patterns and generating …

LLM-empowered state representation for reinforcement learning

B Wang, Y Qu, Y Jiang, J Shao, C Liu, W Yang… - arXiv preprint arXiv …, 2024 - arxiv.org
Conventional state representations in reinforcement learning often omit critical task-related
details, presenting a significant challenge for value networks in establishing accurate …

Vision-language model-based human-robot collaboration for smart manufacturing: A state-of-the-art survey

J Fan, Y Yin, T Wang, W Dong, P Zheng… - Frontiers of Engineering …, 2025 - Springer
Human-robot collaboration (HRC) is set to transform the manufacturing paradigm by
leveraging the strengths of human flexibility and robot precision. The recent breakthrough of …

Evolution and prospects of foundation models: From large language models to large multimodal models

Z Chen, L Xu, H Zheng, L Chen… - Computers …, 2024 - search.ebscohost.com
Since the 1950s, when the Turing Test was introduced, there has been notable progress in
machine language intelligence. Language modeling, crucial for AI development, has …

EPO: Hierarchical LLM agents with environment preference optimization

Q Zhao, H Fu, C Sun, G Konidaris - arXiv preprint arXiv:2408.16090, 2024 - arxiv.org
Long-horizon decision-making tasks present significant challenges for LLM-based agents
due to the need for extensive planning over multiple steps. In this paper, we propose a …