- Academic Search

Mastering diverse domains through world models

D Hafner, J Pasukonis, J Ba, T Lillicrap - arXiv preprint arXiv …, 2023 - arxiv.org

Developing a general algorithm that learns to solve tasks across a wide range of
applications has been a fundamental challenge in artificial intelligence. Although current …

Cited by 523

Learning agile soccer skills for a bipedal robot with deep reinforcement learning

T Haarnoja, B Moran, G Lever, SH Huang… - Science Robotics, 2024 - science.org

We investigated whether deep reinforcement learning (deep RL) is able to synthesize
sophisticated and safe movement skills for a low-cost, miniature humanoid robot that can be …

Cited by 118

A practical guide to multi-objective reinforcement learning and planning

CF Hayes, R Rădulescu, E Bargiacchi… - Autonomous Agents and …, 2022 - Springer

Real-world sequential decision-making tasks are generally complex, requiring trade-offs
between multiple, often conflicting, objectives. Despite this, the majority of research in …

Cited by 388

Rewarded soups: towards Pareto-optimal alignment by interpolating weights fine-tuned on diverse rewards

A Rame, G Couairon, C Dancette… - Advances in …, 2024 - proceedings.neurips.cc

Foundation models are first pre-trained on vast unsupervised datasets and then fine-tuned
on labeled data. Reinforcement learning, notably from human feedback (RLHF), can further …

Cited by 97

dm_control: Software and tasks for continuous control

S Tunyasuvunakool, A Muldal, Y Doron, S Liu, S Bohez… - Software Impacts, 2020 - Elsevier

The dm_control software package is a collection of Python libraries and task suites for
reinforcement learning agents in an articulated-body simulation. Infrastructure includes a …

Cited by 411

Challenges of real-world reinforcement learning: definitions, benchmarks and analysis

G Dulac-Arnold, N Levine, DJ Mankowitz, J Li… - Machine Learning, 2021 - Springer

Reinforcement learning (RL) has proven its worth in a series of artificial domains, and is
beginning to show some successes in real-world scenarios. However, much of the research …

Cited by 571

Multi-objective GFlowNets

M Jain, SC Raparthy… - International …, 2023 - proceedings.mlr.press

We study the problem of generating diverse candidates in the context of Multi-Objective
Optimization. In many applications of machine learning such as drug discovery and material …

Cited by 71

Pareto set learning for expensive multi-objective optimization

X Lin, Z Yang, X Zhang… - Advances in neural …, 2022 - proceedings.neurips.cc

Expensive multi-objective optimization problems can be found in many real-world
applications, where their objective function evaluations involve expensive computations or …

Cited by 67

Scalar reward is not enough: A response to Silver, Singh, Precup and Sutton (2021)

P Vamplew, BJ Smith, J Källström, G Ramos… - Autonomous Agents and …, 2022 - Springer

The recent paper “Reward is Enough” by Silver, Singh, Precup and Sutton posits that the
concept of reward maximisation is sufficient to underpin all intelligence, both natural and …

Cited by 92

An empirical investigation of the challenges of real-world reinforcement learning

G Dulac-Arnold, N Levine, DJ Mankowitz, J Li… - arXiv preprint arXiv …, 2020 - arxiv.org

Reinforcement learning (RL) has proven its worth in a series of artificial domains, and is
beginning to show some successes in real-world scenarios. However, much of the research …

Cited by 151
