Google Scholar

Rewarded soups: towards Pareto-optimal alignment by interpolating weights fine-tuned on diverse rewards

A Rame, G Couairon, C Dancette… - Advances in …, 2023 - proceedings.neurips.cc

Foundation models are first pre-trained on vast unsupervised datasets and then fine-tuned
on labeled data. Reinforcement learning, notably from human feedback (RLHF), can further …

Cited by 111 - All 7 versions

[PDF] springer.com

A practical guide to multi-objective reinforcement learning and planning

CF Hayes, R Rădulescu, E Bargiacchi… - Autonomous Agents and …, 2022 - Springer

Real-world sequential decision-making tasks are generally complex, requiring trade-offs
between multiple, often conflicting, objectives. Despite this, the majority of research in …

Cited by 405 - All 21 versions

[PDF] arxiv.org

Multi-objective multi-agent decision making: a utility-based analysis and survey

R Rădulescu, P Mannion, DM Roijers… - Autonomous Agents and …, 2020 - Springer

The majority of multi-agent system implementations aim to optimise agents' policies with
respect to a single objective, despite the fact that many real-world problem domains are …

Cited by 170 - All 17 versions

MO-MIX: Multi-objective multi-agent cooperative decision-making with deep reinforcement learning

T Hu, B Luo, C Yang, T Huang - IEEE Transactions on Pattern …, 2023 - ieeexplore.ieee.org

Deep reinforcement learning (RL) has been applied extensively to solve complex
decision-making problems. In many real-world scenarios, tasks often have several conflicting …

Cited by 30 - All 5 versions

[PDF] arxiv.org

Multi-objective deep reinforcement learning

H Mossalam, YM Assael, DM Roijers… - arXiv preprint arXiv …, 2016 - arxiv.org

We propose Deep Optimistic Linear Support Learning (DOL) to solve high-dimensional
multi-objective decision problems where the relative importances of the objectives are not known …

Cited by 194 - All 3 versions

[PDF] academia.edu

Human-aligned artificial intelligence is a multiobjective problem

P Vamplew, R Dazeley, C Foale, S Firmin… - Ethics and information …, 2018 - Springer

As the capabilities of artificial intelligence (AI) systems improve, it becomes important to
constrain their actions to ensure their behaviour remains beneficial to humanity. A variety of …

Cited by 172 - All 10 versions

[PDF] arxiv.org

A multi-objective deep reinforcement learning framework

TT Nguyen, ND Nguyen, P Vamplew… - … Applications of Artificial …, 2020 - Elsevier

This paper introduces a new scalable multi-objective deep reinforcement learning (MODRL)
framework based on deep Q-networks. We develop a high-performance MODRL framework …

Cited by 153 - All 9 versions

[BOOK][B] Metrics and benchmarks for self-aware computing systems

N Herbst, S Becker, S Kounev, H Koziolek, M Maggio… - 2017 - Springer

In this chapter, we propose a list of metrics grouped by the MAPE-K paradigm for quantifying
properties of self-aware computing systems. This set of metrics can be seen as a starting …

Cited by 139 - All 13 versions

[PDF] arxiv.org

Autonomy and intelligence in the computing continuum: Challenges, enablers, and future directions for orchestration

H Kokkonen, L Lovén, NH Motlagh, A Kumar… - arXiv preprint arXiv …, 2022 - arxiv.org

Future AI applications require performance, reliability and privacy that the existing,
cloud-dependant system architectures cannot provide. In this article, we study orchestration in the …

Cited by 37 - All 6 versions

[PDF] aston.ac.uk

Self-improving system integration: Mastering continuous change

K Bellman, J Botev, A Diaconescu, L Esterle… - Future Generation …, 2021 - Elsevier

The research initiative “self-improving system integration” (SISSY) was established with the
goal to master the ever-changing demands of system organisation in the presence of …

Cited by 61 - All 10 versions

A novel adaptive weight selection algorithm for multi-objective multi-agent reinforcement learning
