Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
A survey of multi-objective sequential decision-making
Sequential decision-making problems with multiple objectives arise naturally in practice and
pose unique challenges for research in decision-theoretic planning and learning, which has …
pose unique challenges for research in decision-theoretic planning and learning, which has …
Multiobjective reinforcement learning: A comprehensive overview
C Liu, X Xu, D Hu - IEEE Transactions on Systems, Man, and …, 2014 - ieeexplore.ieee.org
Reinforcement learning (RL) is a powerful paradigm for sequential decision-making under
uncertainties, and most RL algorithms aim to maximize some numerical value which …
uncertainties, and most RL algorithms aim to maximize some numerical value which …
Rewarded soups: towards pareto-optimal alignment by interpolating weights fine-tuned on diverse rewards
A Rame, G Couairon, C Dancette… - Advances in …, 2023 - proceedings.neurips.cc
Foundation models are first pre-trained on vast unsupervised datasets and then fine-tuned
on labeled data. Reinforcement learning, notably from human feedback (RLHF), can further …
on labeled data. Reinforcement learning, notably from human feedback (RLHF), can further …
A practical guide to multi-objective reinforcement learning and planning
Real-world sequential decision-making tasks are generally complex, requiring trade-offs
between multiple, often conflicting, objectives. Despite this, the majority of research in …
between multiple, often conflicting, objectives. Despite this, the majority of research in …
Scalar reward is not enough: A response to silver, singh, precup and sutton (2021)
The recent paper “Reward is Enough” by Silver, Singh, Precup and Sutton posits that the
concept of reward maximisation is sufficient to underpin all intelligence, both natural and …
concept of reward maximisation is sufficient to underpin all intelligence, both natural and …
[PDF][PDF] Multi-objective reinforcement learning using sets of pareto dominating policies
Many real-world problems involve the optimization of multiple, possibly conflicting
objectives. Multi-objective reinforcement learning (MORL) is a generalization of standard …
objectives. Multi-objective reinforcement learning (MORL) is a generalization of standard …
Empirical evaluation methods for multiobjective reinforcement learning algorithms
While a number of algorithms for multiobjective reinforcement learning have been proposed,
and a small number of applications developed, there has been very little rigorous empirical …
and a small number of applications developed, there has been very little rigorous empirical …
Human-aligned artificial intelligence is a multiobjective problem
As the capabilities of artificial intelligence (AI) systems improve, it becomes important to
constrain their actions to ensure their behaviour remains beneficial to humanity. A variety of …
constrain their actions to ensure their behaviour remains beneficial to humanity. A variety of …
A multi-objective deep reinforcement learning framework
This paper introduces a new scalable multi-objective deep reinforcement learning (MODRL)
framework based on deep Q-networks. We develop a high-performance MODRL framework …
framework based on deep Q-networks. We develop a high-performance MODRL framework …
A novel cache architecture with enhanced performance and security
Z Wang, RB Lee - … 41st IEEE/ACM International Symposium on …, 2008 - ieeexplore.ieee.org
Caches ideally should have low miss rates and short access times, and should be power
efficient at the same time. Such design goals are often contradictory in practice. Recent …
efficient at the same time. Such design goals are often contradictory in practice. Recent …