- Academic Search

H Bai, R Cheng, Y ** - Intelligent Computing, 2023 - spj.science.org

Reinforcement learning (RL) is a machine learning approach that trains agents to maximize
cumulative rewards through interactions with environments. The integration of RL with deep …

Lưu Trích dẫn Trích dẫn 59 bài viết Bài viết có liên quan Tất cả 5 phiên bản

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Personalized soups: Personalized large language model alignment via post-hoc parameter merging

J Jang, S Kim, BY Lin, Y Wang, J Hessel… - arxiv preprint arxiv …, 2023 - arxiv.org

While Reinforcement Learning from Human Feedback (RLHF) aligns Large Language
Models (LLMs) with general, aggregate human preferences, it is suboptimal for learning …

Lưu Trích dẫn Trích dẫn 100 bài viết Bài viết có liên quan Tất cả 3 phiên bản Xem dạng HTML

[Free GPT-4]
[DeepSeek]

[PDF] springer.com

A practical guide to multi-objective reinforcement learning and planning

CF Hayes, R Rădulescu, E Bargiacchi… - Autonomous Agents and …, 2022 - Springer

Real-world sequential decision-making tasks are generally complex, requiring trade-offs
between multiple, often conflicting, objectives. Despite this, the majority of research in …

Lưu Trích dẫn Trích dẫn 420 bài viết Bài viết có liên quan Tất cả 21 phiên bản

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

A toolkit for reliable benchmarking and research in multi-objective reinforcement learning

F Felten, LN Alegre, A Nowe… - Advances in …, 2023 - proceedings.neurips.cc

Multi-objective reinforcement learning algorithms (MORL) extend standard reinforcement
learning (RL) to scenarios where agents must optimize multiple---potentially conflicting …

Lưu Trích dẫn Trích dẫn 35 bài viết Bài viết có liên quan Tất cả 10 phiên bản Xem dạng HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Evolutionary multi-objective reinforcement learning based trajectory control and task offloading in UAV-assisted mobile edge computing

F Song, H **ng, X Wang, S Luo, P Dai… - IEEE Transactions …, 2022 - ieeexplore.ieee.org

This article studies the trajectory control and task offloading (TCTO) problem in an
unmanned aerial vehicle (UAV)-assisted mobile edge computing system, where a UAV flies …

Lưu Trích dẫn Trích dẫn 93 bài viết Bài viết có liên quan Tất cả 5 phiên bản

MO-MIX: Multi-objective multi-agent cooperative decision-making with deep reinforcement learning

T Hu, B Luo, C Yang, T Huang - IEEE Transactions on Pattern …, 2023 - ieeexplore.ieee.org

Deep reinforcement learning (RL) has been applied extensively to solve complex decision-
making problems. In many real-world scenarios, tasks often have several conflicting …

Lưu Trích dẫn Trích dẫn 30 bài viết Bài viết có liên quan Tất cả 5 phiên bản

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Hypervolume maximization: A geometric view of pareto set learning

X Zhang, X Lin, B Xue, Y Chen… - Advances in Neural …, 2023 - proceedings.neurips.cc

This paper presents a novel approach to multiobjective algorithms aimed at modeling the
Pareto set using neural networks. Whereas previous methods mainly focused on identifying …

Lưu Trích dẫn Trích dẫn 21 bài viết Bài viết có liên quan Tất cả 4 phiên bản Xem dạng HTML

[Free GPT-4]
[DeepSeek]

[PDF] acs.org

Discovering high entropy alloy electrocatalysts in vast composition spaces with multiobjective optimization

W Xu, E Diesen, T He, K Reuter… - Journal of the American …, 2024 - ACS Publications

High entropy alloys (HEAs) are a highly promising class of materials for electrocatalysis as
their unique active site distributions break the scaling relations that limit the activity of …

Lưu Trích dẫn Trích dẫn 18 bài viết Bài viết có liên quan Tất cả 12 phiên bản

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Reducing idleness in financial cloud services via multi-objective evolutionary reinforcement learning based load balancer

P Yang, L Zhang, H Liu, G Li - Science China Information Sciences, 2024 - Springer

In recent years, various companies have started to shift their data services from traditional
data centers to the cloud. One of the major motivations is to save on operational costs with …

Lưu Trích dẫn Trích dẫn 40 bài viết Bài viết có liên quan Tất cả 6 phiên bản

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Collaborative ground-space communications via evolutionary multi-objective deep reinforcement learning

J Li, G Sun, Q Wu, D Niyato, J Kang… - IEEE Journal on …, 2024 - ieeexplore.ieee.org

Low Earth Orbit (LEO) satellites have emerged as crucial enablers of direct connections with
remote terrestrial terminals. However, energy limitations and insufficient antenna capabilities …

Lưu Trích dẫn Trích dẫn 12 bài viết Bài viết có liên quan Tất cả 4 phiên bản

Tạo thông báo

Trích dẫn

Tìm kiếm nâng cao

Đã lưu vào Thư viện của tôi

Prediction-guided multi-objective reinforcement learning for continuous robot control

Evolutionary reinforcement learning: A survey

Personalized soups: Personalized large language model alignment via post-hoc parameter merging

A practical guide to multi-objective reinforcement learning and planning

A toolkit for reliable benchmarking and research in multi-objective reinforcement learning

Evolutionary multi-objective reinforcement learning based trajectory control and task offloading in UAV-assisted mobile edge computing

MO-MIX: Multi-objective multi-agent cooperative decision-making with deep reinforcement learning

Hypervolume maximization: A geometric view of pareto set learning

Discovering high entropy alloy electrocatalysts in vast composition spaces with multiobjective optimization

Reducing idleness in financial cloud services via multi-objective evolutionary reinforcement learning based load balancer

Collaborative ground-space communications via evolutionary multi-objective deep reinforcement learning