- Academic Search

Prompting decision transformer for few-shot policy generalization

M Xu, Y Shen, S Zhang, Y Lu, D Zhao… - international …, 2022 - proceedings.mlr.press

Humans can leverage prior experience and learn novel tasks from a handful of
demonstrations. In contrast to offline meta-reinforcement learning, which aims to achieve …

Cited by: 144

[PDF] thecvf.com

Social NCE: Contrastive learning of socially-aware motion representations

Y Liu, Q Yan, A Alahi - Proceedings of the IEEE/CVF …, 2021 - openaccess.thecvf.com

Learning socially-aware motion representations is at the core of recent advances in
multi-agent problems, such as human motion forecasting and robot navigation in crowds. Despite …

Cited by: 130

[PDF] neurips.cc

State regularized policy optimization on data with dynamics shift

Z Xue, Q Cai, S Liu, D Zheng… - Advances in neural …, 2024 - proceedings.neurips.cc

In many real-world scenarios, Reinforcement Learning (RL) algorithms are trained on data
with dynamics shift, i.e., with different underlying environment dynamics. A majority of current …

Cited by: 12

[HTML] sciencedirect.com

[HTML] Machine learning meets advanced robotic manipulation

S Nahavandi, R Alizadehsani, D Nahavandi, CP Lim… - Information …, 2024 - Elsevier

Automated industries lead to high quality production, lower manufacturing cost and better
utilization of human resources. Robotic manipulator arms have major role in the automation …

Cited by: 10 (Web of Science: 3)

[PDF] neurips.cc

Offline imitation learning with a misspecified simulator

S Jiang, J Pang, Y Yu - Advances in neural information …, 2020 - proceedings.neurips.cc

In real-world decision-making tasks, learning an optimal policy without a trial-and-error
process is an appealing challenge. When expert demonstrations are available, imitation …

Cited by: 30

Multi-objective Deep Reinforcement Learning for Function Offloading in Serverless Edge Computing

Y Yang, X Du, Y Ye, J Ding, T Wang… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org

Function offloading problems play a crucial role in optimizing the performance of
applications in serverless edge computing (SEC). Existing research has extensively …

[PDF] liv.ac.uk

[PDF] Near on-policy experience sampling in multi-objective reinforcement learning

S Wang, M Reymond, AA Irissappane… - Proceedings of the …, 2022 - aamas.csc.liv.ac.uk

In multi-objective decision problems, the same state-action pair under different preference
weights between the objectives, constitutes different optimal policies. The introduction of …

Cited by: 6

[PDF] arxiv.org

Successive convex approximation based off-policy optimization for constrained reinforcement learning

C Tian, A Liu, G Huang, W Luo - IEEE Transactions on Signal …, 2022 - ieeexplore.ieee.org

Constrained reinforcement learning (CRL), also termed as safe reinforcement learning, is a
promising technique enabling the deployment of RL agent in real-world systems. In this …

Cited by: 10 (Web of Science: 5)

[PDF] arxiv.org

A Bi-objective Perspective on Controllable Language Models: Reward Dropout Improves Off-policy Control Performance

C Lee, C Lim - arXiv preprint arXiv:2310.04483, 2023 - arxiv.org

We study the theoretical aspects of CLMs (Controllable Language Models) from a
bi-objective optimization perspective. Specifically, we consider the CLMs as an off-policy RL …

[PDF] cmu.edu

[PDF] Building Adaptable Generalist Robots

M Xu - 2024 - kilthub.cmu.edu

Over the past decade, advancements in deep robot learning have enabled robots to acquire
remarkable capabilities. However, these robots often struggle to generalize to new, unseen …

Saved in "My Library"

Off-policy policy gradient algorithms by constraining the state distribution shift

Prompting decision transformer for few-shot policy generalization

Social NCE: Contrastive learning of socially-aware motion representations

State regularized policy optimization on data with dynamics shift

[HTML] Machine learning meets advanced robotic manipulation

Offline imitation learning with a misspecified simulator

Multi-objective Deep Reinforcement Learning for Function Offloading in Serverless Edge Computing

[PDF] Near on-policy experience sampling in multi-objective reinforcement learning

Successive convex approximation based off-policy optimization for constrained reinforcement learning

A Bi-objective Perspective on Controllable Language Models: Reward Dropout Improves Off-policy Control Performance

[PDF] Building Adaptable Generalist Robots