Sample and feedback efficient hierarchical reinforcement learning from human preferences

[PDF] arxiv.org

A survey of reinforcement learning from human feedback

T Kaufmann, P Weng, V Bengs… - ar…

… strategies with human instructions

T Osa, J Peters, G Neumann - Advanced Robotics, 2018 - Taylor & Francis

Grasping is an essential component for robotic manipulation and has been investigated for
decades. Prior work on grasping often assumes that a sufficient amount of training data is …

Cited by 33 · All 9 versions

[PDF] arxiv.org

Distributed personalized gradient tracking with convex parametric models

I Notarnicola, A Simonetto, F Farina… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org

We present a distributed optimization algorithm for solving online personalized optimization
problems over a network of computing and communicating nodes, each of which linked to a …

Cited by 20 · All 7 versions
