A survey of reinforcement learning from human feedback

T Kaufmann, P Weng, V Bengs… - ar** strategies with human instructions
T Osa, J Peters, G Neumann - Advanced Robotics, 2018 - Taylor & Francis
Gras** is an essential component for robotic manipulation and has been investigated for
decades. Prior work on gras** often assumes that a sufficient amount of training data is …

Distributed personalized gradient tracking with convex parametric models

I Notarnicola, A Simonetto, F Farina… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
We present a distributed optimization algorithm for solving online personalized optimization
problems over a network of computing and communicating nodes, each of which linked to a …