Daan Wout
Title
Cited by
Year
Deep reinforcement learning with feedback-based exploration
J Scholten, D Wout, C Celemin, J Kober
2019 IEEE 58th Conference on Decision and Control (CDC), 803-808, 2019
Cited by: 6
2019
Learning Gaussian policies from corrective human feedback
D Wout, J Scholten, C Celemin, J Kober
arXiv preprint arXiv:1903.05216, 2019
Cited by: 5
2019
Policy Learning with Human Teachers
D Wout
2019
Policy Learning with Human Teachers: Using directive feedback in a Gaussian framework
D Wout
2019
Articles 1–4