Multi-lingual agents through multi-headed neural networks JD Thomas, R Santos-Rodriguez, R Piechocki, M Anca arXiv preprint arXiv:2111.11129, 2021 | 6 | 2021 |
Twin delayed hierarchical actor-critic M Anca, M Studley 2021 7th International Conference on Automation, Robotics and Applications …, 2021 | 6 | 2021 |
Achieving goals using reward shaping and curriculum learning M Anca, JD Thomas, D Pedamonti, M Hansen, M Studley Proceedings of the Future Technologies Conference, 316-331, 2023 | 4 | 2023 |
Effects of reward shaping on curriculum learning in goal conditioned tasks M Anca, M Studley, M Hansen, JD Thomas, D Pedamonti arXiv preprint arXiv:2206.02462, 2022 | 3 | 2022 |
Learning Long Chain of Actions through Hierarchical Reinforcement Learning M Anca, M Anca distances 1, 1, 2024 | | 2024 |
Modular Hierarchical Reinforcement Learning for Robotics: Improving Scalability and Generalizability M Anca, MF Hansen, M Studley ICML Workshop on New Frontiers in Learning, Control, and Dynamical Systems, 0 | | |