Segui
Minttu Alakuijala
Titolo
Citata da
Citata da
Anno
Residual reinforcement learning from demonstrations
M Alakuijala, G Dulac-Arnold, J Mairal, J Ponce, C Schmid
arXiv preprint arXiv:2106.08050, 2021
332021
Learning reward functions for robotic manipulation by observing humans
M Alakuijala, G Dulac-Arnold, J Mairal, J Ponce, C Schmid
2023 IEEE International Conference on Robotics and Automation (ICRA), 5006-5012, 2023
232023
Generating Code World Models with Large Language Models Guided by Monte Carlo Tree Search
N Dainese, M Merler, M Alakuijala, P Marttinen
38th Conference on Neural Information Processing Systems (NeurIPS 2024), 2024
22024
Recursive Decomposition with Dependencies for Generic Divide-and-Conquer Reasoning
S Hernández-Gutiérrez, M Alakuijala, AV Nikitin, P Marttinen
The First Workshop on System-2 Reasoning at Scale, NeurIPS'24, 2024
12024
Memento No More: Coaching AI Agents to Master Multiple Tasks via Hints Internalization
M Alakuijala, Y Gao, G Ananov, S Kaski, P Marttinen, A Ilin, H Valpola
arXiv preprint arXiv:2502.01562, 2025
2025
Video-Language Critic: Transferable Reward Functions for Language-Conditioned Robotics
M Alakuijala, R McLean, I Woungang, N Farsad, S Kaski, P Marttinen, ...
Transactions on Machine Learning Research, 2025
2025
Self-taught Robots: Autonomous and Weakly-Supervised Learning for Robotic Manipulation
M Alakuijala
ENS Paris - Ecole Normale Superieure de Paris, 2022
2022
Discovering Actions by Jointly Clustering Video and Narration Streams Across Tasks
M Alakuijala, J Mairal, J Ponce, C Schmid
CVPR 2020 Workshop on Learning from Instructional Videos, 2020
2020
Il sistema al momento non può eseguire l'operazione. Riprova più tardi.
Articoli 1–8