팔로우
Clément ROMAC
Clément ROMAC
Research Scientist at Hugging Face / Ph.D. student at Inria (Flowers team)
inria.fr의 이메일 확인됨 - 홈페이지
제목
인용
인용
연도
Grounding large language models in interactive environments with online reinforcement learning
T Carta, C Romac, T Wolf, S Lamprier, O Sigaud, PY Oudeyer
International Conference on Machine Learning, 3676-3713, 2023
1612023
Teachmyagent: a benchmark for automatic curriculum learning in deep rl
C Romac, R Portelas, K Hofmann, PY Oudeyer
International Conference on Machine Learning, 9052-9063, 2021
342021
Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent
Q Gallouédec, E Beeching, C Romac, E Dellandréa
arXiv preprint arXiv:2402.09844, 2024
102024
Meta automatic curriculum learning
R Portelas, C Romac, K Hofmann, PY Oudeyer
arXiv preprint arXiv:2011.08463, 2020
102020
Deep recurrent Q-learning vs deep Q-learning on a simple partially observable Markov decision process with minecraft
C Romac, V Béraud
arXiv preprint arXiv:1903.04311, 2019
82019
Reinforcement Learning for Aligning Large Language Models Agents with Interactive Environments: Quantifying and Mitigating Prompt Overfitting
MS Aissi, C Romac, T Carta, S Lamprier, PY Oudeyer, O Sigaud, L Soulier, ...
arXiv preprint arXiv:2410.19920, 2024
2024
SAC-GLAM: Improving Online RL for LLM agents with Soft Actor-Critic and Hindsight Relabeling
L Gaven, C Romac, T Carta, S Lamprier, O Sigaud, PY Oudeyer
arXiv preprint arXiv:2410.12481, 2024
2024
Reinforcement Learning for Aligning Large Language Models Agents with Interactive Environments: Quantifying and Mitigating Prompt Overfitting
M Salim Aissi, C Romac, T Carta, S Lamprier, PY Oudeyer, O Sigaud, ...
arXiv e-prints, arXiv: 2410.19920, 2024
2024
Les algorithmes des IA peuvent-ils comprendre notre monde?
C Romac, T Carta, PY Oudeyer
Pour la science, 2024
2024
Les IA face au réel
C Romac, T Carta, PY Oudeyer
Pour la Science 557 (3), 24-31, 2024
2024
현재 시스템이 작동되지 않습니다. 나중에 다시 시도해 주세요.
학술자료 1–10