Ahmed Touati

Cituota

	Visi	Nuo 2020
Šaltiniai	566	499
h-rodyklė	13	13
i10-rodyklė	16	15

140

105

20162017201820192020202120222023202420254 8 9 40 63 81 99 121 107 28

Viešas pasiekiamumas

Peržiūrėti viską

5 straipsniai

0 straipsnių

pasiekiami

nepasiekiami

Pagal finansavimo įpareigojimus

Bendraautoriai

Pascal VincentFacebook AI Research; U. Montreal (Professor, Computer Sc. & Op. Res.); MILA; CIFARPatvirtintas el. paštas iro.umontreal.ca
Joshua RomoffUbisoft La ForgePatvirtintas el. paštas ubisoft.com
Pierre-Luc BaconUniversity of MontrealPatvirtintas el. paštas mila.quebec
Simon Lacoste-JulienAssociate Professor - Canada CIFAR AI Chair, University of Montreal / MilaPatvirtintas el. paštas iro.umontreal.ca
Nan Rosemary KeGoogle, Deepmind, MilaPatvirtintas el. paštas google.com
Gabriel HuangPhD candidate, Mila & Visiting Researcher, ServiceNowPatvirtintas el. paštas umontreal.ca
Gauthier GidelAssistant professor at Mila, University of Montréal (DIRO)Patvirtintas el. paštas umontreal.ca
Doina PrecupDeepMind and McGill UniversityPatvirtintas el. paštas cs.mcgill.ca
Chin-Wei HuangMicrosoft ResearchPatvirtintas el. paštas microsoft.com
Aaron CourvilleProfessor, DIRO, Université de Montréal, Mila, Cifar CAI chairPatvirtintas el. paštas umontreal.ca
Amy ZhangAssistant Professor of Electrical and Computer Engineering at University of Texas at AustinPatvirtintas el. paštas austin.utexas.edu
Harsh SatijaMcGill University, MilaPatvirtintas el. paštas mail.mcgill.ca
Laurent DinhApplePatvirtintas el. paštas apple.com
Jerome Le NyProfessor of Electrical Engineering, Polytechnique Montreal, and GERADPatvirtintas el. paštas polymtl.ca
Marc G. BellemareReliant AIPatvirtintas el. paštas reliant.ai
Adrien Ali TaïgaUniversité de MontréalPatvirtintas el. paštas umontreal.ca

Stebėti

Ahmed Touati

Meta AI

Patvirtintas el. paštas umontreal.ca

Machine learning Reinforcement learning


Pavadinimas Rūšiuoti pagal šaltinius Rūšiuoti pagal metus Rūšiuoti pagal pavadinimą	Cituota Cituota	Metai
Learning Dynamics Model in Reinforcement Learning by Incorporating the Long Term Future NR Ke, A Singh, A Touati, A Goyal, Y Bengio, D Parikh, D Batra ICLR 2019 - Proceedings of the Seventh International Conference on Learning …, 2019	94*	2019
Learning One Representation to Optimize All Rewards A Touati, Y Ollivier NeurIPS 2021: Thirty-fifth Conference on Neural Information Processing Systems, 2021	75	2021
Randomized value functions via multiplicative normalizing flows A Touati, H Satija, J Romoff, J Pineau, P Vincent UAI2019: Conference on Uncertainty in Artificial Intelligence 2019, 2018	47	2018
Convergent Tree Backup and Retrace with Function Approximation A Touati, PL Bacon, D Precup, P Vincent ICML 2018, Proceedings of the 35th International Conference on Machine …, 2017	46	2017
Does Zero-Shot Reinforcement Learning Exist? A Touati, J Rapin, Y Ollivier ICLR 2023, 2022	42	2022
Efficient learning in non-stationary linear markov decision processes A Touati, P Vincent arXiv preprint arXiv:2010.12870, 2020	40	2020
Learnable explicit density for continuous latent space and variational inference CW Huang, A Touati, L Dinh, M Drozdzal, M Havaei, L Charlin, ... ICML 2017 Workshop on Principle Approaches to Deep Learning (padl), 2017	31	2017
Real-time privacy-preserving model-based estimation of traffic flows J Le Ny, A Touati, GJ Pappas 2014 ACM/IEEE International Conference on Cyber-Physical Systems (ICCPS), 92-102, 2014	31	2014
Separable value functions across time-scales J Romoff, P Henderson, A Touati, Y Ollivier, J Pineau, E Brunskill ICML 2019, Proceedings of the 36th International Conference on Machine …, 2019	28*	2019
Stable Policy Optimization via Off-Policy Divergence Regularization A Touati, A Zhang, J Pineau, P Vincent UAI2020: Conference on Uncertainty in Artificial Intelligence 2020, 2020	22	2020
Zooming for efficient model-free reinforcement learning in metric spaces A Touati, AA Taiga, MG Bellemare arXiv preprint arXiv:2003.04069, 2020	17	2020
Parametric adversarial divergences are good task losses for generative modeling G Huang, H Berard, A Touati, G Gidel, P Vincent, S Lacoste-Julien MAIS18, Montreal AI Symposium 2018, 2017	17*	2017
Maximum reward formulation in reinforcement learning SK Gottipati, Y Pathak, R Nuttall, R Chunduru, A Touati, SG Subramanian, ... arXiv preprint arXiv:2010.03744, 2020	15	2020
SVRG for policy evaluation with fewer gradient evaluations Z Peng, A Touati, P Vincent, D Precup IJCAI2020: the 29th International Joint Conference on Artificial Intelligence, 2019	11	2019
TDprop: Does Adaptive Optimization With Jacobi Preconditioning Help Temporal Difference Learning? J Romoff, P Henderson, D Kanaa, E Bengio, A Touati, PL Bacon, J Pineau Proceedings of the 20th International Conference on Autonomous Agents and …, 2021	10*	2021
Stochastic Neural Network with Kronecker Flow CW Huang, A Touati, P Vincent, GK Dziugaite, A Lacoste, A Courville AISTATS 2020 - Proceedings of the 23nd International Conference on …, 2019	10	2019
SMORE: Score Models for Offline Goal-Conditioned Reinforcement Learning H Sikchi, R Chitnis, A Touati, A Geramifard, A Zhang, S Niekum arXiv preprint arXiv:2311.02013, 2023	7	2023
Fast imitation via behavior foundation models M Pirotta, A Tirinzoni, A Touati, A Lazaric, Y Ollivier NeurIPS 2023 Foundation Models for Decision Making Workshop, 2023	7	2023
Scalable Representation Learning in Linear Contextual Bandits with Constant Regret Guarantees A Tirinzoni, M Papini, A Touati, A Lazaric, M Pirotta NeurIPS 2022, 2022	4	2022
Simple ingredients for offline reinforcement learning E Cetin, A Tirinzoni, M Pirotta, A Lazaric, Y Ollivier, A Touati arXiv preprint arXiv:2403.13097, 2024	3	2024

Sistema negali atlikti operacijos. Bandykite vėliau dar kartą.

Straipsniai 1–20

Šaltinių per metus

Dubliuoti šaltiniai

Sujungti šaltiniai

Pridėti bendraautoriusBendraautoriai

Stebėti

Cituota

Bendraautoriai