Suivre
Mudit Gaur
Mudit Gaur
Adresse e-mail validée de purdue.edu
Titre
Citée par
Citée par
Année
On the global convergence of fitted Q-iteration with two-layer neural network parametrization
M Gaur, V Aggarwal, M Agarwal
International Conference on Machine Learning, 11013-11049, 2023
32023
Closing the gap: Achieving global convergence (last iterate) of actor-critic under markovian sampling with neural network parametrization
M Gaur, AS Bedi, D Wang, V Aggarwal
International Conference of Machine Learning, 15153-15179, 2024
22024
On the Global Convergence of Natural Actor-Critic with Two-layer Neural Network Parametrization
M Gaur, AS Bedi, D Wang, V Aggarwal
arXiv preprint arXiv:2306.10486, 2023
22023
On The Global Convergence Of Online RLHF With Neural Parametrization
M Gaur, AS Bedi, R Pasupathy, V Aggarwal
arXiv preprint arXiv:2410.15610, 2024
2024
Le système ne peut pas réaliser cette opération maintenant. Veuillez réessayer plus tard.
Articles 1–4