Прати
Abhishek Naik
Abhishek Naik
Верификована је имејл адреса на nrc-cnrc.gc.ca - Почетна страница
Наслов
Навело
Навело
Година
Learning and planning in average-reward markov decision processes
Y Wan, A Naik, RS Sutton
International Conference on Machine Learning, 10653-10662, 2021
862021
Discounted Reinforcement Learning is Not an Optimization Problem
A Naik, R Shariff, N Yasui, H Yao, RS Sutton
arXiv preprint arXiv:1910.02140, 2019
762019
MADRaS: Multi Agent Driving Simulator
A Santara, S Rudra, SA Buridi, M Kaushik, A Naik, B Kaul, B Ravindran
Journal of Artificial Intelligence Research 70, 1517–1555, 2021
362021
Rail: Risk-averse imitation learning
A Santara, A Naik, B Ravindran, D Das, D Mudigere, S Avancha, B Kaul
Proceedings of the 17th International Conference on Autonomous Agents and …, 2018
262018
Average-Reward Learning and Planning with Options
Y Wan, A Naik, RS Sutton
Advances in Neural Information Processing Systems 34, 2021
142021
Identifying User Survival Types via Clustering of Censored Social Network Data
SC Mouli, A Naik, B Ribeiro, J Neville
arXiv preprint arXiv:1703.03401, 2017
102017
Reward Centering
A Naik, Y Wan, M Tomar, RS Sutton
arXiv preprint arXiv:2405.09999, 2024
92024
Reinforcement Learning for Continuing Problems Using Average Reward
A Naik
12024
Multi-Step Average-Reward Prediction via Differential TD(λ)
A Naik, RS Sutton
The Multi-disciplinary Conference on Reinforcement Learning and Decision Making, 2022
12022
Planning with Expectation Models for Control
K Kudashkina, Y Wan, A Naik, RS Sutton
arXiv preprint arXiv:2104.08543, 2021
12021
Energy-Efficient Satellite IoT Optical Downlinks Using Weather-Adaptive Reinforcement Learning
E Fettes, PG Madoery, H Yanikomeroglu, G Karabulut-Kurt, A Naik, ...
arXiv preprint arXiv:2501.11198, 2025
2025
Investigating Action-Space Generalization in Reinforcement Learning for Recommendation Systems
A Naik, B Chang, A Karatzoglou, M Mladenov, EH Chi, M Chen
Companion Proceedings of the ACM Web Conference 2023, 966-972, 2023
2023
Deep Reinforcement Learning: Reliability and Multi-Agent Environments
A Naik
Indian Institute of Technology Madras, 2018
2018
Систем тренутно не може да изврши ову радњу. Пробајте поново касније.
Чланци 1–13