Ishan Durugkar

Cited by

	All	Since 2020
Citations	1595	1274
h-index	10	10
i10-index	10	10

Citations per year: 2016: 14, 2017: 30, 2018: 105, 2019: 163, 2020: 196, 2021: 227, 2022: 243, 2023: 305, 2024: 254, 2025: 48

Public access (based on funding mandates): 8 articles available, 0 articles not available

Co-authors

Peter Stone: Professor of Computer Science, The University of Texas at Austin (verified email at cs.utexas.edu)
Sridhar Mahadevan: Director, Adobe Research & Professor, University of Massachusetts, Amherst (verified email at cs.umass.edu)
Ian Gemp: Google DeepMind (verified email at google.com)
Rajarshi Das: AWS AI Labs (verified email at cs.washington.edu)
Luke Vilnis: Research Scientist, Google DeepMind (verified email at google.com)
Shehzaad Dhuliawala: ETH Zurich (verified email at inf.ethz.ch)
Akshay Krishnamurthy: University of Massachusetts Amherst (verified email at cs.umass.edu)
Andrew McCallum: Distinguished Professor of Computer Science, University of Massachusetts Amherst (verified email at cs.umass.edu)
Alex Smola: Boson AI (verified email at smola.org)
Manzil Zaheer: Google Research (verified email at cmu.edu)
Josiah P. Hanna: Assistant Professor, University of Wisconsin--Madison (verified email at cs.wisc.edu)
Garrett Warnell: Research Scientist, Army Research Laboratory (verified email at army.mil)
Mauricio Tec: Harvard University (verified email at g.harvard.edu)
Elad Liebman: Applied Scientist, Amazon & Assistant Prof. of Instruction, UT Austin (verified email at cs.utexas.edu)
Haresh Karnan: Ph.D., The University of Texas at Austin (verified email at utexas.edu)
Dr Anand J Kulkarni: Professor & Associate Director, Institute of Artificial Intelligence, MITWPU, Pune, India (verified email at ntu.edu.sg)
Philip Thomas: University of Massachusetts Amherst (verified email at cs.umass.edu)
Emma Brunskill: Associate Professor of Computer Science, Stanford University (verified email at cs.stanford.edu)
Georgios Theocharous: Adobe Research (verified email at adobe.com)
Scott Niekum: Associate Professor, University of Massachusetts Amherst (verified email at cs.umass.edu)

Ishan Durugkar

Research Scientist, Sony AI

Verified email at sony.com

Reinforcement Learning, Generative Models, Machine Learning, Multi-agent systems, Artificial Intelligence


Title	Authors	Venue	Cited by	Year
Go for a walk and arrive at the answer: Reasoning over paths in knowledge bases using reinforcement learning	R Das, S Dhuliawala, M Zaheer, L Vilnis, I Durugkar, A Krishnamurthy, ...	arXiv preprint arXiv:1711.05851, 2017	678	2017
Generative Multi-Adversarial Networks	I Durugkar, I Gemp, S Mahadevan	International Conference on Learning Representations, 2017	479	2017
Cohort intelligence: a self supervised learning behavior	AJ Kulkarni, IP Durugkar, M Kumar	2013 IEEE international conference on systems, man, and cybernetics, 1396-1400, 2013	141	2013
Predictive off-policy policy evaluation for nonstationary decision problems, with applications to digital marketing	P Thomas, G Theocharous, M Ghavamzadeh, I Durugkar, E Brunskill	Proceedings of the AAAI Conference on Artificial Intelligence 31 (2), 4740-4745, 2017	67	2017
An imitation from observation approach to transfer learning with dynamics mismatch	S Desai, I Durugkar, H Karnan, G Warnell, J Hanna, P Stone	Advances in Neural Information Processing Systems 33, 3917-3929, 2020	62	2020
Adversarial intrinsic motivation for reinforcement learning	I Durugkar, M Tec, S Niekum, P Stone	Advances in Neural Information Processing Systems 34, 8622-8636, 2021	44	2021
Deep reinforcement learning with macro-actions	IP Durugkar, C Rosenbaum, S Dernbach, S Mahadevan	arXiv preprint arXiv:1606.04615, 2016	31	2016
Balancing individual preferences and shared objectives in multiagent reinforcement learning	I Durugkar, E Liebman, P Stone	International Joint Conference on Artificial Intelligence, 2020	25	2020
Reducing sampling error in batch temporal difference learning	B Pavse, I Durugkar, J Hanna, P Stone	International Conference on Machine Learning, 7543-7552, 2020	16	2020
TD learning with constrained gradients	I Durugkar, P Stone		15	2018
f-policy gradients: A general framework for goal-conditioned rl using f-divergences	S Agarwal, I Durugkar, P Stone, A Zhang	Advances in Neural Information Processing Systems 36, 12100-12123, 2023	7	2023
Towards a real-time, low-resource, end-to-end object detection pipeline for robot soccer	SK Narayanaswami, M Tec, I Durugkar, S Desai, B Masetty, S Narvekar, ...	Robot World Cup, 62-74, 2022	7	2022
Wasserstein distance maximizing intrinsic control	I Durugkar, S Hansen, S Spencer, V Mnih	arXiv preprint arXiv:2110.15331, 2021	4	2021
Unmixing in the presence of nuisances with deep generative models	M Parente, I Gemp, I Durugkar	2017 IEEE International Geoscience and Remote Sensing Symposium (IGARSS …, 2017	3	2017
N-agent ad hoc teamwork	C Wang, MA Rahman, I Durugkar, E Liebman, P Stone	Advances in Neural Information Processing Systems 37, 111832-111862, 2025	2	2025
Estimation and control of visitation distributions for reinforcement learning	I Durugkar		2	2023
Abc: Adversarial behavioral cloning for offline mode-seeking imitation learning	E Hudson, I Durugkar, G Warnell, P Stone	arXiv preprint arXiv:2211.04005, 2022	2	2022
DM²: Distributed multi-agent reinforcement learning via distribution matching	C Wang		2	2022
An imitation from observation approach to sim-to-real transfer	S Desai, I Durugkar, H Karnan, G Warnell, J Hanna, P Stone, AI Sony	2nd Workshop on Closing the Reality Gap in Sim2Real Transfer for Robotics. RSS, 2020	2	2020
Multi-preference actor critic	I Durugkar, M Hausknecht, A Swaminathan, P MacAlpine	arXiv preprint arXiv:1904.03295, 2019	2	2019
