Yash Chandak

引用先

	すべて	2020 年以来
引用	729	706
h 指標	11	11
i10 指標	13	11

260

130

195

201920202021202220232024202519 55 92 139 156 249 15

オープンアクセス

すべて表示

11 件の論文

0 件の論文

利用可能

利用不可

助成機関の要件に基づく

共著者

Philip ThomasUniversity of Massachusetts Amherst確認したメールアドレス: cs.umass.edu
Georgios TheocharousAdobe Research確認したメールアドレス: adobe.com
Scott M. JordanPostdoctoral Fellow, University of Alberta確認したメールアドレス: ualberta.ca
Emma BrunskillAssociate Professor of Computer Science, Stanford University確認したメールアドレス: cs.stanford.edu
James KostasPhD Student, University of Massachusetts Amherst確認したメールアドレス: umass.edu
Martha WhiteUniversity of Alberta確認したメールアドレス: ualberta.ca
Scott NiekumAssociate Professor, University of Massachusetts Amherst確認したメールアドレス: cs.umass.edu
Sridhar MahadevanDirector, Data Science Lab, Adobe Research & Professor, University of Massachusetts, Amherst確認したメールアドレス: cs.umass.edu
Bruno Castro da SilvaUniversity of Massachusetts確認したメールアドレス: cs.umass.edu
Rémi MunosFAIR, Meta確認したメールアドレス: inria.fr
Will DabneyDeepMind確認したメールアドレス: google.com
Chris NotaUniversity of Massachusetts, Amherst確認したメールアドレス: cs.umass.edu
Balaraman RavindranProfessor of Computer Science, Indian Institute of Technology Madras確認したメールアドレス: cse.iitm.ac.in
Erik Learned-MillerProfessor of Computer Science, University of Massachusetts Amherst確認したメールアドレス: cs.umass.edu
Shiv Shankar

フォロー

Yash Chandak

Postdoctoral Scholar, Stanford University

確認したメールアドレス: stanford.edu - ホームページ

Reinforcement Learning Machine Learning


タイトル引用回数順公開年順タイトル順	引用先引用先	年
Learning action representations for reinforcement learning Y Chandak, G Theocharous, J Kostas, S Jordan, P Thomas International conference on machine learning, 941-950, 2019	212	2019
Evaluating the performance of reinforcement learning algorithms S Jordan, Y Chandak, D Cohen, M Zhang, P Thomas International Conference on Machine Learning, 4962-4973, 2020	79	2020
Optimizing for the future in non-stationary mdps Y Chandak, G Theocharous, S Shankar, M White, S Mahadevan, ... International Conference on Machine Learning, 1414-1425, 2020	79	2020
Supervised pretraining can learn in-context reinforcement learning J Lee, A Xie, A Pacchiano, Y Chandak, C Finn, O Nachum, E Brunskill Advances in Neural Information Processing Systems 36, 43057-43083, 2023	63	2023
Universal off-policy evaluation Y Chandak, S Niekum, B da Silva, E Learned-Miller, E Brunskill, ... Advances in Neural Information Processing Systems 34, 27475-27490, 2021	57	2021
Understanding self-predictive learning for reinforcement learning Y Tang, ZD Guo, PH Richemond, BA Pires, Y Chandak, R Munos, ... International Conference on Machine Learning, 33632-33656, 2023	34	2023
Lifelong learning with a changing action set Y Chandak, G Theocharous, C Nota, P Thomas Proceedings of the AAAI Conference on Artificial Intelligence 34 (04), 3373-3380, 2020	34	2020
Towards safe policy improvement for non-stationary MDPs Y Chandak, S Jordan, G Theocharous, M White, PS Thomas Advances in Neural Information Processing Systems 33, 9156-9168, 2020	31	2020
The GPT Surprise: Offering Large Language Model Chat in a Massive Coding Class Reduced Engagement but Increased Adopters’ Exam Performances A Nie, Y Chandak, M Suzara, A Malik, J Woodrow, M Peng, M Sahami, ... OSF Preprints, 2024	16	2024
Reinforcement learning for strategic recommendations G Theocharous, Y Chandak, PS Thomas, F de Nijs arXiv preprint arXiv:2009.07346, 2020	12	2020
Behavior alignment via reward function optimization D Gupta, Y Chandak, S Jordan, PS Thomas, B C da Silva Advances in Neural Information Processing Systems 36, 52759-52791, 2023	11	2023
Fusion graph convolutional networks P Vijayan, Y Chandak, MM Khapra, S Parthasarathy, B Ravindran arXiv preprint arXiv:1805.12528, 2018	11	2018
Reinforcement learning when all actions are not always available Y Chandak, G Theocharous, B Metevier, P Thomas Proceedings of the AAAI Conference on Artificial Intelligence 34 (04), 3381-3388, 2020	10	2020
Off-policy evaluation for action-dependent non-stationary environments Y Chandak, S Shankar, N Bastian, B da Silva, E Brunskill, PS Thomas Advances in Neural Information Processing Systems 35, 9217-9232, 2022	8	2022
Adaptive instrument design for indirect experiments Y Chandak, S Shankar, V Syrgkanis, E Brunskill The Twelfth International Conference on Learning Representations, 2023	7	2023
High-confidence off-policy (or counterfactual) variance estimation Y Chandak, S Shankar, PS Thomas Proceedings of the AAAI Conference on Artificial Intelligence 35 (8), 6939-6947, 2021	7	2021
Factored DRO: Factored distributionally robust policies for contextual bandits T Mu, Y Chandak, TB Hashimoto, E Brunskill Advances in Neural Information Processing Systems 35, 8318-8331, 2022	6	2022
On optimizing interventions in shared autonomy W Tan, D Koleczek, S Pradhan, N Perello, V Chettiar, V Rohra, A Rajaram, ... Proceedings of the AAAI Conference on Artificial Intelligence 36 (5), 5341-5349, 2022	6	2022
Sope: Spectrum of off-policy estimators C Yuan, Y Chandak, S Giguere, PS Thomas, S Niekum Advances in Neural Information Processing Systems 34, 18958-18969, 2021	6	2021
Representations and exploration for deep reinforcement learning using singular value decomposition Y Chandak, S Thakoor, ZD Guo, Y Tang, R Munos, W Dabney, DL Borsa International Conference on Machine Learning, 4009-4034, 2023	5	2023

現在システムで処理を実行できません。しばらくしてからもう一度お試しください。

論文 1–20

年間引用数

重複した引用

結合された引用

共著者を追加共著者

フォロー

引用先

共著者