ติดตาม
Joey Hong
Joey Hong
UC Berkeley, Google Research
ยืนยันอีเมลแล้วที่ berkeley.edu
ชื่อ
อ้างโดย
อ้างโดย
ปี
Rules of the road: Predicting driving behavior with a convolutional model of semantic interactions
J Hong, B Sapp, J Philbin
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2019
3192019
When Should We Prefer Offline Reinforcement Learning Over Behavioral Cloning?
A Kumar, J Hong, A Singh, S Levine
International Conference on Learning Representations, 2021
152*2021
Trajectory prediction on top-down scenes
XJ Hong, BJ Sapp
US Patent 11,169,531, 2021
892021
Latent bandits revisited
J Hong, B Kveton, M Zaheer, Y Chow, A Ahmed, C Boutilier
Advances in Neural Information Processing Systems 33, 13423-13433, 2020
562020
Hierarchical bayesian bandits
J Hong, B Kveton, M Zaheer, M Ghavamzadeh
International Conference on Artificial Intelligence and Statistics, 7724-7741, 2022
482022
Trajectory prediction on top-down scenes and associated model
XJ Hong, BJ Sapp, JWV Philbin, KZ Wang
US Patent 11,195,418, 2021
392021
On the sensitivity of reward inference to misspecified human models
J Hong, K Bhatia, A Dragan
arXiv preprint arXiv:2212.04717, 2022
262022
Deep Hierarchy in Bandits
J Hong, B Kveton, S Katariya, M Zaheer, M Ghavamzadeh
International Conference on Machine Learning, 2022
262022
Lmrl gym: Benchmarks for multi-turn reinforcement learning with language models
M Abdulhai, I White, C Snell, C Sun, J Hong, Y Zhai, K Xu, S Levine
arXiv preprint arXiv:2311.18232, 2023
232023
Confidence-conditioned value functions for offline reinforcement learning
J Hong, A Kumar, S Levine
arXiv preprint arXiv:2212.04607, 2022
232022
Latent programmer: Discrete latent codes for program synthesis
J Hong, D Dohan, R Singh, C Sutton, M Zaheer
International Conference on Machine Learning, 4308-4318, 2021
232021
Learning to influence human behavior with offline reinforcement learning
J Hong, S Levine, A Dragan
Advances in Neural Information Processing Systems 36, 36094-36105, 2023
182023
Thompson sampling with a mixture prior
J Hong, B Kveton, M Zaheer, M Ghavamzadeh, C Boutilier
International Conference on Artificial Intelligence and Statistics, 7565-7586, 2022
182022
Exedec: Execution decomposition for compositional generalization in neural program synthesis
K Shi, J Hong, Y Deng, P Yin, M Zaheer, C Sutton
arXiv preprint arXiv:2307.13883, 2023
172023
Zero-shot goal-directed dialogue via rl on imagined conversations
J Hong, S Levine, A Dragan
arXiv preprint arXiv:2311.05584, 2023
162023
Non-stationary latent bandits
J Hong, B Kveton, M Zaheer, Y Chow, A Ahmed, M Ghavamzadeh, ...
arXiv preprint arXiv:2012.00386, 2020
122020
Multi-task off-policy learning from bandit feedback
J Hong, B Kveton, M Zaheer, S Katariya, M Ghavamzadeh
International Conference on Machine Learning, 13157-13173, 2023
102023
Non-stationary off-policy optimization
J Hong, B Kveton, M Zaheer, Y Chow, A Ahmed
International Conference on Artificial Intelligence and Statistics, 2494-2502, 2021
102021
Trajectory prediction on top-down scenes and associated model
XJ Hong, BJ Sapp, JWV Philbin, KZ Wang
US Patent 12,183,204, 2024
82024
Ensemble Maximum Entropy Classification and Linear Regression for Author Age Prediction
J Hong, C Mattmann, P Ramirez
2017 IEEE International Conference on Information Reuse and Integration (IRI), 2017
82017
ระบบไม่สามารถดำเนินการได้ในขณะนี้ โปรดลองใหม่อีกครั้งในภายหลัง
บทความ 1–20