Segueix
Jing Yu Koh
Jing Yu Koh
Meta
Correu electrònic verificat a meta.com - Pàgina d'inici
Títol
Citada per
Citada per
Any
Scaling autoregressive models for content-rich text-to-image generation
J Yu, Y Xu, JY Koh, T Luong, G Baid, Z Wang, V Vasudevan, A Ku, Y Yang, ...
TMLR, 2022
11352022
Vector-quantized image modeling with improved vqgan
J Yu, X Li, JY Koh, H Zhang, R Pang, J Qin, A Ku, Y Xu, J Baldridge, Y Wu
ICLR, 2021
4652021
Cross-Modal Contrastive Learning for Text-to-Image Generation
H Zhang*, JY Koh*, J Baldridge, H Lee, Y Yang
CVPR, 2021
4122021
Generating images with multimodal language models
JY Koh, D Fried, R Salakhutdinov
NeurIPS, 2023
2452023
Grounding Language Models to Images for Multimodal Inputs and Outputs
JY Koh, R Salakhutdinov, D Fried
ICML, 2023
1942023
Visualwebarena: Evaluating multimodal agents on realistic visual web tasks
JY Koh, R Lo, L Jang, V Duvvur, MC Lim, PY Huang, G Neubig, S Zhou, ...
ACL, 2024
1252024
Pathdreamer: A World Model for Indoor Navigation
JY Koh, H Lee, Y Yang, J Baldridge, P Anderson
ICCV, 2021
772021
Text-to-image generation grounded by fine-grained user attention
JY Koh, J Baldridge, H Lee, Y Yang
WACV, 2021
762021
A New Path: Scaling Vision-and-Language Navigation with Synthetic Instructions and Imitation Learning
A Kamath, P Anderson, S Wang, JY Koh, A Ku, A Waters, Y Yang, ...
CVPR, 2022
51*2022
OmniACT: A Dataset and Benchmark for Enabling Multimodal Generalist Autonomous Agents for Desktop and Web
R Kapoor, YP Butala, M Russak, JY Koh, K Kamble, W Alshikh, ...
ECCV, 2024
41*2024
Tree Search for Language Model Agents
JY Koh, S McAleer, D Fried, R Salakhutdinov
arXiv preprint arXiv:2407.01476, 2024
302024
Revisiting hierarchical approach for persistent long-term video prediction
W Lee, W Jung, H Zhang, T Chen, JY Koh, T Huang, H Yoon, H Lee, ...
ICLR, 2021
292021
Vq3d: Learning a 3d-aware generative model on imagenet
K Sargent, JY Koh, H Zhang, H Chang, C Herrmann, P Srinivasan, J Wu, ...
ICCV, 2023
282023
Improving Customer Satisfaction in Bike Sharing Systems through Dynamic Repositioning
S Ghosh*, JY Koh*, P Jaillet
IJCAI, 2019
282019
Simple and Effective Synthesis of Indoor 3D Scenes
JY Koh*, H Agrawal*, D Batra, R Tucker, A Waters, H Lee, Y Yang, ...
AAAI, 2022
272022
Urban Zoning Using Higher-Order Markov Random Fields on Multi-View Imagery Data
T Feng, QT Truong, T Nguyen, JY Koh, LF Yu, SK Yeung, A Binder
ECCV, 2018
212018
Dissecting Adversarial Robustness of Multimodal LM Agents
CH Wu, RR Shah, JY Koh, R Salakhutdinov, D Fried, A Raghunathan
NeurIPS 2024 Workshop on Open-World Agents, 2024
17*2024
Twitter-informed crowd flow prediction
G Goh, JY Koh, Y Zhang
ICDM Workshops, 2018
172018
Multimodal graph learning for generative tasks
M Yoon, JY Koh, B Hooi, R Salakhutdinov
arXiv preprint arXiv:2310.07478, 2023
82023
Vector-Quantized Image Modeling
YU Jiahui, X Li, H Zhang, V Vasudevan, AYS Ku, JM Baldridge, Y Xu, ...
US Patent App. 18/520,083, 2024
62024
En aquests moments el sistema no pot dur a terme l'operació. Torneu-ho a provar més tard.
Articles 1–20