Jae Sung (James) Park
Verified email at cs.washington.edu
Title
Cited by
Year
Merlot: Multimodal neural script knowledge models
R Zellers, X Lu, J Hessel, Y Yu, JS Park, J Cao, A Farhadi, Y Choi
Advances in neural information processing systems 34, 23634-23651, 2021
Cited by: 403; Year: 2021
VisualCOMET: Reasoning about the Dynamic Context of a Still Image
JS Park, C Bhagavatula, R Mottaghi, A Farhadi, Y Choi
arXiv preprint arXiv:2004.10796, 2020
Cited by: 133; Year: 2020
Adversarial inference for multi-sentence video description
JS Park, M Rohrbach, T Darrell, A Rohrbach
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2019
Cited by: 113; Year: 2019
Molmo and pixmo: Open weights and open data for state-of-the-art multimodal models
M Deitke, C Clark, S Lee, R Tripathi, Y Yang, JS Park, M Salehi, ...
arXiv preprint arXiv:2409.17146, 2024
Cited by: 73*; Year: 2024
Natural language rationales with full-stack visual reasoning: From pixels to semantic frames to commonsense graphs
A Marasović, C Bhagavatula, JS Park, RL Bras, NA Smith, Y Choi
arXiv preprint arXiv:2010.07526, 2020
Cited by: 68; Year: 2020
Agent ai: Surveying the horizons of multimodal interaction
Z Durante, Q Huang, N Wake, R Gong, JS Park, B Sarkar, R Taori, Y Noda, ...
arXiv preprint arXiv:2401.03568, 2024
Cited by: 57; Year: 2024
The Abduction of Sherlock Holmes: A Dataset for Visual Abductive Reasoning
J Hessel, JD Hwang, JS Park, R Zellers, C Bhagavatula, A Rohrbach, ...
European Conference on Computer Vision, 558-575, 2022
Cited by: 48; Year: 2022
Fusing pre-trained language models with multimodal prompts through reinforcement learning
Y Yu, J Chung, H Yun, J Hessel, JS Park, X Lu, R Zellers, P Ammanabrolu, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
Cited by: 44*; Year: 2023
Exposing the limits of video-text models through contrast sets
JS Park, S Shen, A Farhadi, T Darrell, Y Choi, A Rohrbach
Proceedings of the 2022 Conference of the North American Chapter of the …, 2022
Cited by: 29; Year: 2022
Identity-aware multi-sentence video description
JS Park, T Darrell, A Rohrbach
Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23 …, 2020
Cited by: 25; Year: 2020
Llc: Accurate, multi-purpose learnt low-dimensional binary codes
A Kusupati, M Wallingford, V Ramanujan, R Somani, JS Park, K Pillutla, ...
Advances in neural information processing systems 34, 23900-23913, 2021
Cited by: 12; Year: 2021
Localized symbolic knowledge distillation for visual commonsense models
JS Park, J Hessel, K Chandu, PP Liang, X Lu, P West, Y Yu, Q Huang, ...
Advances in Neural Information Processing Systems 36, 11338-11352, 2023
Cited by: 10; Year: 2023
Multimodal Knowledge Alignment with Reinforcement Learning. CoRR abs/2205.12630 (2022)
Y Yu, J Chung, H Yun, J Hessel, JS Park, X Lu, P Ammanabrolu, R Zellers, ...
Cited by: 5; Year: 2022
Superposed decoding: Multiple generations from a single autoregressive inference pass
E Shen, A Fan, SM Pratt, JS Park, M Wallingford, SM Kakade, A Holtzman, ...
arXiv preprint arXiv:2405.18400, 2024
Cited by: 3; Year: 2024
Ark: Augmented reality with knowledge interactive emergent ability
Q Huang, JS Park, A Gupta, P Bennett, R Gong, S Som, B Peng, ...
arXiv preprint arXiv:2305.00970, 2023
Cited by: 2; Year: 2023
Certainly Uncertain: A Benchmark and Metric for Multimodal Epistemic and Aleatoric Awareness
K Raghavi Chandu, L Li, A Awadalla, X Lu, JS Park, J Hessel, L Wang, ...
arXiv e-prints, arXiv: 2407.01942, 2024
Cited by: 1*; Year: 2024
BLIP3-KALE: Knowledge Augmented Large-Scale Dense Captions
A Awadalla, L Xue, M Shu, A Yan, J Wang, S Purushwalkam, S Shen, ...
arXiv preprint arXiv:2411.07461, 2024
Year: 2024
ActionAtlas: A VideoQA Benchmark for Domain-specialized Action Recognition
M Salehi, JS Park, T Yadav, A Kusupati, R Krishna, Y Choi, H Hajishirzi, ...
arXiv preprint arXiv:2410.05774, 2024
Year: 2024