OK-VQA: A Visual Question Answering Benchmark Requiring External Knowledge K Marino, M Rastegari, A Farhadi, R Mottaghi Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2019 | 1048 | 2019 |
The more you know: Using knowledge graphs for image classification K Marino, R Salakhutdinov, A Gupta CVPR 2017, 2016 | 436 | 2016 |
A-okvqa: A benchmark for visual question answering using world knowledge D Schwenk, A Khandelwal, C Clark, K Marino, R Mottaghi European conference on computer vision, 146-162, 2022 | 428 | 2022 |
The pose knows: Video forecasting by generating pose futures J Walker, K Marino, A Gupta, M Hebert Proceedings of the IEEE international conference on computer vision, 3332-3341, 2017 | 428 | 2017 |
Krisp: Integrating implicit and symbolic knowledge for open-domain knowledge-based vqa K Marino, X Chen, D Parikh, A Gupta, M Rohrbach Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2021 | 234 | 2021 |
Same object, different grasps: Data and semantic knowledge for task-oriented grasping A Murali, W Liu, K Marino, S Chernova, A Gupta Conference on robot learning, 1540-1557, 2021 | 70 | 2021 |
Collaborating with language models for embodied reasoning I Dasgupta, C Kaeser-Chen, K Marino, A Ahuja, S Babayan, F Hill, ... arXiv preprint arXiv:2302.00763, 2023 | 69 | 2023 |
Ask your humans: Using human instructions to improve generalization in reinforcement learning V Chen, A Gupta, K Marino arXiv preprint arXiv:2011.00517, 2020 | 46 | 2020 |
Distilling internet-scale vision-language models into embodied agents T Sumers, K Marino, A Ahuja, R Fergus, I Dasgupta arXiv preprint arXiv:2301.12507, 2023 | 25 | 2023 |
Hierarchical RL Using an Ensemble of Proprioceptive Periodic Policies K Marino, A Gupta, R Fergus, A Szlam ICLR 2019, 2018 | 20 | 2018 |
Learning to navigate wikipedia by taking random walks M Zaheer, K Marino, W Grathwohl, J Schultz, W Shang, S Babayan, ... Advances in Neural Information Processing Systems 35, 1529-1541, 2022 | 5 | 2022 |
Ical: Continual learning of multimodal agents by transforming trajectories into actionable insights G Sarch, L Jang, MJ Tarr, WW Cohen, K Marino, K Fragkiadaki arXiv e-prints, arXiv: 2406.14596, 2024 | 3 | 2024 |
Vlm agents generate their own memories: Distilling experience into embodied programs of thought GH Sarch, L Jang, MJ Tarr, WW Cohen, K Marino, K Fragkiadaki The Thirty-eighth Annual Conference on Neural Information Processing Systems, 2024 | 2 | 2024 |
Empirically verifying hypotheses using reinforcement learning K Marino, R Fergus, A Szlam, A Gupta arXiv preprint arXiv:2006.15762, 2020 | 2 | 2020 |
Real time human pose estimation for boosted random forests and pose machines K Marino Robotics Institute Summer Scholars (RISS) Working Papers 2, 45-49, 2014 | 2 | 2014 |
Vlm agents generate their own memories: Distilling experience into embodied programs G Sarch, L Jang, MJ Tarr, WW Cohen, K Marino, K Fragkiadaki arXiv preprint arXiv:2406.14596, 2024 | 1 | 2024 |
Towards knowledge-capable ai: Agents that see, speak, act and know K Marino Carnegie Mellon University, 2021 | 1 | 2021 |
Efficient Exploration and Discriminative World Model Learning with an Object-Centric Abstraction A GX-Chen, K Marino, R Fergus arXiv preprint arXiv:2408.11816, 2024 | | 2024 |
Controlling agents using reporter neural networks I Dasgupta, S Chen, KD Marino, W Shang, A Ahuja US Patent App. 18/475,157, 2024 | | 2024 |
Efficient Exploration and Discriminative World Model Learning with an Object-Centric Abstraction GXC Anthony, K Marino, R Fergus CoRR, 2024 | | 2024 |