Прати
Andrea Burns
Andrea Burns
Google DeepMind
Верификована је имејл адреса на bu.edu - Почетна страница
Наслов
Навело
Навело
Година
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context
G Team, P Georgiev, VI Lei, R Burnell, L Bai, A Gulati, G Tanzer, ...
arXiv preprint arXiv:2403.05530, 2024
11952024
A Dataset for Interactive Vision-Language Navigation with Unknown Command Feasibility
A Burns, D Arsan, S Agrawal, R Kumar, K Saenko, BA Plummer
Accepted at ECCV 2022, arXiv preprint arXiv:2202.02312, 2022
73*2022
Language features matter: Effective language representations for vision-language tasks
A Burns, R Tan, K Saenko, S Sclaroff, BA Plummer
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2019
372019
Learning to Scale Multilingual Representations for Vision-Language Tasks
A Burns, D Kim, D Wijaya, K Saenko, BA Plummer
Accepted at ECCV 2020, arXiv preprint arXiv:2004.04312, 2020
362020
Mobile app tasks with iterative feedback (motif): Addressing task feasibility in interactive visual environments
A Burns, D Arsan, S Agrawal, R Kumar, K Saenko, BA Plummer
arXiv preprint arXiv:2104.08560, 2021
202021
ImageInWords: Unlocking Hyper-Detailed Image Descriptions
R Garg, A Burns, B Karagol Ayan, Y Bitton, C Montgomery, Y Onoe, ...
arXiv preprint arXiv:2405.02793, 2024
192024
Language-guided audio-visual source separation via trimodal consistency
R Tan, A Ray, A Burns, BA Plummer, J Salamon, O Nieto, B Russell, ...
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2023
172023
A Suite of Generative Tasks for Multi-Level Multimodal Webpage Understanding
A Burns, K Srinivasan, J Ainslie, G Brown, BA Plummer, K Saenko, J Ni, ...
EMNLP 2023, arXiv preprint arXiv:2305.03668, 2023
122023
Unsupervised Disentanglement without Autoencoding: Pitfalls and Future Directions
A Burns, A Sarna, D Krishnan, A Maschinot
arXiv preprint arXiv:2108.06613, 2021
42021
Tell Me What's Next: Textual Foresight for Generic UI Representations
A Burns, K Saenko, BA Plummer
ACL Findings 2024, arXiv preprint arXiv:2406.07822, 2024
32024
Wikiweb2m: A page-level multimodal wikipedia dataset
A Burns, K Srinivasan, J Ainslie, G Brown, BA Plummer, K Saenko, J Ni, ...
arXiv preprint arXiv:2305.05432, 2023
32023
Multispectral imaging for improved liquid classification in security sensor systems
A Burns, WU Bajwa
Algorithms and Technologies for Multispectral, Hyperspectral, and …, 2018
12018
Language-Guided Audio-Visual Source Separation via Trimodal Consistency Supplemental Material
R Tan, A Ray, A Burns, BA Plummer, J Salamon, O Nieto, B Russell, ...
Систем тренутно не може да изврши ову радњу. Пробајте поново касније.
Чланци 1–13