Folgen
Zhuowan Li
Zhuowan Li
Google Deepmind
Bestätigte E-Mail-Adresse bei google.com - Startseite
Titel
Zitiert von
Zitiert von
Jahr
Fd-gan: Pose-guided feature distilling gan for robust person re-identification
Y Ge, Z Li, H Zhao, G Yin, S Yi, X Wang, H Li
Proceedings of 32nd Conference on Neural Information Processing Systems …, 2018
4362018
Swapmix: Diagnosing and regularizing the over-reliance on visual context in visual question answering
V Gupta, Z Li, A Kortylewski, C Zhang, Y Li, A Yuille
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2022
67*2022
Super-clevr: A virtual benchmark to diagnose domain robustness in visual reasoning
Z Li, X Wang, E Stengel-Eskin, A Kortylewski, W Ma, B Van Durme, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
502023
Context-aware group captioning via self-attention and contrastive features
Z Li, Q Tran, L Mai, Z Lin, AL Yuille
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2020
492020
Visual commonsense in pretrained unimodal and multimodal models
C Zhang, B Van Durme, Z Li, E Stengel-Eskin
Proceedings of the 2022 Conference of the North American Chapter of the …, 2022
452022
Calibrating concepts and operations: Towards symbolic reasoning on real images
Z Li, E Stengel-Eskin, Y Zhang, C Xie, QH Tran, B Van Durme, A Yuille
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021
172021
Retrieval augmented generation or long-context llms? a comprehensive study and hybrid approach
Z Li, C Li, M Zhang, Q Mei, M Bendersky
Proceedings of the 2024 Conference on Empirical Methods in Natural Language …, 2024
142024
Synthesize Step-by-Step: Tools, Templates and LLMs as Data Generators for Reasoning-Based Chart VQA
Z Li, B Jasani, P Tang, S Ghadar
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
82024
3D-Aware Visual Question Answering about Parts, Poses and Occlusions
X Wang, W Ma, Z Li, A Kortylewski, A Yuille
Thirty-seventh Conference on Neural Information Processing Systems, 2023
62023
Localization vs. semantics: Visual representations in unimodal and multimodal models
Z Li, C Xie, B Van Durme, A Yuille
Proceedings of the 18th Conference of the European Chapter of the …, 2024
4*2024
Causal-CoG: A Causal-Effect Look at Context Generation for Boosting Multi-modal Language Models
S Zhao, Z Li, Y Lu, A Yuille, Y Wang
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
42024
ExoViP: Step-by-step Verification and Exploration with Exoskeleton Modules for Compositional Visual Reasoning
Y Wang, A Yuille, Z Li, Z Zheng
arXiv preprint arXiv:2408.02210, 2024
2024
Contrastive captioning for image groups
T Quan, M Long, L Zhe, L Zhuowan
US Patent US20240037939A1, 2024
2024
On the Diagnosis and Generalization of Compositional Visual Reasoning
Z Li
Johns Hopkins University, 2024
2024
Das System kann den Vorgang jetzt nicht ausführen. Versuchen Sie es später erneut.
Artikel 1–14