Scalability in perception for autonomous driving: Waymo open dataset P Sun, H Kretzschmar, X Dotiwalla, A Chouard, V Patnaik, P Tsui, J Guo, ... Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2020 | 3361 | 2020 |
Detecting cancer metastases on gigapixel pathology images Y Liu, K Gadepalli, M Norouzi, GE Dahl, T Kohlberger, A Boyko, ... arXiv preprint arXiv:1703.02442, 2017 | 801 | 2017 |
Detecting cancer metastases on gigapixel pathology images. arXiv 2017 Y Liu, K Gadepalli, M Norouzi, GE Dahl, T Kohlberger, A Boyko, ... arXiv preprint arXiv:1703.02442, 2017 | 65 | 2017 |
Graph-rise: Graph-regularized image semantic embedding DC Juan, CT Lu, Z Li, F Peng, A Timofeev, YT Chen, Y Gao, T Duerig, ... arXiv preprint arXiv:1902.10814, 2019 | 39 | 2019 |
From scarcity to efficiency: Improving clip training via visual-enriched captions Z Lai, H Zhang, W Wu, H Bai, A Timofeev, X Du, Z Gan, J Shan, ... | 27 | 2023 |
Ultra fine-grained image semantic embedding DC Juan, CT Lu, Z Li, F Peng, A Timofeev, YT Chen, Y Gao, T Duerig, ... Proceedings of the 13th international conference on web search and data …, 2020 | 20 | 2020 |
Veclip: Improving clip training via visual-enriched captions Z Lai, H Zhang, B Zhang, W Wu, H Bai, A Timofeev, X Du, Z Gan, J Shan, ... European Conference on Computer Vision, 111-127, 2025 | 18 | 2025 |
Training image and text embedding models Z Li, YT Chen, N Ye, Y Gao, Z Guo, A Timofeev, F Peng, TJ Duerig US Patent App. 16/265,811, 2020 | 18 | 2020 |
Mm1. 5: Methods, analysis & insights from multimodal llm fine-tuning H Zhang, M Gao, Z Gan, P Dufter, N Wenzel, F Huang, D Shah, X Du, ... arXiv preprint arXiv:2409.20566, 2024 | 16 | 2024 |
Training image and text embedding models Z Li, YT Chen, Y Gao, DC Juan, A Timofeev, CT Lu, F Peng, S Ravi, ... US Patent 11,586,927, 2023 | 11 | 2023 |
MOFI: Learning Image Representations from Noisy Entity Annotated Images W Wu, A Timofeev, C Chen, B Zhang, K Duan, S Liu, Y Zheng, J Shlens, ... arXiv preprint arXiv:2306.07952, 2023 | 8 | 2023 |
Inferring context from pixels for multimodal image classification M Shah, K Viswanathan, CT Lu, A Fuxman, Z Li, A Timofeev, C Jia, C Sun Proceedings of the 28th ACM International Conference on Information and …, 2019 | 7 | 2019 |
Multimodal image classifier using textual and visual embeddings A Fuxman, A Timofeev, Z Li, CT Lu, M Shah, C Sun, K Viswanathan, C Jia US Patent 11,907,337, 2024 | 6 | 2024 |
Detecting cancer metastases on gigapixel pathology images. CoRR abs/1703.02442 (2017) Y Liu, K Gadepalli, M Norouzi, GE Dahl, T Kohlberger, A Boyko, ... arXiv preprint arXiv:1703.02442, 2017 | 5 | 2017 |
Spatio-temporal pose/object database BA White, A Timofeev US Patent App. 17/063,330, 2021 | 2 | 2021 |
Graph-RISE: Graph-Regularized Image Semantic Embedding A Timofeev, A Tomkins, CT Lu, DC Juan, F Peng, K Viswanathan, L Gao, ... ACM WSDM, 2020 | 2 | 2020 |
From Multimodal LLMs to Generalist Embodied Agents: Methods and Lessons A Szot, B Mazoure, O Attia, A Timofeev, H Agrawal, D Hjelm, Z Gan, Z Kira, ... arXiv preprint arXiv:2412.08442, 2024 | | 2024 |
Training Image and Text Embedding Models Z Li, YT Chen, Y Gao, DC Juan, A Timofeev, CT Lu, F Peng, S Ravi, ... US Patent App. 18/741,082, 2024 | | 2024 |
SPATIO-TEMPORAL POSE/OBJECT DATABASE BA White, A Timofeev US Patent App. 18/440,363, 2024 | | 2024 |
Training image and text embedding models Z Li, YT Chen, Y Gao, DC Juan, A Timofeev, CT Lu, F Peng, S Ravi, ... US Patent 12,038,970, 2024 | | 2024 |