关注
Le Xue
Le Xue
Senior Applied Scientist, Salesforce Research
在 salesforce.com 的电子邮件经过验证
标题
引用次数
引用次数
年份
Ulip: Learning a unified representation of language, images, and point clouds for 3d understanding
L Xue, M Gao, C Xing, R Martín-Martín, J Wu, C Xiong, R Xu, JC Niebles, ...
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2023
2372023
ULIP-2: Towards Scalable Multimodal Pre-training for 3D Understanding
L Xue, N Yu, S Zhang, J Li, R Martín-Martín, J Wu, C Xiong, R Xu, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
992023
Bolaa: Benchmarking and orchestrating llm-augmented autonomous agents
Z Liu, W Yao, J Zhang, L Xue, S Heinecke, R Murthy, Y Feng, Z Chen, ...
arXiv preprint arXiv:2308.05960, 2023
772023
Retroformer: Retrospective large language agents with policy gradient optimization
W Yao, S Heinecke, JC Niebles, Z Liu, Y Feng, L Xue, R Murthy, Z Chen, ...
arXiv preprint arXiv:2308.02151, 2023
592023
xgen-mm (blip-3): A family of open large multimodal models
L Xue, M Shu, A Awadalla, J Wang, A Yan, S Purushwalkam, H Zhou, ...
arXiv preprint arXiv:2408.08872, 2024
432024
X-instructblip: A framework for aligning x-modal instruction-aware representations to llms and emergent cross-modal reasoning
A Panagopoulou, L Xue, N Yu, J Li, D Li, S Joty, R Xu, S Savarese, ...
arXiv preprint arXiv:2311.18799, 2023
422023
MINT-1T: Scaling Open-Source Multimodal Data by 10x: A Multimodal Dataset with One Trillion Tokens
A Awadalla, L Xue, O Lo, M Shu, H Lee, EK Guha, M Jordan, S Shen, ...
arXiv preprint arXiv:2406.11271, 2024
212024
Directed weighted network structure analysis of complex impedance measurements for characterizing oil-in-water bubbly flow
ZK Gao, WD Dang, L Xue, SS Zhang
Chaos: An Interdisciplinary Journal of Nonlinear Science 27 (3), 2017
152017
Rex: Rapid exploration and exploitation for ai agents
R Murthy, S Heinecke, JC Niebles, Z Liu, L Xue, W Yao, Y Feng, Z Chen, ...
arXiv preprint arXiv:2307.08962, 2023
82023
Robustness evaluation of transformer-based form field extractors via form attacks
L Xue, M Gao, Z Chen, C Xiong, R Xu
International Conference on Document Analysis and Recognition, 167-184, 2023
62023
xgen-mm-vid (blip-3-video): You only need 32 tokens to represent a video even in vlms
MS Ryoo, H Zhou, S Kendre, C Qin, L Xue, M Shu, S Savarese, R Xu, ...
arXiv preprint arXiv:2410.16267, 2024
52024
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition
L Xue, M Gao, C Xing, R Martín-Martín, J Wu, C Xiong, R Xu, JC Niebles, ...
52023
Docquerynet: Value retrieval with arbitrary queries for form-like documents
M Gao, L Xue, C Ramaiah, C Xing, R Xu, C Xiong
Proceedings of the 29th International Conference on Computational …, 2022
5*2022
xGen-VideoSyn-1: High-fidelity Text-to-Video Synthesis with Compressed Representations
C Qin, C Xia, K Ramakrishnan, M Ryoo, L Tu, Y Feng, M Shu, H Zhou, ...
arXiv preprint arXiv:2408.12590, 2024
22024
Image analysis based document processing for inference of key-value pairs in non-fixed digital documents
M Gao, C Zeyuan, L Xue, R Xu, C Xiong
US Patent 11,699,297, 2023
22023
Model-Agnostic Hierarchical Attention for 3D Object Detection
M Shu, L Xue, N Yu, R Martín-Martín, JC Niebles, C Xiong, R Xu
arXiv preprint arXiv:2301.02650, 2023
22023
ProVision: Programmatically Scaling Vision-centric Instruction Data for Multimodal Language Models
J Zhang, L Xue, L Song, J Wang, W Huang, M Shu, A Yan, Z Ma, ...
arXiv preprint arXiv:2412.07012, 2024
12024
X-InstructBLIP: A Framework for Aligning Image, 3D, Audio, Video to LLMs and its Emergent Cross-Modal Reasoning
A Panagopoulou, L Xue, N Yu, J Li, D Li, S Joty, R Xu, S Savarese, ...
European Conference on Computer Vision, 177-197, 2024
12024
SYSTEMS AND METHODS FOR LANGUAGE AGENT OPTIMIZATION
W Yao, S Heinecke, JC Niebles Duque, Z Liu, Y Feng, L Xue, R Murthy, ...
US Patent App. 18/498,257, 2025
2025
BLIP3-KALE: Knowledge Augmented Large-Scale Dense Captions
A Awadalla, L Xue, M Shu, A Yan, J Wang, S Purushwalkam, S Shen, ...
arXiv preprint arXiv:2411.07461, 2024
2024
系统目前无法执行此操作,请稍后再试。
文章 1–20