Deductive verification of chain-of-thought reasoning
Z Ling*, Y Fang*, X Li, Z Huang, M Lee, R Memisevic, H Su
Advances in Neural Information Processing Systems 36, 2024
Cited by: 114. Year: 2024

Vila-u: a unified foundation model integrating visual understanding and generation
Y Wu, Z Zhang, J Chen, H Tang, D Li, Y Fang, L Zhu, E Xie, H Yin, L Yi, ...
arXiv preprint arXiv:2409.04429, 2024
Cited by: 34. Year: 2024

Longvila: Scaling long-context visual language models for long videos
F Xue, Y Chen, D Li, Q Hu, L Zhu, X Li, Y Fang, H Tang, S Yang, Z Liu, ...
arXiv preprint arXiv:2408.10188, 2024
Cited by: 29. Year: 2024

Distilling large vision-language model with out-of-distribution generalizability
X Li*, Y Fang*, M Liu, Z Ling, Z Tu, H Su
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
Cited by: 29. Year: 2023

VILA²: VILA Augmented VILA
Y Fang*, L Zhu*, Y Lu, Y Wang, P Molchanov, JH Cho, M Pavone, S Han, ...
arXiv preprint arXiv:2407.17453, 2024
Cited by: 12. Year: 2024

PartSLIP++: Enhancing Low-Shot 3D Part Segmentation via Multi-View Instance Segmentation and Maximum Likelihood Estimation
Y Zhou, J Gu, X Li, M Liu, Y Fang, H Su
arXiv preprint arXiv:2312.03015, 2023
Cited by: 9. Year: 2023

NVILA: Efficient frontier visual language models
Z Liu, L Zhu, B Shi, Z Zhang, Y Lou, S Yang, H Xi, S Cao, Y Gu, D Li, X Li, ...
arXiv preprint arXiv:2412.04468, 2024
Cited by: 6. Year: 2024

Unleashing the Creative Mind: Language Model As Hierarchical Policy For Improved Exploration on Challenging Problem Solving
Z Ling, Y Fang, X Li, T Mu, M Lee, R Pourreza, R Memisevic, H Su
Cited by: 2. Year: 2023

Policy Decorator: Model-Agnostic Online Refinement for Large Policy Model
X Yuan, T Mu, S Tao, Y Fang, M Zhang, H Su
arXiv preprint arXiv:2412.13630, 2024
Year: 2024