Следене
Yunhao Fang
Yunhao Fang
Research Scientist @ ByteDance
Потвърден имейл адрес: bytedance.com - Начална страница
Заглавие
Позовавания
Позовавания
Година
Deductive verification of chain-of-thought reasoning
Z Ling*, Y Fang*, X Li, Z Huang, M Lee, R Memisevic, H Su
Advances in Neural Information Processing Systems 36, 2024
1172024
Vila-u: a unified foundation model integrating visual understanding and generation
Y Wu, Z Zhang, J Chen, H Tang, D Li, Y Fang, L Zhu, E Xie, H Yin, L Yi, ...
arXiv preprint arXiv:2409.04429, 2024
392024
Longvila: Scaling long-context visual language models for long videos
F Xue, Y Chen, D Li, Q Hu, L Zhu, X Li, Y Fang, H Tang, S Yang, Z Liu, ...
arXiv preprint arXiv:2408.10188, 2024
352024
Distilling large vision-language model with out-of-distribution generalizability
X Li*, Y Fang*, M Liu, Z Ling, Z Tu, H Su
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
312023
VILA: VILA Augmented VILA
Y Fang*, L Zhu*, Y Lu, Y Wang, P Molchanov, JH Cho, M Pavone, S Han, ...
arXiv preprint arXiv:2407.17453, 2024
152024
Partslip++: Enhancing low-shot 3d part segmentation via multi-view instance segmentation and maximum likelihood estimation
Y Zhou, J Gu, X Li, M Liu, Y Fang, H Su
arXiv preprint arXiv:2312.03015, 2023
102023
NVILA: Efficient frontier visual language models
Z Liu, L Zhu, B Shi, Z Zhang, Y Lou, S Yang, H Xi, S Cao, Y Gu, D Li, X Li, ...
arXiv preprint arXiv:2412.04468, 2024
72024
Unleashing the creative mind: Language model as hierarchical policy for improved exploration on challenging problem solving
Z Ling, Y Fang, X Li, T Mu, M Lee, R Pourreza, R Memisevic, H Su
arXiv preprint arXiv:2311.00694, 2023
22023
Policy Decorator: Model-Agnostic Online Refinement for Large Policy Model
X Yuan, T Mu, S Tao, Y Fang, M Zhang, H Su
arXiv preprint arXiv:2412.13630, 2024
2024
Системата не може да изпълни операцията сега. Опитайте отново по-късно.
Статии 1–9