עקוב אחר
Yifan Du
Yifan Du
כתובת אימייל מאומתת בדומיין ruc.edu.cn - דף הבית
כותרת
צוטט על ידי
צוטט על ידי
שנה
A survey of large language models
WX Zhao, K Zhou, J Li, T Tang, X Wang, Y Hou, Y Min, B Zhang, J Zhang, ...
arXiv preprint arXiv:2303.18223 1 (2), 2023
4441*2023
Evaluating object hallucination in large vision-language models
Y Li, Y Du, K Zhou, J Wang, WX Zhao, JR Wen
arXiv preprint arXiv:2305.10355, 2023
8042023
A survey of vision-language pre-trained models
Y Du, Z Liu, J Li, WX Zhao
IJCAI 2022, 2022
2272022
What makes for good visual instructions? synthesizing complex visual reasoning instructions for visual instruction tuning
Y Du, H Guo, K Zhou, WX Zhao, J Wang, C Wang, M Cai, R Song, JR Wen
arXiv preprint arXiv:2311.01487, 2023
172023
Learning to imagine: Visually-augmented natural language generation
T Tang, Y Chen, Y Du, J Li, WX Zhao, JR Wen
arXiv preprint arXiv:2305.16944, 2023
132023
Needle in a video haystack: A scalable synthetic framework for benchmarking video mllms
Z Zhao, H Lu, Y Huo, Y Du, T Yue, L Guo, B Wang, W Chen, J Liu
arXiv e-prints, arXiv: 2406.09367, 2024
92024
Zero-shot visual question answering with language model feedback
Y Du, J Li, T Tang, WX Zhao, JR Wen
arXiv preprint arXiv:2305.17006, 2023
92023
Towards event-oriented long video understanding
Y Du, K Zhou, Y Huo, Y Li, WX Zhao, H Lu, Z Zhao, B Wang, W Chen, ...
arXiv preprint arXiv:2406.14129, 2024
72024
Virgo: A Preliminary Exploration on Reproducing o1-like MLLM
Y Du, Z Liu, Y Li, WX Zhao, Y Huo, B Wang, W Chen, Z Liu, Z Wang, ...
arXiv preprint arXiv:2501.01904, 2025
32025
Exploring the design space of visual context representation in video mllms
Y Du, Y Huo, K Zhou, Z Zhao, H Lu, H Huang, WX Zhao, B Wang, W Chen, ...
arXiv preprint arXiv:2410.13694, 2024
12024
Needle In A Video Haystack: A Scalable Synthetic Evaluator for Video MLLMs
Z Zhao, H Lu, Y Huo, Y Du, T Yue, L Guo, B Wang, W Chen, J Liu
arXiv preprint arXiv:2406.09367, 2024
2024
המערכת אינה יכולה לבצע את הפעולה כעת. נסה שוב מאוחר יותר.
מאמרים 1–11