SVIT: Scaling up visual instruction tuning | B Zhao, B Wu, M He, T Huang | arXiv preprint arXiv:2307.04087, 2023 | 126 | 2023 |
Efficient Multimodal Learning from Data-centric Perspective | M He, Y Liu, B Wu, J Yuan, Y Wang, T Huang, B Zhao | arXiv preprint arXiv:2402.11530, 2024 | 89 | 2024 |
Efficient Multimodal Large Language Models: A Survey | Y Jin, J Li, Y Liu, T Gu, K Wu, Z Jiang, M He, B Zhao, X Tan, Z Gan, ... | arXiv preprint arXiv:2405.10739, 2024 | 45 | 2024 |
Large-scale dataset pruning with dynamic uncertainty | M He, S Yang, T Huang, B Zhao | Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 25 | 2024 |
Unveiling the Ignorance of MLLMs: Seeing Clearly, Answering Incorrectly | Y Liu, Z Liang, Y Wang, X Wu, F Tang, M He, J Li, Z Liu, H Yang, S Lim, ... | arXiv preprint arXiv:2406.10638, 2024 | 7* | 2024 |