Lvlm-ehub: A comprehensive evaluation benchmark for large vision-language models P Xu, W Shao, K Zhang, P Gao, S Liu, M Lei, F Meng, S Huang, Y Qiao, ... TPAMI, 2023 | 183 | 2023 |
Mmt-bench: A comprehensive multimodal benchmark for evaluating large vision-language models towards multitask agi K Ying*, F Meng*, J Wang*, Z Li, H Lin, Y Yang, H Zhang, W Zhang, Y Lin, ... ICML 2024, 2024 | 61 | 2024 |
Chartassisstant: A universal chart multimodal language model via chart-to-table pre-training and multitask instruction tuning F Meng, W Shao, Q Lu, P Gao, K Zhang, Y Qiao, P Luo ACL 2024, 2024 | 47 | 2024 |
Tiny lvlm-ehub: Early multimodal experiments with bard W Shao, Y Hu, P Gao, M Lei, K Zhang, F Meng, P Xu, S Huang, H Li, ... TBD, 2023 | 34 | 2023 |
Gui odyssey: A comprehensive dataset for cross-app gui navigation on mobile devices Q Lu, W Shao, Z Liu, F Meng, B Li, B Chen, S Huang, K Zhang, Y Qiao, ... arXiv preprint arXiv:2406.08451, 2024 | 22 | 2024 |
MMIU: Multimodal Multi-image Understanding for Evaluating Large Vision-Language Models F Meng*, J Wang*, C Li*, Q Lu, H Tian, J Liao, X Zhu, J Dai, Y Qiao, P Luo, ... ICLR 2025, 2024 | 11 | 2024 |
Foundation Model is Efficient Multimodal Multitask Model Selector F Meng, W Shao, Z Peng, C Jiang, K Zhang, Y Qiao, P Luo NIPS 2023, 2024 | 11 | 2024 |
Mipi 2023 challenge on rgbw remosaic: Methods and results Q Sun, Q Yang, C Li, S Zhou, R Feng, Y Dai, W Sun, Q Zhu, CC Loy, J Gu, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 8 | 2023 |
Towards world simulator: Crafting physical commonsense-based benchmark for video generation F Meng, J Liao, X Tan, W Shao, Q Lu, K Zhang, Y Cheng, D Li, Y Qiao, ... arXiv preprint arXiv:2410.05363, 2024 | 7 | 2024 |
Phybench: A physical commonsense benchmark for evaluating text-to-image models F Meng, W Shao, L Luo, Y Wang, Y Chen, Q Lu, Y Yang, T Yang, K Zhang, ... arXiv preprint arXiv:2406.11802, 2024 | 4 | 2024 |
Otst: A two-phase framework for joint denoising and remosaicing in rgbw cfa Z Fan, X Wu, F Meng, Y Wu, F Zhang Proceedings of the ieee/cvf conference on computer vision and pattern …, 2023 | 4 | 2023 |
Tinylvlm-ehub: Towards comprehensive and efficient evaluation for large vision-language models W Shao, M Lei, Y Hu, P Gao, P Xu, K Zhang, F Meng, S Huang, H Li, ... IEEE Transactions on Big Data, 2025 | 2 | 2025 |
CAU: A causality attention unit for spatial-temporal sequence forecast B Qin, F Meng, S Yuan, B Mu IEEE Transactions on Multimedia 26, 4749-4763, 2023 | 2 | 2023 |
An Efficient Transformer For Demosaicing Via Compressed Multi-Branch Attention Mechanism X Wu*, F Meng*, Y Wu, J Zhang, F Zhang ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 1 | 2024 |