Mmt-bench: A comprehensive multimodal benchmark for evaluating large vision-language models towards multitask agi K Ying, F Meng, J Wang, Z Li, H Lin, Y Yang, H Zhang, W Zhang, Y Lin, ... ICML 2024, 2024 | 57 | 2024 |
B-AVIBench: Towards Evaluating the Robustness of Large Vision-Language Model on Black-box Adversarial Visual-Instructions H Zhang, W Shao, H Liu, Y Ma, P Luo, Y Qiao, N Zheng, K Zhang IEEE Transactions on Information Forensics and Security, 2024 | 18* | 2024 |
Convbench: A multi-turn conversation evaluation benchmark with hierarchical capability for large vision-language models S Liu, K Ying, H Zhang, Y Yang, Y Lin, T Zhang, C Li, Y Qiao, P Luo, ... arXiv preprint arXiv:2403.20194, 2024 | 13 | 2024 |
Scgnet: Shifting and cascaded group network H Zhang, S Lai, Y Wang, Z Da, Y Dun, X Qian IEEE Transactions on Circuits and Systems for Video Technology 33 (9), 4997-5008, 2023 | 13 | 2023 |
HF-HRNet: a simple hardware friendly high-resolution network H Zhang, Y Dun, Y Pei, S Lai, C Liu, K Zhang, X Qian IEEE Transactions on Circuits and Systems for Video Technology, 2024 | 10 | 2024 |
Open-Vocabulary Animal Keypoint Detection with Semantic-Feature Matching H Zhang, L Xu, S Lai, W Shao, N Zheng, P Luo, Y Qiao, K Zhang International Journal of Computer Vision, 1-18, 2024 | 6* | 2024 |
FMGNet: An efficient feature-multiplex group network for real-time vision task H Zhang, Y Ma, K Zhang, N Zheng, S Lai Pattern Recognition, 110698, 2024 | 4 | 2024 |
Data Adaptive Traceback for Vision-Language Foundation Models in Image Classification W Peng, K Zhang, Y Yang, H Zhang, Y Qiao Proceedings of the AAAI Conference on Artificial Intelligence 38 (5), 4506-4514, 2024 | 1 | 2024 |
HRVMamba: High-Resolution Visual State Space Model for Dense Prediction H Zhang, Y Ma, W Shao, P Luo, N Zheng, K Zhang arXiv preprint arXiv:2410.03174, 2024 | | 2024 |