Video-mme: The first-ever comprehensive evaluation benchmark of multi-modal llms in video analysis C Fu, Y Dai, Y Luo, L Li, S Ren, R Zhang, Z Wang, C Zhou, Y Shen, ... arXiv preprint arXiv:2405.21075, 2024 | 137 | 2024 |
Occlude them all: Occlusion-aware attention network for occluded person re-id P Chen, W Liu, P Dai, J Liu, Q Ye, M Xu, Q Chen, R Ji Proceedings of the IEEE/CVF international conference on computer vision …, 2021 | 132 | 2021 |
Dual distribution alignment network for generalizable person re-identification P Chen, P Dai, J Liu, F Zheng, M Xu, Q Tian, R Ji Proceedings of the AAAI conference on artificial intelligence 35 (2), 1054-1062, 2021 | 57 | 2021 |
A challenger to gpt-4v? early explorations of gemini in visual expertise C Fu, R Zhang, H Lin, Z Wang, T Gao, Y Luo, Y Huang, Z Zhang, L Qiu, ... arXiv preprint arXiv:2312.12436, 2023 | 51 | 2023 |
Arm: Any-time super-resolution method B Chen, M Lin, K Sheng, M Zhang, P Chen, K Li, L Cao, R Ji European Conference on Computer Vision, 254-270, 2022 | 35 | 2022 |
Multi-modal queried object detection in the wild Y Xu, M Zhang, C Fu, P Chen, X Yang, K Li, C Xu Advances in Neural Information Processing Systems 36, 2024 | 32 | 2024 |
Aligning and prompting everything all at once for universal visual perception Y Shen, C Fu, P Chen, M Zhang, K Li, X Sun, Y Wu, S Lin, R Ji Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 27 | 2024 |
Open vocabulary object detection with proposal mining and prediction equalization P Chen, K Sheng, M Zhang, M Lin, Y Shen, S Lin, B Ren, K Li arXiv preprint arXiv:2206.11134, 2022 | 23 | 2022 |
Efficient decoder-free object detection with transformers P Chen, M Zhang, Y Shen, K Sheng, Y Gao, X Sun, K Li, C Shen European Conference on Computer Vision, 70-86, 2022 | 19 | 2022 |
Aha! adaptive history-driven attack for decision-based black-box models J Li, R Ji, P Chen, B Zhang, X Hong, R Zhang, S Li, J Li, F Huang, Y Wu Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021 | 19 | 2021 |
Deep adversarial data augmentation with attribute guided for person re-identification Q Wu, P Dai, P Chen, Y Huang Signal, Image and Video Processing 15 (4), 655-662, 2021 | 17 | 2021 |
Mme: A comprehensive evaluation benchmark for multimodal large language models, 2024 C Fu, P Chen, Y Shen, Y Qin, M Zhang, X Lin, J Yang, X Zheng, K Li, ... URL https://arxiv. org/abs/2306.13394 2, 0 | 16 | |
Disentangling task-oriented representations for unsupervised domain adaptation P Dai, P Chen, Q Wu, X Hong, Q Ye, Q Tian, CW Lin, R Ji IEEE Transactions on Image Processing 31, 1012-1026, 2021 | 14 | 2021 |
Cantor: Inspiring multimodal chain-of-thought of mllm T Gao, P Chen, M Zhang, C Fu, Y Shen, Y Zhang, S Zhang, X Zheng, ... Proceedings of the 32nd ACM International Conference on Multimedia, 9096-9105, 2024 | 10 | 2024 |
MME: a comprehensive evaluation benchmark for multimodal large language models. CoRR abs/2306.13394 (2023) C Fu, P Chen, Y Shen, Y Qin, M Zhang, X Lin, Z Qiu, W Lin, J Yang, ... | 5 | 2023 |
Video-mme: The firstever comprehensive evaluation benchmark of multi-modal llms in video analysis, 2024 C Fu, Y Dai, Y Luo, L Li, S Ren, R Zhang, Z Wang, C Zhou, Y Shen, ... URL https://arxiv. org/abs/2405.21075, 0 | 5 | |
VEGA: Learning Interleaved Image-Text Comprehension in Vision-Language Large Models C Zhou, M Zhang, P Chen, C Fu, Y Shen, X Zheng, X Sun, R Ji arXiv preprint arXiv:2406.10228, 2024 | 2 | 2024 |
Video-based Person Re-identification with Two-stream Convolutional Network and Co-attentive Snippet Embedding P Chen, P Dai, Q Wu, Y Huang arXiv preprint arXiv:1905.11862, 2019 | 1 | 2019 |
Multimodal Inplace Prompt Tuning for Open-set Object Detection G Li, M Zhang, X Zheng, P Chen, Z Wang, Y Shen, M Zhuge, C Wu, ... Proceedings of the 32nd ACM International Conference on Multimedia, 8062-8071, 2024 | | 2024 |
Mme: A comprehensive evaluation benchmark for multimodal large language models C Fu, P Chen, Y Shen, Y Qin, M Zhang, X Lin, J Yang, X Zheng, K Li, ... arXiv preprint arXiv:2306.13394, 2023 | | 2023 |