Recognize Anything: A Strong Image Tagging Model Y Zhang, X Huang, J Ma, Z Li, Z Luo, Y Xie, Y Qin, T Luo, Y Li, S Liu, ... arXiv preprint arXiv:2306.03514, 2023 | 193 | 2023 |
Self-distillation from the last mini-batch for consistency regularization Y Shen, L Xu, Y Yang, Y Li, Y Guo Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | 80 | 2022 |
Tag2text: Guiding vision-language model via image tagging X Huang, Y Zhang, J Ma, W Tian, R Feng, Y Zhang, Y Li, Y Guo, L Zhang arXiv preprint arXiv:2303.05657, 2023 | 74 | 2023 |
Personalized image aesthetics assessment with rich attributes Y Yang, L Xu, L Li, N Qie, Y Li, P Zhang, Y Guo Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | 71 | 2022 |
Theme-aware Visual Attribute Reasoning for Image Aesthetics Assessment L Li, Y Huang, J Wu, Y Yang, Y Li, Y Guo, G Shi IEEE Transactions on Circuits and Systems for Video Technology, 2023 | 49 | 2023 |
Learning personalized image aesthetics from subjective and objective attributes H Zhu, Y Zhou, L Li, Y Li, Y Guo IEEE Transactions on Multimedia, 2021 | 46 | 2021 |
Simple and robust loss design for multi-label learning with missing labels Y Zhang, Y Cheng, X Huang, F Wen, R Feng, Y Li, Y Guo arXiv preprint arXiv:2112.07368, 2021 | 40 | 2021 |
Explainable and Generalizable Blind Image Quality Assessment via Semantic Attribute Reasoning Y Huang, L Li, Y Yang, Y Li, Y Guo IEEE Transactions on Multimedia, 2022 | 26 | 2022 |
Box-Level Active Detection M Lyu, J Zhou, H Chen, Y Huang, D Yu, Y Li, Y Guo, Y Guo, L Xiang, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 23 | 2023 |
Image Aesthetics Assessment with Attribute-Assisted Multimodal Memory Network L Li, T Zhu, P Chen, Y Yang, Y Li, W Lin IEEE Transactions on Circuits and Systems for Video Technology, 2023 | 20 | 2023 |
AesCLIP: Multi-Attribute Contrastive Learning for Image Aesthetics Assessment X Sheng, L Li, P Chen, J Wu, W Dong, Y Yang, L Xu, Y Li, G Shi Proceedings of the 31st ACM International Conference on Multimedia, 1117-1126, 2023 | 18 | 2023 |
u-LLaVA: Unifying Multi-Modal Tasks via Large Language Model J Xu, L Xu, Y Yang, X Li, Y Xie, YJ Huang, Y Li arXiv preprint arXiv:2311.05348, 2023 | 17 | 2023 |
Transductive aesthetic preference propagation for personalized image aesthetics assessment Y Li, Y Yang, H Li, H Chen, L Xu, L Li, Y Li, Y Guo Proceedings of the 30th ACM International Conference on Multimedia, 896-904, 2022 | 17 | 2022 |
Anchor-based knowledge embedding for image aesthetics assessment L Li, T Zhi, G Shi, Y Yang, L Xu, Y Li, Y Guo Neurocomputing 539, 126197, 2023 | 15 | 2023 |
A Survey for Foundation Models in Autonomous Driving H Gao, Y Li, K Long, M Yang, Y Shen arXiv preprint arXiv:2402.01105, 2024 | 14 | 2024 |
Idea: Increasing text diversity via online multi-label recognition for vision-language pre-training X Huang, Y Zhang, Y Cheng, W Tian, R Zhao, R Feng, Y Zhang, Y Li, ... Proceedings of the 30th ACM International Conference on Multimedia, 4573-4583, 2022 | 14 | 2022 |
On the efficacy of small self-supervised contrastive models without distillation signals H Shi, Y Zhang, S Tang, W Zhu, Y Li, Y Guo, Y Zhuang Proceedings of the AAAI Conference on Artificial Intelligence 36 (2), 2225-2234, 2022 | 14 | 2022 |
Psychology inspired model for hierarchical image aesthetic attribute prediction L Li, J Duan, Y Yang, L Xu, Y Li, Y Guo 2022 IEEE International Conference on Multimedia and Expo (ICME), 1-6, 2022 | 12 | 2022 |
Knowledge-Guided Blind Image Quality Assessment With Few Training Samples T Song, L Li, J Wu, Y Yang, Y Li, Y Guo, G Shi IEEE Transactions on Multimedia, 2022 | 11 | 2022 |
Mosaic Representation Learning for Self-supervised Visual Pre-training Z Wang, Z Chen, Y Li, Y Guo, J Yu, M Gong, T Liu The Eleventh International Conference on Learning Representations, 2022 | 11 | 2022 |