Exploiting cross-modal prediction and relation consistency for semisupervised image captioning Y Yang, H Wei, H Zhu, D Yu, H Xiong, J Yang IEEE Transactions on Cybernetics 54 (2), 890-902, 2022 | 41 | 2022 |
S2OSC: A holistic semi-supervised approach for open set classification Y Yang, H Wei, ZQ Sun, GY Li, Y Zhou, H Xiong, J Yang ACM Transactions on Knowledge Discovery from Data (TKDD) 16 (2), 1-27, 2021 | 27 | 2021 |
Visual context window extension: A new perspective for long video understanding H Wei, Z Chen arXiv preprint arXiv:2409.20018, 2024 | 3 | 2024 |
Improving Generalization of Image Captioning with Unsupervised Prompt Learning H Wei, Z Chen arXiv preprint arXiv:2308.02862, 2023 | 1 | 2023 |
LongCaptioning: Unlocking the Power of Long Caption Generation in Large Multimodal Models H Wei, Z Tan, Y Hu, C Chen, Z Chen arXiv preprint arXiv:2502.15393, 2025 | | 2025 |
Remote Sensing Semantic Segmentation Quality Assessment based on Vision Language Model H Shi, Z Tan, Z Zhang, H Wei, Y Hu, Y Zhang, Z Chen arXiv preprint arXiv:2502.13990, 2025 | | 2025 |
Improving Domain Generalization for Image Captioning with Unsupervised Prompt Learning H Wei, Z Chen ACM Transactions on Multimedia Computing, Communications and Applications, 2025 | | 2025 |