mplug-owl: Modularization empowers large language models with multimodality Q Ye, H Xu, G Xu, J Ye, M Yan, Y Zhou, J Wang, A Hu, P Shi, Y Shi, C Li, ... arXiv preprint arXiv:2304.14178, 2023 | 833 | 2023 |
mplug-2: A modularized multi-modal foundation model across text, image and video H Xu, Q Ye, M Yan, Y Shi, J Ye, Y Xu, C Li, B Bi, Q Qian, W Wang, G Xu, ... International Conference on Machine Learning, 38728-38748, 2023 | 132 | 2023 |
Towards understanding label smoothing Y Xu, Y Xu, Q Qian, H Li, R Jin arXiv preprint arXiv:2006.11653, 2020 | 52 | 2020 |
Unsupervised visual representation learning by online constrained k-means Q Qian, Y Xu, J Hu, H Li, R Jin Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | 37 | 2022 |
Intra-modal proxy learning for zero-shot visual categorization with clip Q Qian, Y Xu, J Hu Advances in Neural Information Processing Systems 36, 25461-25474, 2023 | 18 | 2023 |
An empirical study on distribution shift robustness from the perspective of pre-training and data augmentation Z Liu, Y Xu, Y Xu, Q Qian, H Li, R Jin, X Ji, AB Chan arXiv preprint arXiv:2205.12753, 2022 | 16 | 2022 |
Weakly supervised representation learning with coarse labels Y Xu, Q Qian, H Li, R Jin, J Hu Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021 | 13 | 2021 |
Improved fine-tuning by leveraging pre-training data: Theory and practice Z Liu, Y Xu, Y Xu, Q Qian, H Li, AB Chan, R Jin | 11 | 2021 |
Improved visual fine-tuning with natural language supervision J Wang, Y Xu, J Hu, M Yan, J Sang, Q Qian Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 7 | 2023 |
K2NN: Self-Supervised Learning with Hierarchical Nearest Neighbors for Remote Sensing J Yuan, Y Xu, Z Wang ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 1 | 2023 |
Representation Learning with Fine-grained Patterns Y Xu, Q Qian, H Li, R Jin, J Hu arXiv preprint ArXiv:2005.09681, 2020 | 1 | 2020 |
SeA: Semantic Adversarial Augmentation for Last Layer Features from Unsupervised Representation Learning Q Qian, Y Xu, J Hu European Conference on Computer Vision, 1-17, 2024 | | 2024 |