Supervision exists everywhere: A data efficient contrastive language-image pre-training paradigm Y Li*, F Liang*, L Zhao*, Y Cui, W Ouyang, J Shao, F Yu, J Yan International Conference on Learning Representations (ICLR) 2022, 2021 | 485 | 2021 |
Emu: Generative Pretraining in Multimodality Q Sun*, Q Yu*, Y Cui*, F Zhang*, X Zhang*, Y Wang, H Gao, J Liu, ... The Twelfth International Conference on Learning Representations, 2023 | 218* | 2023 |
Emu2: Generative multimodal models are in-context learners Q Sun*, Y Cui*, X Zhang*, F Zhang*, Q Yu*, Z Luo, Y Wang, Y Rao, J Liu, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 197* | 2023 |
Emu3: Next-token prediction is all you need X Wang*, X Zhang*, Z Luo*, Q Sun*, Y Cui*, J Wang*, F Zhang*, Y Wang*, ... arXiv preprint arXiv:2409.18869, 2024 | 70 | 2024 |
Fast-BEV: A Fast and Strong Bird's-Eye View Perception Baseline Y Li, B Huang, Z Chen, Y Cui, F Liang, M Shen, F Liu, E Xie, L Sheng, ... IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023 | 44 | 2023 |
CapsFusion: Rethinking image-text data at scale Q Yu, Q Sun, X Zhang, Y Cui, F Zhang, Y Cao, X Wang, J Liu Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 43 | 2024 |
Democratizing contrastive language-image pre-training: A CLIP benchmark of data, model, and supervision Y Cui, L Zhao, F Liang, Y Li, J Shao ICML First Workshop on Pre-training 2022, 2022 | 40 | 2022 |
Multi-modal gait recognition via effective spatial-temporal feature fusion Y Cui, Y Kang Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 35 | 2023 |
EVA-CLIP-18B: Scaling CLIP to 18 billion parameters Q Sun, J Wang, Q Yu, Y Cui, F Zhang, X Zhang, X Wang arXiv preprint arXiv:2402.04252, 2024 | 29 | 2024 |
Unveiling Encoder-Free Vision-Language Models H Diao*, Y Cui*, X Li, Y Wang, H Lu, X Wang arXiv preprint arXiv:2406.11832, 2024 | 13 | 2024 |
GaitTransformer: Multiple-temporal-scale transformer for cross-view gait recognition Y Cui, Y Kang 2022 IEEE International Conference on Multimedia and Expo (ICME), 1-6, 2022 | 13 | 2022 |
Learning Multiple Granularity Features for Unsupervised Person Re-Identification S Wang*, Y Cui*, Y Kang 2022 IEEE International Conference on Multimedia and Expo (ICME), 1-6, 2022 | 2 | 2022 |
Autoregressive Video Generation without Vector Quantization H Deng, T Pan, H Diao, Z Luo, Y Cui, H Lu, S Shan, Y Qi, X Wang arXiv preprint arXiv:2412.14169, 2024 | | 2024 |