Yufeng Cui
Other names: 崔 玉峰
Beijing Academy of Artificial Intelligence
Verified email at buaa.edu.cn
Title
Cited by
Year
Supervision exists everywhere: A data efficient contrastive language-image pre-training paradigm
Y Li*, F Liang*, L Zhao*, Y Cui, W Ouyang, J Shao, F Yu, J Yan
International Conference on Learning Representations (ICLR) 2022, 2021
Citations: 489; year: 2021
Emu: Generative Pretraining in Multimodality
Q Sun*, Q Yu*, Y Cui*, F Zhang*, X Zhang*, Y Wang, H Gao, J Liu, ...
The Twelfth International Conference on Learning Representations, 2023
Citations: 235; year: 2023
Emu2: Generative multimodal models are in-context learners
Q Sun*, Y Cui*, X Zhang*, F Zhang*, Q Yu*, Z Luo, Y Wang, Y Rao, J Liu, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
Citations: 213*; year: 2023
Emu3: Next-token prediction is all you need
X Wang*, X Zhang*, Z Luo*, Q Sun*, Y Cui*, J Wang*, F Zhang*, Y Wang*, ...
arXiv preprint arXiv:2409.18869, 2024
Citations: 88*; year: 2024
Capsfusion: Rethinking image-text data at scale
Q Yu, Q Sun, X Zhang, Y Cui, F Zhang, Y Cao, X Wang, J Liu
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
Citations: 47; year: 2024
Fast-BEV: A Fast and Strong Bird's-Eye View Perception Baseline
Y Li, B Huang, Z Chen, Y Cui, F Liang, M Shen, F Liu, E Xie, L Sheng, ...
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023
Citations: 46; year: 2023
Multi-modal gait recognition via effective spatial-temporal feature fusion
Y Cui, Y Kang
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
Citations: 40; year: 2023
Democratizing contrastive language-image pre-training: A clip benchmark of data, model, and supervision
Y Cui, L Zhao, F Liang, Y Li, J Shao
ICML First Workshop on Pre-training 2022, 2022
Citations: 40; year: 2022
Eva-clip-18b: Scaling clip to 18 billion parameters
Q Sun, J Wang, Q Yu, Y Cui, F Zhang, X Zhang, X Wang
arXiv preprint arXiv:2402.04252, 2024
Citations: 32; year: 2024
EVE: Unveiling Encoder-Free Vision-Language Models
H Diao*, Y Cui*, X Li, Y Wang, H Lu, X Wang
arXiv preprint arXiv:2406.11832, 2024
Citations: 19*; year: 2024
Gaittransformer: Multiple-temporal-scale transformer for cross-view gait recognition
Y Cui, Y Kang
2022 IEEE International Conference on Multimedia and Expo (ICME), 1-6, 2022
Citations: 14; year: 2022
Infinity-mm: Scaling multimodal performance with large-scale and high-quality instruction data
S Gu, J Zhang, S Zhou, K Yu, Z Xing, L Wang, Z Cao, J Jia, Z Zhang, ...
arXiv preprint arXiv:2410.18558, 2024
Citations: 8; year: 2024
Learning Multiple Granularity Features for Unsupervised Person Re-Identification
S Wang*, Y Cui*, Y Kang
2022 IEEE International Conference on Multimedia and Expo (ICME), 1-6, 2022
Citations: 2; year: 2022
Autoregressive Video Generation without Vector Quantization
H Deng, T Pan, H Diao, Z Luo, Y Cui, H Lu, S Shan, Y Qi, X Wang
arXiv preprint arXiv:2412.14169, 2024
Citations: 1; year: 2024
EVEv2: Improved Baselines for Encoder-Free Vision-Language Models
H Diao*, X Li*, Y Cui*, Y Wang*, H Deng, T Pan, W Wang, H Lu, X Wang
arXiv preprint arXiv:2502.06788, 2025
Year: 2025