SOLOv2: Dynamic and fast instance segmentation X Wang, R Zhang, T Kong, L Li, C Shen Neural Information Processing Systems (NeurIPS), 2020, 2020 | 1072* | 2020 |
Vision mamba: Efficient visual representation learning with bidirectional state space model L Zhu, B Liao, Q Zhang, X Wang, W Liu, X Wang International Conference on Machine Learning (ICML), 2024, 2024 | 998 | 2024 |
SOLO: Segmenting objects by locations X Wang, T Kong, C Shen, Y Jiang, L Li European Conference on Computer Vision (ECCV), 2020, 2020 | 896 | 2020 |
Conditional Positional Encodings for Vision Transformers X Chu, Z Tian, B Zhang, X Wang, X Wei, H Xia, C Shen International Conference on Learning Representations (ICLR), 2023, 2021 | 852* | 2021 |
End-to-End Video Instance Segmentation with Transformers Y Wang, Z Xu, X Wang, C Shen, B Cheng, H Shen, H Xia Computer Vision and Pattern Recognition (CVPR), 2021, 2021 | 839 | 2021 |
Dense Contrastive Learning for Self-Supervised Visual Pre-Training X Wang, R Zhang, C Shen, T Kong, L Li Computer Vision and Pattern Recognition (CVPR), 2021, 2021 | 811 | 2021 |
EVA: Exploring the limits of masked visual representation learning at scale Y Fang, W Wang, B Xie, Q Sun, L Wu, X Wang, T Huang, X Wang, Y Cao Computer Vision and Pattern Recognition (CVPR), 2023, 2023 | 697 | 2023 |
Repulsion loss: Detecting pedestrians in a crowd X Wang, T Xiao, Y Jiang, S Shao, J Sun, C Shen Computer Vision and Pattern Recognition (CVPR), 2018, 2018 | 651 | 2018 |
EVA-CLIP: Improved training techniques for clip at scale Q Sun, Y Fang, L Wu, X Wang, Y Cao arXiv preprint arXiv:2303.15389, 2023 | 422 | 2023 |
SegGPT: Segmenting Everything In Context X Wang, X Zhang, Y Cao, W Wang, C Shen, T Huang International Conference on Computer Vision (ICCV), 2023, 2023 | 319* | 2023 |
Associatively segmenting instances and semantics in point clouds X Wang, S Liu, X Shen, C Shen, J Jia Computer Vision and Pattern Recognition (CVPR), 2019, 2019 | 308 | 2019 |
BoxInst: High-Performance Instance Segmentation with Box Annotations Z Tian, C Shen, X Wang, H Chen Computer Vision and Pattern Recognition (CVPR), 2021, 2021 | 298 | 2021 |
Images speak in images: A generalist painter for in-context visual learning X Wang, W Wang, Y Cao, C Shen, T Huang Computer Vision and Pattern Recognition (CVPR), 2023, 2023 | 240 | 2023 |
EVA-02: A visual representation for neon genesis Y Fang, Q Sun, X Wang, T Huang, X Wang, Y Cao Image and Vision Computing 149, 105171, 2023 | 226 | 2023 |
Emu: Generative pretraining in multimodality Q Sun, Q Yu, Y Cui, F Zhang, X Zhang, Y Wang, H Gao, J Liu, T Huang, ... International Conference on Learning Representations (ICLR), 2024, 2023 | 218* | 2023 |
Generative multimodal models are in-context learners Q Sun, Y Cui, X Zhang, F Zhang, Q Yu, Y Wang, Y Rao, J Liu, T Huang, ... Computer Vision and Pattern Recognition (CVPR), 2024, 2023 | 195 | 2023 |
Poseur: Direct human pose regression with transformers W Mao, Y Ge, C Shen, Z Tian, X Wang, Z Wang, A den Hengel European Conference on Computer Vision (ECCV), 2022, 2022 | 192* | 2022 |
FreeSOLO: Learning to segment objects without annotations X Wang, Z Yu, S De Mello, J Kautz, A Anandkumar, C Shen, JM Alvarez Computer Vision and Pattern Recognition (CVPR), 2022, 2022 | 131 | 2022 |
SOLO: A Simple Framework for Instance Segmentation X Wang, R Zhang, C Shen, T Kong, L Li IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021 | 109 | 2021 |
Zero-shot video editing using off-the-shelf image diffusion models W Wang, Y Jiang, K Xie, Z Liu, H Chen, Y Cao, X Wang, C Shen arXiv preprint arXiv:2303.17599, 2023 | 102 | 2023 |