All in one: Exploring unified video-language pre-training J Wang, Y Ge, R Yan, Y Ge, KQ Lin, S Tsutsui, X Lin, G Cai, J Wu, Y Shan, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 233 | 2023 |
Egocentric video-language pretraining KQ Lin, J Wang, M Soldan, M Wray, R Yan, EZ XU, D Gao, RC Tu, W Zhao, ... Advances in Neural Information Processing Systems 35, 7575-7586, 2022 | 208* | 2022 |
Univtg: Towards unified video-language temporal grounding KQ Lin, P Zhang, J Chen, S Pramanick, D Gao, AJ Wang, R Yan, MZ Shou Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 126 | 2023 |
Removing the background by adding the background: Towards background robust self-supervised video representation learning J Wang, Y Gao, K Li, Y Lin, AJ Ma, H Cheng, P Peng, F Huang, R Ji, ... Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2021 | 107 | 2021 |
Object-aware video-language pre-training for retrieval J Wang, Y Ge, G Cai, R Yan, X Lin, Y Shan, X Qie, MZ Shou Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2022 | 88 | 2022 |
Enhancing Unsupervised Video Representation Learning by Decoupling the Scene and the Motion J Wang, Y Gao, K Li, X Jiang, X Guo, R Ji, X Sun Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), 2021 | 63 | 2021 |
Miles: Visual bert pre-training with injected language semantics for video-text retrieval Y Ge, Y Ge, X Liu, J Wang, J Wu, Y Shan, X Qie, P Luo European conference on computer vision, 691-708, 2022 | 48 | 2022 |
Position-guided text prompt for vision-language pre-training J Wang, P Zhou, MZ Shou, S Yan Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2023 | 37 | 2023 |
Video-text pre-training with learned regions R Yan, MZ Shou, Y Ge, AJ Wang, X Lin, G Cai, J Tang Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), 2021 | 27* | 2021 |
Too Large; Data Reduction for Vision-Language Pre-Training AJ Wang, KQ Lin, DJ Zhang, SW Lei, MZ Shou ICCV, 2023 | 23 | 2023 |
Adversarial open set domain adaptation via progressive selection of transferable target samples Y Gao, AJ Ma, Y Gao, J Wang, YS Pan Neurocomputing 410, 174-184, 2020 | 22 | 2020 |
Multi-level temporal dilated dense prediction for action recognition J Wang, Y Lin, M Zhang, Y Gao, AJ Ma IEEE Transactions on Multimedia 24, 2553-2566, 2021 | 19 | 2021 |
Self-supervised temporal discriminative learning for video representation learning J Wang, Y Lin, AJ Ma, PC Yuen arXiv preprint arXiv:2008.02129, 2020 | 17 | 2020 |
Hierarchical feature disentangling network for universal domain adaptation Y Gao, P Chen, Y Gao, J Wang, Y Pan, AJ Ma Pattern Recognition 127, 108616, 2022 | 15 | 2022 |
Multi-scale adversarial cross-domain detection with robust discriminative learning YS Pan, AJ Ma, Y Gao, JP Wang, Y Lin Proceedings of the IEEE/CVF Winter Conference on Applications of Computer …, 2020 | 13 | 2020 |
Parrot captions teach clip to spot text Y Lin, C He, AJ Wang, B Wang, W Li, MZ Shou European Conference on Computer Vision, 368-385, 2024 | 10 | 2024 |
Cosmo: Contrastive streamlined multimodal model with interleaved pre-training AJ Wang, L Li, KQ Lin, J Wang, K Lin, Z Yang, L Wang, MZ Shou arXiv preprint arXiv:2401.00849, 2024 | 10 | 2024 |
Suppressing Static Visual Cues via Normalizing Flows for Self-Supervised Video Representation Learning M Zhang, J Wang, AJ Ma AAAI2022, 2021 | 8 | 2021 |
Enhancing visual grounding in vision-language pre-training with position-guided text prompts AJ Wang, P Zhou, MZ Shou, S Yan IEEE Transactions on Pattern Analysis and Machine Intelligence 46 (5), 3406-3421, 2023 | 7 | 2023 |
Revisiting hard example for action recognition J Wang, J Hu, S Li, Z Yuan IEEE Transactions on Circuits and Systems for Video Technology 31 (2), 546-556, 2020 | 7 | 2020 |