All in one: Exploring unified video-language pre-training J Wang, Y Ge, R Yan, Y Ge, KQ Lin, S Tsutsui, X Lin, G Cai, J Wu, Y Shan, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 231 | 2023 |
Egocentric video-language pretraining KQ Lin, J Wang, M Soldan, M Wray, R Yan, EZ XU, D Gao, RC Tu, W Zhao, ... Advances in Neural Information Processing Systems 35, 7575-7586, 2022 | 178 | 2022 |
Removing the background by adding the background: Towards background robust self-supervised video representation learning J Wang, Y Gao, K Li, Y Lin, AJ Ma, H Cheng, P Peng, F Huang, R Ji, ... Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2021 | 108 | 2021 |
Object-aware video-language pre-training for retrieval J Wang, Y Ge, G Cai, R Yan, X Lin, Y Shan, X Qie, MZ Shou Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2022 | 89 | 2022 |
Enhancing Unsupervised Video Representation Learning by Decoupling the Scene and the Motion J Wang, Y Gao, K Li, X Jiang, X Guo, R Ji, X Sun Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), 2021 | 62 | 2021 |
Miles: Visual bert pre-training with injected language semantics for video-text retrieval Y Ge, Y Ge, X Liu, J Wang, J Wu, Y Shan, X Qie, P Luo European conference on computer vision, 691-708, 2022 | 48 | 2022 |
Position-guided text prompt for vision-language pre-training J Wang, P Zhou, MZ Shou, S Yan Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 40 | 2023 |
Too Large; Data Reduction for Vision-Language Pre-Training AJ Wang, KQ Lin, DJ Zhang, SW Lei, MZ Shou ICCV, 2023 | 25 | 2023 |
Adversarial open set domain adaptation via progressive selection of transferable target samples Y Gao, AJ Ma, Y Gao, J Wang, YS Pan Neurocomputing 410, 174-184, 2020 | 22 | 2020 |
Video-text pre-training with learned regions R Yan, MZ Shou, Y Ge, AJ Wang, X Lin, G Cai, J Tang Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), 2021 | 20 | 2021 |
Multi-level temporal dilated dense prediction for action recognition J Wang, Y Lin, M Zhang, Y Gao, AJ Ma IEEE Transactions on Multimedia 24, 2553-2566, 2021 | 18 | 2021 |
Hierarchical feature disentangling network for universal domain adaptation Y Gao, P Chen, Y Gao, J Wang, Y Pan, AJ Ma Pattern Recognition 127, 108616, 2022 | 16 | 2022 |
Self-supervised temporal discriminative learning for video representation learning J Wang, Y Lin, AJ Ma, PC Yuen arXiv preprint arXiv:2008.02129, 2020 | 14 | 2020 |
Multi-scale adversarial cross-domain detection with robust discriminative learning YS Pan, AJ Ma, Y Gao, JP Wang, Y Lin Proceedings of the IEEE/CVF Winter Conference on Applications of Computer …, 2020 | 13 | 2020 |
Parrot captions teach clip to spot text Y Lin, C He, AJ Wang, B Wang, W Li, MZ Shou European Conference on Computer Vision, 368-385, 2024 | 9 | 2024 |
COSMO: COntrastive Streamlined MultimOdal Model with Interleaved Pre-Training AJ Wang, L Li, KQ Lin, J Wang, K Lin, Z Yang, L Wang, MZ Shou arXiv preprint arXiv:2401.00849, 2024 | 9 | 2024 |
Suppressing Static Visual Cues via Normalizing Flows for Self-Supervised Video Representation Learning M Zhang, J Wang, AJ Ma AAAI2022, 2021 | 8 | 2021 |
Revisiting hard example for action recognition J Wang, J Hu, S Li, Z Yuan IEEE Transactions on Circuits and Systems for Video Technology 31 (2), 546-556, 2020 | 7 | 2020 |
Enhancing Visual Grounding in Vision-Language Pre-Training With Position-Guided Text Prompts AJ Wang, P Zhou, MZ Shou, S Yan IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023 | 6 | 2023 |
MUP: Multi-granularity Unified Perception for Panoramic Activity Recognition M Cao, R Yan, X Shu, J Zhang, J Wang, GS Xie Proceedings of the 31st ACM International Conference on Multimedia, 7666-7675, 2023 | 5 | 2023 |