Internvideo: General video foundation models via generative and discriminative learning Y Wang, K Li, Y Li, Y He, B Huang, Z Zhao, H Zhang, J Xu, Y Liu, Z Wang, ... arXiv preprint arXiv:2212.03191, 2022 | 327 | 2022 |
Hybrid models for open set recognition H Zhang, A Li, J Guo, Y Guo Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23 …, 2020 | 201 | 2020 |
Internvideo2: Scaling foundation models for multimodal video understanding Y Wang, K Li, X Li, J Yu, Y He, G Chen, B Pei, R Zheng, Z Wang, Y Shi, ... European Conference on Computer Vision, 396-416, 2024 | 118 | 2024 |
Internvideo-ego4d: A pack of champion solutions to ego4d challenges G Chen, S Xing, Z Chen, Y Wang, K Li, Y Li, Y Liu, J Wang, YD Zheng, ... arXiv preprint arXiv:2211.09529, 2022 | 43 | 2022 |
Real-time vision-based system of fault detection for freight trains Y Zhang, M Liu, Y Chen, H Zhang, Y Guo IEEE Transactions on Instrumentation and Measurement 69 (7), 5274-5284, 2019 | 34 | 2019 |
EgoExoLearn: A Dataset for Bridging Asynchronous Ego-and Exo-centric View of Procedural Activities in Real World Y Huang, G Chen, J Xu, M Zhang, L Yang, B Pei, H Zhang, L Dong, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 23 | 2024 |
Correlation-preserving photo collage L Liu, H Zhang, G Jing, Y Guo, Z Chen, W Wang IEEE transactions on visualization and computer graphics 24 (6), 1956-1968, 2017 | 22 | 2017 |
Improving open set domain adaptation using image-to-image translation H Zhang, A Li, X Han, Z Chen, Y Zhang, Y Guo 2019 IEEE International Conference on Multimedia and Expo (ICME), 1258-1263, 2019 | 18 | 2019 |
Movqa: A benchmark of versatile question-answering for long-form movie understanding H Zhang, Y Liu, L Dong, Y Huang, ZH Ling, Y Wang, L Wang, Y Qiao arXiv preprint arXiv:2312.04817, 2023 | 17 | 2023 |
Viewpoint assessment and recommendation for photographing architectures J He, L Wang, W Zhou, H Zhang, X Cui, Y Guo IEEE transactions on visualization and computer graphics 25 (8), 2636-2649, 2018 | 8 | 2018 |
Learning Discriminative Feature Representation for Open Set Action Recognition H Zhang, Y Liu, Y Wang, L Wang, Y Qiao Proceedings of the 31st ACM International Conference on Multimedia, 7696-7705, 2023 | 4 | 2023 |
Matching Compound Prototypes for Few-Shot Action Recognition Y Huang, L Yang, G Chen, H Zhang, F Lu, Y Sato International Journal of Computer Vision, 1-26, 2024 | 3 | 2024 |
Elastic temporal alignment for few‐shot action recognition F Pan, C Xu, H Zhang, J Guo, Y Guo IET Computer Vision 17 (1), 39-50, 2023 | 2 | 2023 |
Viewpoint Selection for Taking a good Photograph of Architecture. J He, W Zhou, L Wang, H Zhang, Y Guo, E Grinspun, B Bickel, Y Dobashi PG (Short Papers), 39-44, 2016 | 1 | 2016 |
Internvideo2: Scaling foundation models for multimodal video understanding Y Wang, K Li, X Li, J Yu, Y He, G Chen, B Pei, R Zheng, Z Wang, Y Shi, ... European Conference on Computer Vision, 396-416, 2025 | | 2025 |
Improving Open Set Domain Adaptation Using Image-to-Image Translation and Instance-Weighted Adversarial Learning HJ Zhang, A Li, J Guo, YW Guo Journal of Computer Science and Technology 38 (3), 644-658, 2023 | | 2023 |
Supplementary File of InternVideo2: Scaling Foundation Models for Multimodal Video Understanding Y Wang, K Li, X Li, J Yu, Y He, G Chen, B Pei, R Zheng, Z Wang, Y Shi, ... | | |