Yi Wang
Shanghai AI Laboratory
Verified email at cse.cuhk.edu.hk
Title
Cited by
Year
Videochat: Chat-centric video understanding
KC Li, Y He, Y Wang, Y Li, W Wang, P Luo, Y Wang, L Wang, Y Qiao
arXiv preprint arXiv:2305.06355, 2023
Cited by: 585, Year: 2023
Image inpainting via generative multi-column convolutional neural networks
Y Wang, X Tao, X Qi, X Shen, J Jia
Advances in Neural Information Processing Systems, 331-340, 2018
Cited by: 417, Year: 2018
Videomae v2: Scaling video masked autoencoders with dual masking
L Wang, B Huang, Z Zhao, Z Tong, Y He, Y Wang, Y Wang, Y Qiao
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
Cited by: 382, Year: 2023
Mat: Mask-aware transformer for large hole image inpainting
W Li, Z Lin, K Zhou, L Qi, Y Wang, J Jia
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2022
Cited by: 375, Year: 2022
InternVideo: general video foundation models via generative and discriminative learning
Y Wang, K Li, Y Li, Y He, B Huang, Z Zhao, H Zhang, J Xu, Y Liu, Z Wang, ...
arXiv preprint arXiv:2212.03191, 2022
Cited by: 328, Year: 2022
Mvbench: A comprehensive multi-modal video understanding benchmark
K Li, Y Wang, Y He, Y Li, Y Wang, Y Liu, Z Wang, J Xu, G Chen, P Luo, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
Cited by: 252, Year: 2024
Lavie: High-quality video generation with cascaded latent diffusion models
Y Wang, X Chen, X Ma, S Zhou, Z Huang, Y Wang, C Yang, Y He, J Yu, ...
International Journal of Computer Vision, 1-20, 2024
Cited by: 220, Year: 2024
Internvid: A large-scale video-text dataset for multimodal understanding and generation
Y Wang, Y He, Y Li, K Li, J Yu, X Ma, X Li, G Chen, X Chen, Y Wang, C He, ...
arXiv preprint arXiv:2307.06942, 2023
Cited by: 218, Year: 2023
Videomamba: State space model for efficient video understanding
K Li, X Li, Y Wang, Y He, Y Wang, L Wang, Y Qiao
European Conference on Computer Vision, 237-255, 2024
Cited by: 147, Year: 2024
Unmasked teacher: Towards training-efficient video foundation models
K Li, Y Wang, Y Li, Y Wang, Y He, L Wang, Y Qiao
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
Cited by: 144, Year: 2023
Wide-context semantic image extrapolation
Y Wang, X Tao, X Shen, J Jia
Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2019
Cited by: 138, Year: 2019
Uniformerv2: Spatiotemporal learning by arming image vits with video uniformer
K Li, Y Wang, Y He, Y Li, Y Wang, L Wang, Y Qiao
arXiv preprint arXiv:2211.09552, 2022
Cited by: 128, Year: 2022
Fast visual object counting via example-based density estimation
Y Wang, Y Zou
2016 IEEE international conference on image processing (ICIP), 3653-3657, 2016
Cited by: 125, Year: 2016
Internvideo2: Scaling foundation models for multimodal video understanding
Y Wang, K Li, X Li, J Yu, Y He, G Chen, B Pei, R Zheng, Z Wang, Y Shi, ...
European Conference on Computer Vision, 396-416, 2024
Cited by: 118, Year: 2024
Towards implicit text-guided 3d shape generation
Z Liu, Y Wang, X Qi, CW Fu
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
Cited by: 108, Year: 2022
VCNet: a robust approach to blind image inpainting
Y Wang, YC Chen, X Tao, J Jia
European Conference on Computer Vision, 2020
Cited by: 95, Year: 2020
Learning open-vocabulary semantic segmentation models from natural language supervision
J Xu, J Hou, Y Zhang, R Feng, Y Wang, Y Qiao, W Xie
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2023
Cited by: 92, Year: 2023
Interngpt: Solving vision-centric tasks by interacting with chatgpt beyond language
Z Liu, Y He, W Wang, W Wang, Y Wang, S Chen, Q Zhang, Z Lai, Y Yang, ...
arXiv preprint arXiv:2305.05662, 2023
Cited by: 89, Year: 2023
Classifying digestive organs in wireless capsule endoscopy images based on deep convolutional neural network
Y Zou, L Li, Y Wang, J Yu, Y Li, WJ Deng
2015 IEEE International Conference on Digital Signal Processing (DSP), 1274-1278, 2015
Cited by: 87, Year: 2015
Videollm: Modeling video sequence with large language models
G Chen, YD Zheng, J Wang, J Xu, Y Huang, J Pan, Y Wang, Y Wang, ...
arXiv preprint arXiv:2305.13292, 2023
Cited by: 83, Year: 2023
Articles 1–20