Shiwei Zhang
Alibaba Group
Verified email at alibaba-inc.com
Title
Cited by
Year
ModelScope text-to-video technical report
J Wang, H Yuan, D Chen, Y Zhang, X Wang, S Zhang
arXiv preprint arXiv:2308.06571, 2023
Cited by: 342, Year: 2023
VideoComposer: Compositional video synthesis with motion controllability
X Wang, H Yuan, S Zhang, D Chen, J Wang, Y Zhang, Y Shen, D Zhao, ...
Advances in Neural Information Processing Systems 36, 7594-7611, 2023
Cited by: 288, Year: 2023
End-to-end temporal action detection with transformer
X Liu, Q Wang, Y Hu, X Tang, S Zhang, S Bai, X Bai
IEEE Transactions on Image Processing 31, 5427-5441, 2022
Cited by: 270, Year: 2022
TCTrack: Temporal contexts for aerial tracking
Z Cao, Z Huang, L Pan, S Zhang, Z Liu, C Fu
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2022
Cited by: 188, Year: 2022
I2VGen-XL: High-quality image-to-video synthesis via cascaded diffusion models
S Zhang, J Wang, Y Zhang, K Zhao, H Yuan, Z Qin, X Wang, D Zhao, ...
arXiv preprint arXiv:2311.04145, 2023
Cited by: 171, Year: 2023
OadTR: Online action detection with transformers
X Wang, S Zhang, Z Qing, Y Shao, Z Zuo, C Gao, N Sang
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021
Cited by: 143, Year: 2021
Hybrid relation guided set matching for few-shot action recognition
X Wang, S Zhang, Z Qing, M Tang, Z Zuo, C Gao, R Jin, N Sang
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2022
Cited by: 108, Year: 2022
TACNet: Transition-aware context network for spatio-temporal action detection
L Song, S Zhang, G Yu, H Sun
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2019
Cited by: 107, Year: 2019
DreamVideo: Composing your dream videos with customized subject and motion
Y Wei, S Zhang, Z Qing, H Yuan, Z Liu, Y Liu, Y Zhang, J Zhou, H Shan
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
Cited by: 85, Year: 2024
Self-supervised learning for semi-supervised temporal action proposal
X Wang, S Zhang, Z Qing, Y Shao, C Gao, N Sang
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021
Cited by: 80, Year: 2021
MoLo: Motion-augmented Long-short Contrastive Learning for Few-shot Action Recognition
X Wang, S Zhang, Z Qing, C Gao, Y Zhang, D Zhao, N Sang
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
Cited by: 69, Year: 2023
TAda! Temporally-Adaptive Convolutions for Video Understanding
Z Huang, S Zhang, L Pan, Z Qing, M Tang, Z Liu, MH Ang Jr
International Conference on Learning Representations, 2022
Cited by: 68, Year: 2022
MAR: Masked Autoencoders for Efficient Action Recognition
Z Qing, S Zhang, Z Huang, X Wang, Y Wang, Y Lv, C Gao, N Sang
IEEE Transactions on Multimedia 26, 218-233, 2023
Cited by: 62, Year: 2023
CLIP-guided prototype modulating for few-shot action recognition
X Wang, S Zhang, J Cen, C Gao, Y Zhang, D Zhao, N Sang
International Journal of Computer Vision, 2023
Cited by: 58, Year: 2023
DreamTalk: When expressive talking head generation meets diffusion probabilistic models
Y Ma, S Zhang, J Wang, X Wang, Y Zhang, Z Deng
arXiv preprint arXiv:2312.09767, 2023
Cited by: 53, Year: 2023
Support-set based cross-supervision for video grounding
X Ding, N Wang, S Zhang, D Cheng, X Li, Z Huang, M Tang, X Gao
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021
Cited by: 50, Year: 2021
VideoLCM: Video latent consistency model
X Wang, S Zhang, H Zhang, Y Liu, Y Zhang, C Gao, N Sang
arXiv preprint arXiv:2312.09109, 2023
Cited by: 43, Year: 2023
Towards real-world visual tracking with temporal contexts
Z Cao, Z Huang, L Pan, S Zhang, Z Liu, C Fu
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023
Cited by: 43, Year: 2023
GLNet: Global local network for weakly supervised action localization
S Zhang, L Song, C Gao, N Sang
IEEE Transactions on Multimedia 22 (10), 2610-2622, 2019
Cited by: 43, Year: 2019
RLIPv2: Fast scaling of relational language-image pre-training
H Yuan, S Zhang, X Wang, S Albanie, Y Pan, T Feng, J Jiang, D Ni, ...
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
Cited by: 36, Year: 2023