Obserwuj
Zikai Song
Tytuł
Cytowane przez
Cytowane przez
Rok
Transformer tracking with cyclic shifting window attention
Z Song, J Yu, YPP Chen, W Yang
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2022
1742022
Compact transformer tracker with correlative masked modeling
Z Song, R Luo, J Yu, YPP Chen, W Yang
Proceedings of the AAAI conference on artificial intelligence 37 (2), 2321-2329, 2023
622023
Comprehensive dataset of broadcast soccer videos
J Yu, A Lei, Z Song, T Wang, H Cai, N Feng
2018 IEEE Conference on Multimedia Information Processing and Retrieval …, 2018
452018
SSET: a dataset for shot segmentation, event detection, player tracking in soccer videos
N Feng, Z Song, J Yu, YPP Chen, Y Zhao, Y He, T Guan
Multimedia Tools and Applications 79, 28971-28992, 2020
372020
Diffusiontrack: Diffusion model for multi-object tracking
R Luo, Z Song, L Ma, J Wei, W Yang, M Yang
Proceedings of the AAAI Conference on Artificial Intelligence 38 (5), 3991-3999, 2024
182024
Deem: Diffusion models serve as the eyes of large language models for image perception
R Luo, Y Li, L Chen, W He, TE Lin, Z Liu, L Zhang, Z Song, X Xia, T Liu, ...
arXiv preprint arXiv:2405.15232, 2024
122024
EfficientGS: Streamlining Gaussian Splatting for Large-Scale High-Resolution Scene Representation
W Liu, T Guan, B Zhu, L Ju, Z Song, D Li, Y Wang, W Yang
arXiv preprint arXiv:2404.12777, 2024
92024
MA-VLAD: a fine-grained local feature aggregation scheme for action recognition
N Feng, Y Tang, Z Song, J Yu, YPP Chen, W Yang
Multimedia Systems 30 (3), 1-13, 2024
42024
Distractor-aware tracker with a domain-special optimized benchmark for soccer player tracking
Z Song, Z Wan, W Yuan, Y Tang, J Yu, YPP Chen
Proceedings of the 2021 International Conference on Multimedia Retrieval …, 2021
42021
Autogenic language embedding for coherent point tracking
Z Song, Y Tang, R Luo, L Ma, J Yu, YPP Chen, W Yang
Proceedings of the 32nd ACM International Conference on Multimedia, 2021-2030, 2024
32024
Coupled mamba: Enhanced multi-modal fusion with coupled state space model
W Li, H Zhou, J Yu, Z Song, W Yang
arXiv preprint arXiv:2405.18014, 2024
32024
Amd: Anatomical motion diffusion with interpretable motion decomposition and fusion
B Jing, Y Zhang, Z Song, J Yu, W Yang
Proceedings of the AAAI Conference on Artificial Intelligence 38 (3), 2643-2651, 2024
32024
Progressive Text-to-Image Diffusion with Soft Latent Direction
YT Ye, J Cai, H Zhou, G Li, Y Zhang, Z Song, C Gao, J Yu, W Yang
Proceedings of the AAAI Conference on Artificial Intelligence 38 (7), 6693-6701, 2024
22024
Optimized View and Geometry Distillation from Multi-view Diffuser
Y Zhang, Z Song, J Yu, Y Luo, W Yang
arXiv preprint arXiv:2312.06198, 2023
12023
Fine-grained Appearance Transfer with Diffusion Models
Y Ye, G Li, H Zhou, C Jiale, J Yu, Y Luo, Z Song, Q Xing, Y Zhang, ...
arXiv preprint arXiv:2311.16513, 2023
12023
Fine-Grain Level Sports Video Search Engine
Z Song, J Yu, H Cai, Y Hu, YPP Chen
MultiMedia Modeling: 26th International Conference, MMM 2020, Daejeon, South …, 2020
12020
Video Anomaly Detection with Motion and Appearance Guided Patch Diffusion Model
H Zhou, J Cai, Y Ye, Y Feng, C Gao, J Yu, Z Song, W Yang
arXiv preprint arXiv:2412.09026, 2024
2024
Ref-GS: Directional Factorization for 2D Gaussian Splatting
Y Zhang, A Chen, Y Wan, Z Song, J Yu, Y Luo, W Yang
arXiv preprint arXiv:2412.00905, 2024
2024
IP-MOT: Instance Prompt Learning for Cross-Domain Multi-Object Tracking
R Luo, Z Song, L Chen, Y Li, M Yang, W Yang
arXiv preprint arXiv:2410.23907, 2024
2024
Agnostic Feature Compression with Semantic Guided Channel Importance Analysis
Y Tang, W Yang, J Yu, Z Song
2024 IEEE International Conference on Multimedia and Expo (ICME), 1-6, 2024
2024
Nie można teraz wykonać tej operacji. Spróbuj ponownie później.
Prace 1–20