Transformer tracking with cyclic shifting window attention Z Song, J Yu, YPP Chen, W Yang Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2022 | 174 | 2022 |
Compact transformer tracker with correlative masked modeling Z Song, R Luo, J Yu, YPP Chen, W Yang Proceedings of the AAAI conference on artificial intelligence 37 (2), 2321-2329, 2023 | 62 | 2023 |
Comprehensive dataset of broadcast soccer videos J Yu, A Lei, Z Song, T Wang, H Cai, N Feng 2018 IEEE Conference on Multimedia Information Processing and Retrieval …, 2018 | 45 | 2018 |
SSET: a dataset for shot segmentation, event detection, player tracking in soccer videos N Feng, Z Song, J Yu, YPP Chen, Y Zhao, Y He, T Guan Multimedia Tools and Applications 79, 28971-28992, 2020 | 37 | 2020 |
DiffusionTrack: Diffusion model for multi-object tracking R Luo, Z Song, L Ma, J Wei, W Yang, M Yang Proceedings of the AAAI Conference on Artificial Intelligence 38 (5), 3991-3999, 2024 | 18 | 2024 |
DEEM: Diffusion models serve as the eyes of large language models for image perception R Luo, Y Li, L Chen, W He, TE Lin, Z Liu, L Zhang, Z Song, X Xia, T Liu, ... arXiv preprint arXiv:2405.15232, 2024 | 12 | 2024 |
EfficientGS: Streamlining Gaussian Splatting for Large-Scale High-Resolution Scene Representation W Liu, T Guan, B Zhu, L Ju, Z Song, D Li, Y Wang, W Yang arXiv preprint arXiv:2404.12777, 2024 | 9 | 2024 |
MA-VLAD: a fine-grained local feature aggregation scheme for action recognition N Feng, Y Tang, Z Song, J Yu, YPP Chen, W Yang Multimedia Systems 30 (3), 1-13, 2024 | 4 | 2024 |
Distractor-aware tracker with a domain-special optimized benchmark for soccer player tracking Z Song, Z Wan, W Yuan, Y Tang, J Yu, YPP Chen Proceedings of the 2021 International Conference on Multimedia Retrieval …, 2021 | 4 | 2021 |
Autogenic language embedding for coherent point tracking Z Song, Y Tang, R Luo, L Ma, J Yu, YPP Chen, W Yang Proceedings of the 32nd ACM International Conference on Multimedia, 2021-2030, 2024 | 3 | 2024 |
Coupled Mamba: Enhanced multi-modal fusion with coupled state space model W Li, H Zhou, J Yu, Z Song, W Yang arXiv preprint arXiv:2405.18014, 2024 | 3 | 2024 |
AMD: Anatomical motion diffusion with interpretable motion decomposition and fusion B Jing, Y Zhang, Z Song, J Yu, W Yang Proceedings of the AAAI Conference on Artificial Intelligence 38 (3), 2643-2651, 2024 | 3 | 2024 |
Progressive Text-to-Image Diffusion with Soft Latent Direction YT Ye, J Cai, H Zhou, G Li, Y Zhang, Z Song, C Gao, J Yu, W Yang Proceedings of the AAAI Conference on Artificial Intelligence 38 (7), 6693-6701, 2024 | 2 | 2024 |
Optimized View and Geometry Distillation from Multi-view Diffuser Y Zhang, Z Song, J Yu, Y Luo, W Yang arXiv preprint arXiv:2312.06198, 2023 | 1 | 2023 |
Fine-grained Appearance Transfer with Diffusion Models Y Ye, G Li, H Zhou, C Jiale, J Yu, Y Luo, Z Song, Q Xing, Y Zhang, ... arXiv preprint arXiv:2311.16513, 2023 | 1 | 2023 |
Fine-Grain Level Sports Video Search Engine Z Song, J Yu, H Cai, Y Hu, YPP Chen MultiMedia Modeling: 26th International Conference, MMM 2020, Daejeon, South …, 2020 | 1 | 2020 |
Video Anomaly Detection with Motion and Appearance Guided Patch Diffusion Model H Zhou, J Cai, Y Ye, Y Feng, C Gao, J Yu, Z Song, W Yang arXiv preprint arXiv:2412.09026, 2024 | | 2024 |
Ref-GS: Directional Factorization for 2D Gaussian Splatting Y Zhang, A Chen, Y Wan, Z Song, J Yu, Y Luo, W Yang arXiv preprint arXiv:2412.00905, 2024 | | 2024 |
IP-MOT: Instance Prompt Learning for Cross-Domain Multi-Object Tracking R Luo, Z Song, L Chen, Y Li, M Yang, W Yang arXiv preprint arXiv:2410.23907, 2024 | | 2024 |
Agnostic Feature Compression with Semantic Guided Channel Importance Analysis Y Tang, W Yang, J Yu, Z Song 2024 IEEE International Conference on Multimedia and Expo (ICME), 1-6, 2024 | | 2024 |