Object tracking in satellite videos by improved correlation filters with motion estimations S Xuan, S Li, M Han, X Wan, GS Xia IEEE Transactions on Geoscience and Remote Sensing 58 (2), 1074-1086, 2019 | 143 | 2019 |
Mining inter-video proposal relations for video object detection M Han, Y Wang, X Chang, Y Qiao ECCV, 431-446, 2020 | 106 | 2020 |
Dual-AI: Dual-path Actor Interaction Learning for Group Activity Recognition M Han, DJ Zhang, Y Wang, R Yan, L Yao, X Chang, Y Qiao CVPR, Oral, 2022 | 73 | 2022 |
An Efficient Spatio-Temporal Pyramid Transformer for Action Detection Y Weng, Z Pan, M Han, X Chang, B Zhuang ECCV, 2022 | 38 | 2022 |
HTML: Hybrid Temporal-scale Multimodal Learning Framework for Referring Video Object Segmentation M Han, Y Wang, Z Li, L Yao, X Chang, Y Qiao ICCV, 2023 | 27 | 2023 |
LongVLM: Efficient long video understanding via large language models Y Weng, M Han, H He, X Chang, B Zhuang ECCV, Oral, 2024 | 23 | 2024 |
MMVG-INF-Etrol@TRECVID 2019: Activities in extended video X Chang, W Liu, PY Huang, C Li, F Zhu, M Han, M Li, M Ma, S Hu, G Kang, ... TREC Video Retrieval Evaluation 2019, 2019 | 17 | 2019 |
Mask propagation for efficient video semantic segmentation Y Weng, M Han, H He, M Li, L Yao, X Chang, B Zhuang NeurIPS, 2024 | 15 | 2024 |
Shot2Story: A New Benchmark for Comprehensive Understanding of Multi-shot Videos M Han, L Yang, X Chang, L Yao, H Wang ICLR, 2025 | 14 | 2025 |
Scene recognition with convolutional residual features via deep forest M Han, S Li, X Wan, G Liu 2018 IEEE 3rd International Conference on Image, Vision and Computing (ICIVC …, 2018 | 6 | 2018 |
Progressive Frame-Proposal Mining for Weakly Supervised Video Object Detection M Han, Y Wang, M Li, X Chang, Y Yang, Y Qiao IEEE Transactions on Image Processing 33, 1560-1573, 2024 | 5 | 2024 |
Generating Action-conditioned Prompts for Open-vocabulary Video Action Recognition C Jia, M Luo, X Chang, Z Dang, M Han, M Wang, G Dai, S Dang, J Wang ACM MM, 2024 | 4 | 2024 |
Generalizable memory-driven transformer for multivariate long sequence time-series forecasting X Zhao, R Liu, M Li, G Shi, M Han, C Li, L Chen, X Chang arXiv preprint arXiv:2207.07827, 2022 | 3 | 2022 |
Video Recognition in Portrait Mode M Han, L Yang, X Jin, J Feng, X Chang, H Wang CVPR, 2024 | 2 | 2024 |
RoomTour3D: Geometry-Aware Video-Instruction Tuning for Embodied Navigation M Han, L Ma, K Zhumakhanova, E Radionova, J Zhang, X Chang, X Liang, ... arXiv preprint arXiv:2412.08591, 2024 | | 2024 |
EACO: Enhancing Alignment in Multimodal LLMs via Critical Observation Y Wang, M Cao, H Lin, M Han, L Ma, J Jiang, Y Cheng, X Liang arXiv preprint arXiv:2412.04903, 2024 | | 2024 |
MALMM: Multi-Agent Large Language Models for Zero-Shot Robotics Manipulation H Singh, RJ Das, M Han, P Nakov, I Laptev arXiv preprint arXiv:2411.17636, 2024 | | 2024 |
StoryAgent: Customized Storytelling Video Generation via Multi-Agent Collaboration P Hu, J Jiang, J Chen, M Han, S Liao, X Chang, X Liang arXiv preprint arXiv:2411.04925, 2024 | | 2024 |