Folgen
Mingfei Han
Mingfei Han
MBZUAI; University of Technology Sydney; Bytedance; SIAT-MMLab
Bestätigte E-Mail-Adresse bei student.uts.edu.au - Startseite
Titel
Zitiert von
Zitiert von
Jahr
Object tracking in satellite videos by improved correlation filters with motion estimations
S Xuan, S Li, M Han, X Wan, GS Xia
IEEE Transactions on Geoscience and Remote Sensing 58 (2), 1074-1086, 2019
1432019
Mining inter-video proposal relations for video object detection
M Han, Y Wang, X Chang, Y Qiao
ECCV, 431-446, 2020
1062020
Dual-AI: Dual-path Actor Interaction Learning for Group Activity Recognition
M Han, DJ Zhang, Y Wang, R Yan, L Yao, X Chang, Y Qiao
CVPR, Oral, 2022
732022
An Efficient Spatio-Temporal Pyramid Transformer for Action Detection
Y Weng, Z Pan, M Han, X Chang, B Zhuang
ECCV 2022, 2022
382022
HTML: Hybrid Temporal-scale Multimodal Learning Framework for Referring Video Object Segmentation
M Han, Y Wang, Z Li, L Yao, X Chang, Y Qiao
ICCV, 2023
272023
Longvlm: Efficient long video understanding via large language models
Y Weng, M Han, H He, X Chang, B Zhuang
ECCV, Oral, 2024
232024
Mmvg-inf-etrol@ trecvid 2019: Activities in extended video
X Chang, W Liu, PY Huang, C Li, F Zhu, M Han, M Li, M Ma, S Hu, G Kang, ...
TREC Video Retrieval Evaluation 2019, 2019
172019
Mask propagation for efficient video semantic segmentation
Y Weng, M Han, H He, M Li, L Yao, X Chang, B Zhuang
NeurIPS, 2024
152024
Shot2Story: A New Benchmark for Comprehensive Understanding of Multi-shot Videos
M Han, L Yang, X Chang, L Yao, H Wang
ICLR, 2025
142025
Scene recognition with convolutional residual features via deep forest
M Han, S Li, X Wan, G Liu
2018 IEEE 3rd International Conference on Image, Vision and Computing (ICIVC …, 2018
62018
Progressive Frame-Proposal Mining for Weakly Supervised Video Object Detection
M Han, Y Wang, M Li, X Chang, Y Yang, Y Qiao
IEEE Transactions on Image Processing 33, 1560-1573, 2024
52024
Generating Action-conditioned Prompts for Open-vocabulary Video Action Recognition
C Jia, M Luo, X Chang, Z Dang, M Han, M Wang, G Dai, S Dang, J Wang
ACM MM, 2024
42024
Generalizable memory-driven transformer for multivariate long sequence time-series forecasting
X Zhao, R Liu, M Li, G Shi, M Han, C Li, L Chen, X Chang
arXiv preprint arXiv:2207.07827, 2022
32022
Video Recognition in Portrait Mode
M Han, L Yang, X Jin, J Feng, X Chang, H Wang
CVPR, 2024
22024
RoomTour3D: Geometry-Aware Video-Instruction Tuning for Embodied Navigation
M Han, L Ma, K Zhumakhanova, E Radionova, J Zhang, X Chang, X Liang, ...
arXiv preprint arXiv:2412.08591, 2024
2024
EACO: Enhancing Alignment in Multimodal LLMs via Critical Observation
Y Wang, M Cao, H Lin, M Han, L Ma, J Jiang, Y Cheng, X Liang
arXiv preprint arXiv:2412.04903, 2024
2024
MALMM: Multi-Agent Large Language Models for Zero-Shot Robotics Manipulation
H Singh, RJ Das, M Han, P Nakov, I Laptev
arXiv preprint arXiv:2411.17636, 2024
2024
StoryAgent: Customized Storytelling Video Generation via Multi-Agent Collaboration
P Hu, J Jiang, J Chen, M Han, S Liao, X Chang, X Liang
arXiv preprint arXiv:2411.04925, 2024
2024
Das System kann den Vorgang jetzt nicht ausführen. Versuchen Sie es später erneut.
Artikel 1–18