Zun Wang
Verified email at cs.unc.edu
Title
Cited by
InternVideo: General video foundation models via generative and discriminative learning
Y Wang, K Li, Y Li, Y He, B Huang, Z Zhao, H Zhang, J Xu, Y Liu, Z Wang, ...
arXiv preprint arXiv:2212.03191, 2022
Cited by: 330, Year: 2022
MVBench: A comprehensive multi-modal video understanding benchmark
K Li, Y Wang, Y He, Y Li, Y Wang, Y Liu, Z Wang, J Xu, G Chen, P Luo, ...
CVPR 2024, 2024
Cited by: 259, Year: 2024
InternVideo2: Scaling video foundation models for multimodal video understanding
Y Wang, K Li, X Li, J Yu, Y He, G Chen, B Pei, R Zheng, J Xu, Z Wang, ...
ECCV 2024, 2024
Cited by: 119*, Year: 2024
Bridging the Gap Between Learning in Discrete and Continuous Environments for Vision-and-Language Navigation
Y Hong*, Z Wang*, Q Wu, S Gould
CVPR 2022, 2022
Cited by: 73, Year: 2022
Scaling Data Generation in Vision-and-Language Navigation
Z Wang, J Li, Y Hong, Y Wang, Q Wu, M Bansal, S Gould, H Tan, Y Qiao
ICCV 2023, 2023
Cited by: 61, Year: 2023
ETPNav: Evolving topological planning for vision-language navigation in continuous environments
D An, H Wang, W Wang, Z Wang, Y Huang, K He, L Wang
TPAMI 2024, 2024
Cited by: 53, Year: 2024
InternVideo-Ego4D: A pack of champion solutions to Ego4D challenges
G Chen, S Xing, Z Chen, Y Wang, K Li, Y Li, Y Liu, J Wang, YD Zheng, ...
arXiv preprint arXiv:2211.09529, 2022
Cited by: 43, Year: 2022
Vision-and-language navigation today and tomorrow: A survey in the era of foundation models
Y Zhang*, Z Ma*, J Li*, Y Qiao*, Z Wang*, J Chai, Q Wu, M Bansal, ...
TMLR 2024, 2024
Cited by: 14, Year: 2024
NavGPT-2: Unleashing navigational reasoning capability for large vision-language models
G Zhou, Y Hong, Z Wang, XE Wang, Q Wu
ECCV 2024, 2024
Cited by: 14, Year: 2024
1st Place Solutions for RxR-Habitat Vision-and-Language Navigation Competition (CVPR 2022)
D An*, Z Wang*, Y Li, Y Wang, Y Hong, Y Huang, L Wang, J Shao
arXiv preprint arXiv:2206.11610, 2022
Cited by: 11, Year: 2022
SAME: Learning Generic Language-Guided Visual Navigation with State-Adaptive Mixture of Experts
G Zhou, Y Hong, Z Wang, C Zhao, M Bansal, Q Wu
arXiv preprint arXiv:2412.05552, 2024
Cited by: 1, Year: 2024
DreamRunner: Fine-Grained Storytelling Video Generation with Retrieval-Augmented Motion Adaptation
Z Wang, J Li, H Lin, J Yoon, M Bansal
arXiv preprint arXiv:2411.16657, 2024
Cited by: 1, Year: 2024
Bootstrapping Language-Guided Navigation Learning with Self-Refining Data Flywheel
Z Wang, J Li, Y Hong, S Li, K Li, S Yu, Y Wang, Y Qiao, Y Wang, M Bansal, ...
ICLR 2025, 2024
Year: 2024