Ziyang Wang
Title
Cited by
Year
A simple llm framework for long-range video question-answering
C Zhang, T Lu, MM Islam, Z Wang, S Yu, M Bansal, G Bertasius
arXiv preprint arXiv:2312.17235, 2023
Cited by: 60, Year: 2023
Unified coarse-to-fine alignment for video-text retrieval
Z Wang, YL Sung, F Cheng, G Bertasius, M Bansal
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
Cited by: 47, Year: 2023
VideoTree: Adaptive Tree-based Video Representation for LLM Reasoning on Long Videos
Z Wang, S Yu, E Stengel-Eskin, J Yoon, F Cheng, G Bertasius, M Bansal
arXiv preprint arXiv:2405.19209, 2024
Cited by: 21, Year: 2024
Language-augmented pixel embedding for generalized zero-shot learning
Z Wang, Y Gou, J Li, L Zhu, HT Shen
IEEE Transactions on Circuits and Systems for Video Technology 33 (3), 1019-1030, 2022
Cited by: 20, Year: 2022
Region semantically aligned network for zero-shot learning
Z Wang, Y Gou, J Li, Y Zhang, Y Yang
Proceedings of the 30th ACM International Conference on Information …, 2021
Cited by: 12, Year: 2021
DAM: Dynamic Adapter Merging for Continual Video QA Learning
F Cheng, Z Wang, YL Sung, YB Lin, M Bansal, G Bertasius
arXiv preprint arXiv:2403.08755, 2024
Cited by: 4, Year: 2024
Unified embeddings for multimodal retrieval via frozen LLMs
Z Wang, H Elfardy, M Dreyer, K Small, M Bansal
Findings of the Association for Computational Linguistics: EACL 2024, 1537-1547, 2024
Cited by: 4, Year: 2024
TimeRefine: Temporal Grounding with Time Refining Video LLM
X Wang, F Cheng, Z Wang, H Wang, MM Islam, L Torresani, M Bansal, ...
arXiv preprint arXiv:2412.09601, 2024
Year: 2024