yatai ji

Cited by

	All	Since 2020
Citations	253	253
h-index	6	6
i10-index	6	6

Citations per year
Year	2022	2023	2024	2025
Citations	1	43	183	26

Public access

6 articles available
0 articles not available

Based on funding mandates

Co-authors

Yujiu Yang - SIGS@Tsinghua University - Verified email at sz.tsinghua.edu.cn
Junjie Wang - A Postdoctoral Fellow at the Tsinghua University (THU) - Verified email at toki.waseda.jp
Yong Liu - Tsinghua University - Verified email at mails.tsinghua.edu.cn
Lin Zhang - Symbiosis-X Technology Inc. - Verified email at albany.edu
Rong-Cheng Tu - Nanyang Technological University - Verified email at ntu.edu.sg
Wei Liu, IEEE/IAPR/IMA Fellow - Distinguished Scientist, Tencent - Verified email at ee.columbia.edu

yatai ji

The University of Hong Kong

Verified email at connect.hku.hk

multi-modal


Title	Authors	Venue	Cited by	Year
Control-a-video: Controllable text-to-video generation with diffusion models	W Chen, Y Ji, J Wu, H Wu, P Xie, J Li, X Xia, X Xiao, L Lin	arXiv preprint arXiv:2305.13840, 2023	122	2023
MAP: Multimodal Uncertainty-Aware Vision-Language Pre-training Model	Y Ji, J Wang, Y Gong, L Zhang, Y Zhu, H Wang, J Zhang, T Sakai, ...	Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023	40	2023
Bridging the gap: A unified video comprehension framework for moment retrieval and highlight detection	Y Xiao, Z Luo, Y Liu, Y Ma, H Bian, Y Ji, Y Yang, X Li	Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024	31	2024
Mirtt: Learning multimodal interaction representations from trilinear transformers for visual question answering	J Wang, Y Ji, J Sun, Y Yang, T Sakai	Findings of the Association for Computational Linguistics: EMNLP 2021, 2280-2292, 2021	19	2021
Multimodal prototype-enhanced network for few-shot action recognition	X Ni, Y Liu, H Wen, Y Ji, J Xiao, Y Yang	Proceedings of the 2024 International Conference on Multimedia Retrieval, 1-10, 2024	15	2024
Seeing What You Miss: Vision-Language Pre-training with Semantic Completion Learning	Y Ji, R Tu, J Jiang, W Kong, C Cai, W Zhao, H Wang, Y Yang, W Liu	Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023	14*	2023
Ida-vlm: Towards movie understanding via id-aware large vision-language model	Y Ji, S Zhang, J Wu, P Sun, W Chen, X Xiao, S Yang, Y Yang, P Luo	arXiv preprint arXiv:2407.07577, 2024	2	2024
PIN: A Knowledge-Intensive Dataset for Paired and Interleaved Multimodal Documents	J Wang, Y Zhang, Y Ji, Y Zhang, C Jiang, Y Wang, K Zhu, Z Wang, ...	arXiv preprint arXiv:2406.13923, 2024	2	2024
Similarity Transitivity Broken-Aware Multi-Modal Hashing	RC Tu, XL Mao, J Liu, Y Ji, W Wei, H Huang	IEEE Transactions on Knowledge and Data Engineering, 2024	2	2024
3D face reconstruction system from a single photo based on regression neural network	Y Ji, K Li, H Wu, G Xiong, Z Shen, X Shang, B Xi	IFAC-PapersOnLine 53 (5), 71-76, 2020	2	2020
Onlinevpo: Align video diffusion model with online video-centric preference optimization	J Zhang, J Wu, W Chen, Y Ji, X Xiao, W Huang, K Han	arXiv preprint arXiv:2412.15159, 2024	1	2024
Taming Lookup Tables for Efficient Image Retouching	S Yang, B Huang, M Cao, Y Ji, H Guo, N Wong, Y Yang	European Conference on Computer Vision, 144-159, 2024	1	2024
Modeling Multimodal Uncertainties via Probability Distribution Encoders Included Vision-Language Models	J Wang, Y Ji, Y Zhang, Y Zhu, T Sakai	IEEE Access, 2023	1	2023
Global and Local Semantic Completion Learning for Vision-Language Pre-training	RC Tu, Y Ji, J Jiang, W Kong, C Cai, W Zhao, H Wang, Y Yang, W Liu	arXiv preprint arXiv:2306.07096, 2023	1	2023
Prompt-A-Video: Prompt Your Video Diffusion Model via Preference-Aligned LLM	Y Ji, J Zhang, J Wu, S Zhang, S Chen, C GE, P Sun, W Chen, W Shao, ...	arXiv preprint arXiv:2412.15156, 2024		2024


