Clip4clip: An empirical study of clip for end to end video clip retrieval and captioning H Luo, L Ji, M Zhong, Y Chen, W Lei, N Duan, T Li Neurocomputing 508, 293-304, 2022 | 568 | 2022 |
Univl: A unified video and language pre-training model for multimodal understanding and generation H Luo, L Ji, B Shi, H Huang, N Duan, T Li, J Li, T Bharti, M Zhou arXiv preprint arXiv:2002.06353, 2020 | 497 | 2020 |
Clip4clip: An empirical study of clip for end to end video clip retrieval H Luo, L Ji, M Zhong, Y Chen, W Lei, N Duan, T Li arXiv preprint arXiv:2104.08860, 2021 | 332 | 2021 |
Nüwa: Visual synthesis pre-training for neural visual world creation C Wu, J Liang, L Ji, F Yang, Y Fang, D Jiang, N Duan European conference on computer vision, 720-736, 2022 | 328 | 2022 |
Improving web search results using affinity graph B Zhang, H Li, Y Liu, L Ji, W Xi, W Fan, Z Chen, WY Ma Proceedings of the 28th annual international ACM SIGIR conference on …, 2005 | 269 | 2005 |
Godiva: Generating open-domain videos from natural descriptions C Wu, L Huang, Q Zhang, B Li, L Ji, F Yang, G Sapiro, N Duan arXiv preprint arXiv:2104.14806, 2021 | 216 | 2021 |
Taskmatrix. ai: Completing tasks by connecting foundation models with millions of apis Y Liang, C Wu, T Song, W Wu, Y Xia, Y Liu, Y Ou, S Lu, L Ji, S Mao, ... Intelligent Computing 3, 0063, 2024 | 179 | 2024 |
R-VQA: learning visual relation facts with semantic attention for visual question answering P Lu, L Ji, W Zhang, N Duan, M Zhou, J Wang Proceedings of the 24th ACM SIGKDD International Conference on Knowledge …, 2018 | 102 | 2018 |
Mist: Multi-modal iterative spatial-temporal transformer for long-form video question answering D Gao, L Zhou, L Ji, L Zhu, Y Yang, MZ Shou Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2023 | 96 | 2023 |
Knowledge Aware Semantic Concept Expansion for Image-Text Matching. B Shi, L Ji, P Lu, Z Niu, N Duan IJCAI 1, 2, 2019 | 85 | 2019 |
Xgpt: Cross-modal generative pre-training for image captioning Q Xia, H Huang, N Duan, D Zhang, L Ji, Z Sui, E Cui, T Bharti, M Zhou Natural Language Processing and Chinese Computing: 10th CCF International …, 2021 | 82 | 2021 |
Dense procedure captioning in narrated instructional videos B Shi, L Ji, Y Liang, N Duan, P Chen, Z Niu, M Zhou Proceedings of the 57th annual meeting of the association for computational …, 2019 | 81 | 2019 |
Related links recommendation J Yan, N Liu, Z Chen, L Ji, J Wang, X Liang US Patent 8,412,726, 2013 | 78 | 2013 |
Assistgpt: A general multi-modal assistant that can plan, execute, inspect, and learn D Gao, L Ji, L Zhou, KQ Lin, J Chen, Z Fan, MZ Shou arXiv preprint arXiv:2306.08640, 2023 | 76 | 2023 |
Indexing semantic user profiles for targeted advertising J Yan, N Liu, L Ji, SJ Hanks, Q Xu, Z Chen US Patent 8,533,188, 2013 | 75 | 2013 |
Microsoft concept graph: Mining semantic concepts for short text understanding L Ji, Y Wang, B Shi, D Zhang, Z Wang, J Yan Data Intelligence 1 (3), 238-270, 2019 | 68 | 2019 |
An CNN-LSTM attention approach to understanding user query intent from online health communities R Cai, B Zhu, L Ji, T Hao, J Yan, W Liu 2017 ieee international conference on data mining workshops (icdmw), 430-437, 2017 | 59 | 2017 |
Acronym disambiguation using word embedding C Li, L Ji, J Yan Proceedings of the AAAI Conference on Artificial Intelligence 29 (1), 2015 | 59 | 2015 |
GRACE: Gradient harmonized and cascaded labeling for aspect-based sentiment analysis H Luo, L Ji, T Li, N Duan, D Jiang arXiv preprint arXiv:2009.10557, 2020 | 49 | 2020 |
Sparse hidden-dynamics conditional random fields for user intent understanding Y Shen, J Yan, S Yan, L Ji, N Liu, Z Chen Proceedings of the 20th international conference on World wide web, 7-16, 2011 | 49 | 2011 |