Next-gpt: Any-to-any multimodal llm S Wu, H Fei, L Qu, W Ji, TS Chua Forty-first International Conference on Machine Learning, 2024 | 498 | 2024 |
Dynamic modality interaction modeling for image-text retrieval L Qu, M Liu, J Wu, Z Gao, L Nie Proceedings of the 44th International ACM SIGIR Conference on Research and …, 2021 | 168 | 2021 |
Context-aware multi-view summarization network for image-text matching L Qu, M Liu, D Cao, L Nie, Q Tian Proceedings of the 28th ACM international conference on multimedia, 1047-1055, 2020 | 151 | 2020 |
Layoutllm-t2i: Eliciting layout guidance from llm for text-to-image generation L Qu*, S Wu*, H Fei, L Nie, TS Chua Proceedings of the 31st ACM International Conference on Multimedia, 643-654, 2023 | 98 | 2023 |
Search-oriented micro-video captioning L Nie, L Qu, D Meng, M Zhang, Q Tian, AD Bimbo Proceedings of the 30th ACM international conference on multimedia, 3234-3243, 2022 | 42 | 2022 |
Self-supervised correlation learning for cross-modal retrieval Y Liu, J Wu, L Qu, T Gan, J Yin, L Nie IEEE Transactions on Multimedia 25, 2851-2863, 2022 | 40 | 2022 |
Composed image retrieval with text feedback via multi-grained uncertainty regularization Y Chen, Z Zheng, W Ji, L Qu, TS Chua International Conference on Learning Representations (ICLR), 2022 | 38 | 2022 |
Temporal anomaly detection on IIoT-enabled manufacturing P Zhan, S Wang, J Wang, L Qu, K Wang, Y Hu, X Li Journal of Intelligent Manufacturing 32, 1669-1678, 2021 | 26 | 2021 |
Iterative local-global collaboration learning towards one-shot video person re-identification M Liu, L Qu, L Nie, M Liu, L Duan, B Chen IEEE Transactions on Image Processing 29, 9360-9372, 2020 | 25 | 2020 |
Generative cross-modal retrieval: Memorizing images in multimodal language models for retrieval and beyond Y Li, W Wang, L Qu, L Nie, W Li, TS Chua arXiv preprint arXiv:2402.10805, 2024 | 15 | 2024 |
Learnable Pillar-based Re-ranking for Image-Text Retrieval L Qu, M Liu, W Wang, Z Zheng, L Nie, TS Chua Proceedings of the 46th International ACM SIGIR Conference on Research and …, 2023 | 13 | 2023 |
Discriminative probing and tuning for text-to-image generation L Qu, W Wang, Y Li, H Zhang, L Nie, TS Chua Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 9 | 2024 |
Popularity-aware distributionally robust optimization for recommendation system J Zhao, W Wang, X Lin, L Qu, J Zhang, TS Chua Proceedings of the 32nd ACM International Conference on Information and …, 2023 | 9 | 2023 |
Video-language understanding: A survey from model architecture, model training, and data perspectives T Nguyen, Y Bin, J Xiao, L Qu, Y Li, JZ Wu, CD Nguyen, SK Ng, LA Tuan arXiv preprint arXiv:2406.05615, 2024 | 7 | 2024 |
Unified text-to-image generation and retrieval L Qu, H Li, T Wang, W Wang, Y Li, L Nie, TS Chua arXiv preprint arXiv:2406.05814, 2024 | 3 | 2024 |
Automatic Pruning via Structured Lasso with Class-wise Information X Liu, M Li, X Li, L Qu, Z Peng, Y Song, Z Liu, L Jiang, J Li arXiv preprint arXiv:2502.09125, 2025 | | 2025 |
Generative Ghost: Investigating Ranking Bias Hidden in AI-Generated Videos H Gao, L Pang, S Xu, L Qu, TS Chua, H Shen, X Cheng arXiv preprint arXiv:2502.07327, 2025 | | 2025 |
SILMM: Self-Improving Large Multimodal Models for Compositional Text-to-Image Generation L Qu, H Li, W Wang, X Liu, J Li, L Nie, TS Chua arXiv preprint arXiv:2412.05818, 2024 | | 2024 |
Revolutionizing Text-to-Image Retrieval as Autoregressive Token-to-Voken Generation Y Li, H Cai, W Wang, L Qu, Y Wei, W Li, L Nie, TS Chua arXiv preprint arXiv:2407.17274, 2024 | | 2024 |