Initno: Boosting text-to-image diffusion models via initial noise optimization X Guo, J Liu, M Cui, J Li, H Yang, D Huang Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 15 | 2024 |
MGMP: Multimodal graph message propagation network for event detection J Li, Y Wang, W Li International Conference on Multimedia Modeling, 141-153, 2022 | 6 | 2022 |
MHRN: A multimodal hierarchical reasoning network for topic detection J Li, Y Wang, W Li IEEE Transactions on Multimedia 26, 6968-6980, 2024 | 3 | 2024 |
Zero-shot scene graph generation via triplet calibration and reduction J Li, Y Wang, W Li ACM Transactions on Multimedia Computing, Communications and Applications 20 …, 2023 | 3 | 2023 |
Leveraging predicate and triplet learning for scene graph generation J Li, Y Wang, X Guo, R Yang, W Li Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 2 | 2024 |
Entity Relation Fusion for Real-Time One-Stage Referring Expression Comprehension H Yu, W Li, J Li, Y Du Proceedings of the 3rd ACM International Conference on Multimedia in Asia, 1-8, 2021 | | 2021 |