Lumina-T2X: Transforming Text into Any Modality, Resolution, and Duration via Flow-based Large Diffusion Transformers | P Gao, L Zhuo, Z Lin, C Liu, J Chen, R Du, E Xie, X Luo, L Qiu, Y Zhang, ... | arXiv preprint arXiv:2405.05945, 2024 | 56* | 2024 |
Generalized unbiased scene graph generation | X Lyu, L Gao, J Xie, P Zeng, Y Tian, J Shao, HT Shen | arXiv preprint arXiv:2308.04802, 2023 | 8 | 2023 |
PixWizard: Versatile image-to-image visual assistant with open-language instructions | W Lin, X Wei, R Zhang, L Zhuo, S Zhao, S Huang, J Xie, Y Qiao, P Gao, ... | arXiv preprint arXiv:2409.15278, 2024 | 3 | 2024 |
STDC‐MA network for semantic segmentation | X Lei, L Lu, Z Jiang, Z Gong, C Lu, J Liang, J Xie | IET Image Processing 16 (14), 3758-3767, 2022 | 3 | 2022 |
KinD-LCE: curve estimation and Retinex Fusion on low-light image | X Lei, W Mai, J Xie, H Liu, Z Jiang, Z Gong, C Lu, L Lu | Signal, Image and Video Processing 18 (2), 1733-1746, 2024 | 1 | 2024 |