Xiaoshi Wu
Ph.D. candidate, CUHK
Verified email at link.cuhk.edu.hk - Homepage
Title
Cited by
Year
Human preference score v2: A solid benchmark for evaluating human preferences of text-to-image synthesis
X Wu, Y Hao, K Sun, Y Chen, F Zhu, R Zhao, H Li
arXiv preprint arXiv:2306.09341, 2023
Cited by: 164, Year: 2023
Uni-Perceiver: Pre-training unified architecture for generic perception for zero-shot and few-shot tasks
X Zhu, J Zhu, H Li, X Wu, H Li, X Wang, J Dai
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
Cited by: 136, Year: 2022
CORA: Adapting CLIP for open-vocabulary detection with region prompting and anchor pre-matching
X Wu, F Zhu, R Zhao, H Li
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
Cited by: 128, Year: 2023
Better aligning text-to-image models with human preference
X Wu, K Sun, F Zhu, R Zhao, H Li
arXiv preprint arXiv:2303.14420 1 (3), 2023
Cited by: 99, Year: 2023
Human preference score: Better aligning text-to-image models with human preference
X Wu, K Sun, F Zhu, R Zhao, H Li
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
Cited by: 98, Year: 2023
JourneyDB: A benchmark for generative image understanding
K Sun, J Pan, Y Ge, H Li, H Duan, X Wu, R Zhang, A Zhou, Z Qin, Y Wang, ...
Advances in Neural Information Processing Systems 36, 2024
Cited by: 68, Year: 2024
Towers of Babel: Combining images, language, and 3D geometry for learning multimodal vision
X Wu, H Averbuch-Elor, J Sun, N Snavely
Proceedings of the IEEE/CVF International Conference on Computer Vision, 428-437, 2021
Cited by: 20, Year: 2021
CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching
D Jiang, G Song, X Wu, R Zhang, D Shen, Z Zong, Y Liu, H Li
arXiv preprint arXiv:2404.03653, 2024
Cited by: 11, Year: 2024
Deep reward supervisions for tuning text-to-image diffusion models
X Wu, Y Hao, M Zhang, K Sun, Z Huang, G Song, Y Liu, H Li
European Conference on Computer Vision, 108-124, 2024
Cited by: 7, Year: 2024
ECNet: Effective controllable text-to-image diffusion models
S Li, K Sun, Z Lai, X Wu, F Qiu, H Xie, K Miyata, H Li
arXiv preprint arXiv:2403.18417, 2024
Cited by: 5, Year: 2024
Be-Your-Outpainter: Mastering video outpainting through input-specific adaptation
FY Wang, X Wu, Z Huang, X Shi, D Shen, G Song, Y Liu, H Li
European Conference on Computer Vision, 153-168, 2024
Cited by: 4, Year: 2024
Articles 1–11