Følg
Junjie Zhou
Tittel
Sitert av
Sitert av
År
Mlvu: A comprehensive benchmark for multi-task long video understanding
J Zhou, Y Shu, B Zhao, B Wu, S Xiao, X Yang, Y Xiong, B Zhang, T Huang, ...
arXiv preprint arXiv:2406.04264, 2024
702024
Omnigen: Unified image generation
S Xiao, Y Wang, J Zhou, H Yuan, X Xing, R Yan, S Wang, T Huang, Z Liu
arXiv preprint arXiv:2409.11340, 2024
282024
Docdiff: Document enhancement via residual diffusion models
Z Yang, B Liu, Y Xxiong, L Yi, G Wu, X Tang, Z Liu, J Zhou, X Zhang
Proceedings of the 31st ACM international conference on multimedia, 2795-2806, 2023
272023
Video-xl: Extra-long vision language model for hour-scale video understanding
Y Shu, P Zhang, Z Liu, M Qin, J Zhou, T Huang, B Zhao
arXiv preprint arXiv:2409.14485, 2024
152024
FAT: Field-Aware Transformer for Point Cloud Segmentation With Adaptive Attention Fields
J Zhou, B Liu, Y Xiong, C Chiu, F Liu, X Gong
IEEE Transactions on Industrial Informatics, 2024
15*2024
VISTA: Visualized Text Embedding For Universal Multi-Modal Retrieval
J Zhou, Z Liu, S Xiao, B Zhao, Y Xiong
The 62nd Annual Meeting of the Association for Computational Linguistics …, 2024
142024
Textdiff: Mask-guided residual diffusion models for scene text image super-resolution
B Liu, Z Yang, P Wang, J Zhou, Z Liu, Z Song, Y Liu, Y Xiong
arXiv preprint arXiv:2308.06743, 2023
72023
MegaPairs: Massive Data Synthesis For Universal Multimodal Retrieval
J Zhou, Z Liu, Z Liu, S Xiao, Y Wang, B Zhao, CJ Zhang, D Lian, Y Xiong
arXiv preprint arXiv:2412.14475, 2024
32024
Fat: Field-Aware Transformer for 3D Point Cloud Semantic Segmentation
J Zhou, Y Xiong, C Chiu, F Liu, X Gong
2023 IEEE International Conference on Image Processing (ICIP), 660-664, 2023
12023
Any Information Is Just Worth One Single Screenshot: Unifying Search With Visualized Information Retrieval
Z Liu, Z Liang, J Zhou, Z Liu, D Lian
arXiv preprint arXiv:2502.11431, 2025
2025
Systemet kan ikke utføre handlingen. Prøv på nytt senere.
Artikler 1–10