Unsupervised multi-source domain adaptation for person re-identification Z Bai, Z Wang, J Wang, D Hu, E Ding Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021 | 110 | 2021 |
Hallucination of Multimodal Large Language Models: A Survey Z Bai, P Wang, T Xiao, T He, Z Han, Z Zhang, MZ Shou arXiv preprint arXiv:2404.18930, 2024 | 108 | 2024 |
Show-o: One Single Transformer to Unify Multimodal Understanding and Generation J Xie, W Mao, Z Bai, DJ Zhang, W Wang, KQ Lin, Y Gu, Z Chen, Z Yang, ... arXiv preprint arXiv:2408.12528, 2024 | 90 | 2024 |
Show, recall, and tell: Image captioning with recall mechanism L Wang, Z Bai, Y Zhang, H Lu Proceedings of the AAAI conference on artificial intelligence 34 (07), 12176 …, 2020 | 75 | 2020 |
Going beyond real data: A robust visual representation for vehicle re-identification Z Zheng, M Jiang, Z Wang, J Wang, Z Bai, X Zhang, X Yu, X Tan, Y Yang, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2020 | 57 | 2020 |
Explain me the painting: Multi-topic knowledgeable art description generation Z Bai, Y Nakashima, N Garcia Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021 | 46 | 2021 |
AssistGUI: Task-Oriented PC Graphical User Interface Automation D Gao, L Ji, Z Bai, M Ouyang, P Li, D Mao, Q Wu, W Zhang, P Wang, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 29* | 2024 |
Robust vehicle re-identification via rigid structure prior M Jiang, X Zhang, Y Yu, Z Bai, Z Zheng, Z Wang, J Wang, X Tan, H Sun, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021 | 25 | 2021 |
Enhancing emotional experience by building emotional virtual characters in vr volleyball games Z Bai, N Yao, N Mishra, H Chen, H Wang, N Magnenat Thalmann Computer Animation and Virtual Worlds 32 (3-4), e2008, 2021 | 18 | 2021 |
Skip\n: A simple method to reduce hallucination in large vision-language models Z Han, Z Bai, H Mei, Q Xu, C Zhang, MZ Shou arXiv preprint arXiv:2402.01345, 2024 | 13 | 2024 |
Object-centric multiple object tracking Z Zhao, J Wang, M Horn, Y Ding, T He, Z Bai, D Zietlow, CJ Simon-Gabriel, ... Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 12 | 2023 |
One Token to Seg Them All: Language Instructed Reasoning Segmentation in Videos Zechen Bai, Tong He, Haiyang Mei, Pichao Wang, Ziteng Gao, Joya Chen, Lei ... Thirty-eighth Conference on Neural Information Processing Systems (NeurIPS), 2024 | 8* | 2024 |
ShowUI: One Vision-Language-Action Model for Generalist GUI Agent KQ Lin, L Li, D Gao, Z Yang, Z Bai, W Lei, L Wang, MZ Shou NeurIPS 2024 Workshop on Open-World Agents, 2024 | 8* | 2024 |
Play with emotional characters: Improving user emotional experience by a data-driven approach in vr volleyball games Z Bai, N Yao, N Mishra, H Chen, H Wang, NM Thalmann 2021 IEEE Conference on Virtual Reality and 3D User Interfaces Abstracts and …, 2021 | 8 | 2021 |
Unsupervised Open-Vocabulary Object Localization in Videos K Fan, Z Bai, T Xiao, D Zietlow, M Horn, Z Zhao, CJ Simon-Gabriel, ... Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 6 | 2023 |
Adaptive Slot Attention: Object Discovery with Dynamic Slot Number K Fan, Z Bai, T Xiao, T He, M Horn, Y Fu, F Locatello, Z Zhang CVPR 2024, 2024 | 5 | 2024 |
Lova3: Learning to visual question answering, asking and assessment HH Zhao, P Zhou, D Gao, Z Bai, MZ Shou arXiv preprint arXiv:2405.14974, 2024 | 4 | 2024 |
Bring Your Own Character: A Holistic Solution for Automatic Facial Animation Generation of Customized Characters Z Bai, P Chen, X Peng, L Liu, N Yao, H Chen 2024 IEEE Conference Virtual Reality and 3D User Interfaces (VR), 429-438, 2024 | 2 | 2024 |
AssistEditor: Multi-Agent Collaboration for GUI Workflow Automation in Video Creation D Gao, S Hu, Z Bai, Q Lin, MZ Shou Proceedings of the 32nd ACM International Conference on Multimedia, 11255-11257, 2024 | 1 | 2024 |
Gqe: Generalized query expansion for enhanced text-video retrieval Z Bai, T Xiao, T He, P Wang, Z Zhang, T Brox, MZ Shou arXiv preprint arXiv:2408.07249, 2024 | 1 | 2024 |