Følg
Xiaowei Chi
Tittel
Sitert av
Sitert av
År
Chatmusician: Understanding and generating music intrinsically with llm
R Yuan, H Lin, Y Wang, Z Tian, S Wu, T Shen, G Zhang, Y Wu, C Liu, ...
ACL 2024, 2024
372024
Bev-san: Accurate bev 3d object detection via slice attention networks
X Chi, J Liu, M Lu, R Zhang, Z Wang, Y Guo, S Zhang
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
222023
Llms meet multimodal generation and editing: A survey
Y He, Z Liu, J Chen, Z Tian, H Liu, X Chi, R Liu, R Yuan, Y Xing, W Wang, ...
arXiv preprint arXiv:2405.19334, 2024
172024
Unimodal training-multimodal prediction: Cross-modal federated learning with hierarchical aggregation
R Zhang, X Chi, G Liu, W Zhang, Y Du, F Wang
arXiv preprint arXiv:2303.15486, 2023
162023
DOZE: A Dataset for Open-Vocabulary Zero-Shot Object Navigation in Dynamic Environments
J Ma, H Dai, Y Mu, P Wu, H Wang, X Chi, Y Fei, S Zhang, C Liu
IEEE Robotics and Automation Letters, 2024
82024
Weakly-supervised emotion transition learning for diverse 3d co-speech gesture generation
X Qi, J Pan, P Li, R Yuan, X Chi, M Li, W Luo, W Xue, S Zhang, Q Liu, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
82024
BEVUDA: Multi-geometric space alignments for domain adaptive BEV 3D object detection
J Liu, R Zhang, X Li, X Chi, Z Chen, M Lu, Y Guo, S Zhang
2024 IEEE International Conference on Robotics and Automation (ICRA), 9487-9494, 2024
6*2024
Coal and gangue recognition method based on local texture classification network for robot picking
Y Xie, X Chi, H Li, F Wang, L Yan, B Zhang, Q Zhang
Applied Sciences 11 (23), 11495, 2021
62021
Chatillusion: Efficient-aligning interleaved generation ability with visual instruction model
X Chi, Y Liu, Z Jiang, R Zhang, Z Lin, R Zhang, P Gao, C Fu, S Zhang, ...
CoRR, 2023
32023
Towards efficient full 8-bit integer DNN online training on resource-limited devices without batch normalization
Y Yang, X Chi, L Deng, T Yan, F Gao, G Li
Neurocomputing 511, 175-186, 2022
32022
EVA: An Embodied World Model for Future Video Anticipation
X Chi, H Zhang, CK Fan, X Qi, R Zhang, A Chen, C Chan, W Xue, W Luo, ...
arXiv preprint arXiv:2410.15461, 2024
22024
PSHuman: Photorealistic Single-view Human Reconstruction using Cross-Scale Diffusion
P Li, W Zheng, Y Liu, T Yu, Y Li, X Qi, M Li, X Chi, S Xia, W Xue, W Luo, ...
arXiv preprint arXiv:2409.10141, 2024
22024
MMTrail: A Multimodal Trailer Video Dataset with Language and Music Descriptions
X Chi, Y Wang, A Cheng, P Fang, Z Tian, Y He, Z Liu, X Qi, J Pan, ...
arXiv preprint arXiv:2407.20962, 2024
22024
M-lrm: Multi-view large reconstruction model
M Li, X Long, Y Liang, W Li, Y Liu, P Li, X Chi, X Qi, W Xue, W Luo, Q Liu, ...
arXiv preprint arXiv:2406.07648, 2024
12024
Cocogesture: Toward coherent co-speech 3d gesture generation in the wild
X Qi, H Zhang, Y Wang, J Pan, C Liu, P Li, X Chi, M Li, W Xue, S Zhang, ...
arXiv preprint arXiv:2405.16874, 2024
12024
MChat: Empowering VLM for Multimodal LLM Interleaved Text-Image Generation
X Chi, R Zhang, Z Jiang, Y Liu, Y Wang, X Qi, W Luo, P Gao, S Zhang, ...
arXiv preprint arXiv:2311.17963, 2023
2023
ViML: A Video, Music, Language Unified Dataset for Understanding and Generation
X Chi, A Cheng, Y Wang, P Fang, Z Tian, Y He, X Qi, Z Liu, R Zhang, ...
: Towards Coherent Co-speech 3D Gesture Generation in the Wild
X Qi, H Zhang, Y Wang, J Pan, C Liu, P Li, X Chi, M Li, W Xue, S Zhang, ...
Systemet kan ikke utføre handlingen. Prøv på nytt senere.
Artikler 1–18