Zhuofan Xia
PhD candidate, Department of Automation, Tsinghua University
Verified email at mails.tsinghua.edu.cn - Homepage
Title
Cited by
Year
Vision Transformer with Deformable Attention
Z Xia, X Pan, S Song, LE Li, G Huang
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2022), 2022
Cited by: 660; Year: 2022
3D Object Detection with Pointformer
X Pan, Z Xia, S Song, LE Li, G Huang
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2021), 2021
Cited by: 453; Year: 2021
Adaptive Rotated Convolution for Rotated Object Detection
Y Pu, Y Wang, Z Xia, Y Han, Y Wang, W Gan, Z Wang, S Song, G Huang
IEEE/CVF International Conference on Computer Vision (ICCV 2023), 2023
Cited by: 98; Year: 2023
Agent attention: On the integration of softmax and linear attention
D Han, T Ye, Y Han, Z Xia, S Song, G Huang
European Conference on Computer Vision (ECCV 2024), 2024
Cited by: 80; Year: 2024
Slide-transformer: Hierarchical vision transformer with local self-attention
X Pan, T Ye, Z Xia, S Song, G Huang
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2023), 2023
Cited by: 67; Year: 2023
GSVA: Generalized Segmentation via Multimodal Large Language Models
Z Xia, D Han, Y Han, X Pan, S Song, G Huang
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2024), 2024
Cited by: 39; Year: 2024
Demystify Mamba in Vision: A Linear Attention Perspective
D Han, Z Wang, Z Xia, Y Han, Y Pu, C Ge, J Song, S Song, B Zheng, ...
arXiv preprint arXiv:2405.16605, 2024
Cited by: 35; Year: 2024
Dat++: Spatially dynamic vision transformer with deformable attention
Z Xia, X Pan, S Song, LE Li, G Huang
arXiv preprint arXiv:2309.01430, 2023
Cited by: 21; Year: 2023
Budgeted Training for Vision Transformer
Z Xia, X Pan, X Jin, Y He, H Xue, S Song, G Huang
International Conference on Learning Representations (ICLR 2023), 2023
Cited by: 9*; Year: 2023
Efficient Diffusion Transformer with Step-wise Dynamic Attention Mediators
Y Pu, Z Xia, J Guo, D Han, Q Li, D Li, Y Yuan, J Li, Y Han, S Song, ...
European Conference on Computer Vision (ECCV 2024)
Cited by: 9*
Bridging the divide: Reconsidering softmax and linear attention
D Han, Y Pu, Z Xia, Y Han, X Pan, X Li, J Lu, S Song, G Huang
Advances in Neural Information Processing Systems 37, 79221-79245, 2025
Cited by: 2; Year: 2025
Generalized Activation via Multivariate Projection
J Li, Y Cheng, Y Lu, Z Xia, Y Mo, G Huang
arXiv preprint arXiv:2309.17194, 2023
Cited by: 1; Year: 2023
Training an Open-Vocabulary Monocular 3D Object Detection Model without 3D Data
R Huang, H Zheng, Y Wang, Z Xia, M Pavone, G Huang
arXiv preprint arXiv:2411.15657, 2024
Year: 2024
Training an Open-Vocabulary Monocular 3D Detection Model without 3D Data
R Huang, H Zheng, Y Wang, Z Xia, M Pavone, G Huang
The Thirty-eighth Annual Conference on Neural Information Processing Systems
Articles 1–14