Bin Xiao
Title
Cited by
Year
Deep High-Resolution Representation Learning for Human Pose Estimation
K Sun, B Xiao, D Liu, J Wang
Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2019
Cited by: 5759; Year: 2019
Deep High-Resolution Representation Learning for Visual Recognition
J Wang, K Sun, T Cheng, B Jiang, C Deng, Y Zhao, D Liu, Y Mu, M Tan, ...
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2020
Cited by: 5169*; Year: 2020
CvT: Introducing Convolutions to Vision Transformers
H Wu, B Xiao, N Codella, M Liu, X Dai, L Yuan, L Zhang
ICCV 2021, 2021
Cited by: 2407; Year: 2021
Simple baselines for human pose estimation and tracking
B Xiao, H Wu, Y Wei
Proceedings of the European Conference on Computer Vision (ECCV), 466-481, 2018
Cited by: 2406; Year: 2018
Integral human pose regression
X Sun, B Xiao, F Wei, S Liang, Y Wei
Proceedings of the European Conference on Computer Vision (ECCV), 529-545, 2018
Cited by: 1035; Year: 2018
HigherHRNet: Scale-Aware Representation Learning for Bottom-Up Human Pose Estimation
B Cheng, B Xiao, J Wang, H Shi, TS Huang, L Zhang
CVPR, 2020
Cited by: 1033; Year: 2020
Florence: A New Foundation Model for Computer Vision
L Yuan, D Chen, YL Chen, N Codella, X Dai, J Gao, H Hu, X Huang, B Li, ...
arXiv preprint arXiv:2111.11432, 2021
Cited by: 954; Year: 2021
Dynamic Head: Unifying Object Detection Heads with Attentions
X Dai, Y Chen, B Xiao, D Chen, M Liu, L Yuan, L Zhang
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021
Cited by: 761; Year: 2021
Phi-3 technical report: A highly capable language model locally on your phone
M Abdin, J Aneja, H Awadalla, A Awadallah, AA Awan, N Bach, A Bahree, ...
arXiv preprint arXiv:2404.14219, 2024
Cited by: 723; Year: 2024
Focal attention for long-range interactions in vision transformers
J Yang, C Li, P Zhang, X Dai, B Xiao, L Yuan, J Gao
Advances in Neural Information Processing Systems 34, 30008-30022, 2021
Cited by: 664*; Year: 2021
Lite-hrnet: A lightweight high-resolution network
C Yu, B Xiao, C Gao, L Yuan, L Zhang, N Sang, J Wang
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021
Cited by: 470; Year: 2021
Interleaved Group Convolutions
T Zhang, GJ Qi, B Xiao, J Wang
The IEEE International Conference on Computer Vision (ICCV), 4373-4382, 2017
Cited by: 415; Year: 2017
Multi-Scale Vision Longformer: A New Vision Transformer for High-Resolution Image Encoding
P Zhang, X Dai, J Yang, B Xiao, L Yuan, L Zhang, J Gao
ICCV 2021, 2021
Cited by: 392; Year: 2021
Bottom-Up Human Pose Estimation Via Disentangled Keypoint Regression
Z Geng, K Sun, B Xiao, Z Zhang, J Wang
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021
Cited by: 363; Year: 2021
DaViT: Dual Attention Vision Transformers
M Ding, B Xiao, N Codella, P Luo, J Wang, L Yuan
ECCV, 2022
Cited by: 344; Year: 2022
TinyViT: Fast Pretraining Distillation for Small Vision Transformers
K Wu, J Zhang, H Peng, M Liu, B Xiao, J Fu, L Yuan
ECCV, 2022
Cited by: 261; Year: 2022
Efficient Self-supervised Vision Transformers for Representation Learning
C Li, J Yang, P Zhang, M Gao, B Xiao, X Dai, L Yuan, J Gao
ICLR, 2021
Cited by: 239; Year: 2021
Unified contrastive learning in image-text-label space
J Yang, C Li, P Zhang, B Xiao, C Liu, L Yuan, J Gao
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
Cited by: 222; Year: 2022
MiniViT: Compressing Vision Transformers with Weight Multiplexing
J Zhang, H Peng, K Wu, M Liu, B Xiao, J Fu, L Yuan
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
Cited by: 140; Year: 2022
Florence-2: Advancing a unified representation for a variety of vision tasks
B Xiao, H Wu, W Xu, X Dai, H Hu, Y Lu, M Zeng, C Liu, L Yuan
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
Cited by: 90; Year: 2024