Fastvit: A fast hybrid vision transformer using structural reparameterization

PKA Vasu, J Gabriel, J Zhu, O Tuzel… - Proceedings of the …, 2023 - openaccess.thecvf.com
The recent amalgamation of transformer and convolutional designs has led to steady
improvements in accuracy and efficiency of the models. In this work, we introduce FastViT, a …

Reconstructing hands in 3d with transformers

G Pavlakos, D Shan, I Radosavovic… - Proceedings of the …, 2024 - openaccess.thecvf.com
We present an approach that can reconstruct hands in 3D from monocular input. Our
approach for Hand Mesh Recovery HaMeR follows a fully transformer-based architecture …

Interacting attention graph for single image two-hand reconstruction

M Li, L An, H Zhang, L Wu, F Chen… - Proceedings of the …, 2022 - openaccess.thecvf.com
Graph convolutional network (GCN) has achieved great success in single hand
reconstruction task, while interacting two-hand reconstruction by GCN remains unexplored …

Mobrecon: Mobile-friendly hand mesh reconstruction from monocular image

X Chen, Y Liu, Y Dong, X Zhang… - Proceedings of the …, 2022 - openaccess.thecvf.com
In this work, we propose a framework for single-view hand mesh reconstruction, which can
simultaneously achieve high reconstruction accuracy, fast inference speed, and temporal …

Acr: Attention collaboration-based regressor for arbitrary two-hand reconstruction

Z Yu, S Huang, C Fang… - Proceedings of the …, 2023 - openaccess.thecvf.com
Reconstructing two hands from monocular RGB images is challenging due to frequent
occlusion and mutual confusion. Existing methods mainly learn an entangled representation …

Lisa: Learning implicit shape and appearance of hands

E Corona, T Hodan, M Vo… - Proceedings of the …, 2022 - openaccess.thecvf.com
This paper proposes a do-it-all neural model of human hands, named LISA. The model can
capture accurate hand shape and appearance, generalize to arbitrary hand subjects …

Artiboost: Boosting articulated 3d hand-object pose estimation via online exploration and synthesis

L Yang, K Li, X Zhan, J Lv, W Xu… - Proceedings of the …, 2022 - openaccess.thecvf.com
Estimating the articulated 3D hand-object pose from a single RGB image is a highly
ambiguous and challenging problem, requiring large-scale datasets that contain diverse …

H2onet: Hand-occlusion-and-orientation-aware network for real-time 3d hand mesh reconstruction

H Xu, T Wang, X Tang, CW Fu - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Real-time 3D hand mesh reconstruction is challenging, especially when the hand is holding
some object. Beyond the previous methods, we design H2ONet to fully exploit non-occluded …

Reconstructing interacting hands with interaction prior from monocular images

B Zuo, Z Zhao, W Sun, W **e… - Proceedings of the …, 2023 - openaccess.thecvf.com
Reconstructing interacting hands from monocular images is indispensable in AR/VR
applications. Most existing solutions rely on the accurate localization of each skeleton joint …

Challenges and solutions for vision-based hand gesture interpretation: A review

K Gao, H Zhang, X Liu, X Wang, L **e, B Ji… - Computer Vision and …, 2024 - Elsevier
Hand gesture is one of the most efficient and natural interfaces in current human–computer
interaction (HCI) systems. Despite the great progress achieved in hand gesture-based HCI …