FastViT: A fast hybrid vision transformer using structural reparameterization
The recent amalgamation of transformer and convolutional designs has led to steady
improvements in accuracy and efficiency of the models. In this work, we introduce FastViT, a …
Frankmocap: A monocular 3d whole-body pose estimation system via regression and integration
Most existing monocular 3D pose estimation approaches only focus on a single body part,
neglecting the fact that the essential nuance of human motion is conveyed through a concert …
Interacting attention graph for single image two-hand reconstruction
Graph convolutional network (GCN) has achieved great success in single hand
reconstruction task, while interacting two-hand reconstruction by GCN remains unexplored …
Mobrecon: Mobile-friendly hand mesh reconstruction from monocular image
In this work, we propose a framework for single-view hand mesh reconstruction, which can
simultaneously achieve high reconstruction accuracy, fast inference speed, and temporal …
gsdf: Geometry-driven signed distance functions for 3d hand-object reconstruction
Signed distance functions (SDFs) are an attractive framework that has recently shown
promising results for 3D shape reconstruction from images. SDFs seamlessly generalize to …
Taco: Benchmarking generalizable bimanual tool-action-object understanding
Humans commonly work with multiple objects in daily life and can intuitively transfer
manipulation skills to novel objects by understanding object functional regularities. However …
Reconstructing interacting hands with interaction prior from monocular images
Reconstructing interacting hands from monocular images is indispensable in AR/VR
applications. Most existing solutions rely on the accurate localization of each skeleton joint …
Artiboost: Boosting articulated 3d hand-object pose estimation via online exploration and synthesis
Estimating the articulated 3D hand-object pose from a single RGB image is a highly
ambiguous and challenging problem, requiring large-scale datasets that contain diverse …
Alignsdf: Pose-aligned signed distance fields for hand-object reconstruction
Recent work achieved impressive progress towards joint reconstruction of hands and
manipulated objects from monocular color images. Existing methods focus on two …
Towards accurate alignment in real-time 3d hand-mesh reconstruction
3D hand-mesh reconstruction from RGB images facilitates many applications,
including augmented reality (AR). However, this requires not only real-time speed and …