Sign language recognition: A deep survey
Sign language, as a different form of the communication language, is important to large
groups of people in society. There are different signs in each sign language with variability …
Recovering 3D human mesh from monocular images: A survey
Estimating human pose and shape from monocular images is a long-standing problem in
computer vision. Since the release of statistical body models, 3D human mesh recovery has …
Ego-Exo4D: Understanding skilled human activity from first- and third-person perspectives
We present Ego-Exo4D, a diverse large-scale multimodal multiview video dataset
and benchmark challenge. Ego-Exo4D centers around simultaneously-captured egocentric …
Dynamic 3D Gaussians: Tracking by persistent dynamic view synthesis
We present a method that simultaneously addresses the tasks of dynamic scene novel-view
synthesis and six degree-of-freedom (6-DOF) tracking of all dense scene elements. We …
Fake it till you make it: face analysis in the wild using synthetic data alone
We demonstrate that it is possible to perform face-related computer vision in the wild using
synthetic data alone. The community has long enjoyed the benefits of synthesizing training …
MediaPipe Hands: On-device real-time hand tracking
We present a real-time on-device hand tracking pipeline that predicts hand skeleton from
single RGB camera for AR/VR applications. The pipeline consists of two models: 1) a palm …
Probabilistic regression for visual tracking
Visual tracking is fundamentally the problem of regressing the state of the target in each
video frame. While significant progress has been achieved, trackers are still prone to failures …
FrankMocap: A monocular 3D whole-body pose estimation system via regression and integration
Most existing monocular 3D pose estimation approaches only focus on a single body part,
neglecting the fact that the essential nuance of human motion is conveyed through a concert …
Learning to reconstruct 3D human pose and shape via model-fitting in the loop
Model-based human pose estimation is currently approached through two different
paradigms. Optimization-based methods fit a parametric body model to 2D observations in …