Deep learning-based human pose estimation: A survey
Human pose estimation aims to locate the human body parts and build human body
representation (eg, body skeleton) from input data such as images and videos. It has drawn …
representation (eg, body skeleton) from input data such as images and videos. It has drawn …
State of the art on neural rendering
Efficient rendering of photo‐realistic virtual worlds is a long standing effort of computer
graphics. Modern graphics techniques have succeeded in synthesizing photo‐realistic …
graphics. Modern graphics techniques have succeeded in synthesizing photo‐realistic …
Zero-1-to-3: Zero-shot one image to 3d object
Abstract We introduce Zero-1-to-3, a framework for changing the camera viewpoint of an
object given just a single RGB image. To perform novel view synthesis in this …
object given just a single RGB image. To perform novel view synthesis in this …
One-2-3-45: Any single image to 3d mesh in 45 seconds without per-shape optimization
Single image 3D reconstruction is an important but challenging task that requires extensive
knowledge of our natural world. Many existing methods solve this problem by optimizing a …
knowledge of our natural world. Many existing methods solve this problem by optimizing a …
Generative novel view synthesis with 3d-aware diffusion models
We present a diffusion-based model for 3D-aware generative novel view synthesis from as
few as a single input image. Our model samples from the distribution of possible renderings …
few as a single input image. Our model samples from the distribution of possible renderings …
Surroundocc: Multi-camera 3d occupancy prediction for autonomous driving
Abstract 3D scene understanding plays a vital role in vision-based autonomous driving.
While most existing methods focus on 3D object detection, they have difficulty describing …
While most existing methods focus on 3D object detection, they have difficulty describing …
Implicit diffusion models for continuous super-resolution
Image super-resolution (SR) has attracted increasing attention due to its wide applications.
However, current SR methods generally suffer from over-smoothing and artifacts, and most …
However, current SR methods generally suffer from over-smoothing and artifacts, and most …
Omniobject3d: Large-vocabulary 3d object dataset for realistic perception, reconstruction and generation
Recent advances in modeling 3D objects mostly rely on synthetic datasets due to the lack of
large-scale real-scanned 3D databases. To facilitate the development of 3D perception …
large-scale real-scanned 3D databases. To facilitate the development of 3D perception …
Dynibar: Neural dynamic image-based rendering
We address the problem of synthesizing novel views from a monocular video depicting a
complex dynamic scene. State-of-the-art methods based on temporally varying Neural …
complex dynamic scene. State-of-the-art methods based on temporally varying Neural …
Econ: Explicit clothed humans optimized via normal integration
The combination of deep learning, artist-curated scans, and Implicit Functions (IF), is
enabling the creation of detailed, clothed, 3D humans from images. However, existing …
enabling the creation of detailed, clothed, 3D humans from images. However, existing …