Deep learning-based human pose estimation: A survey

C Zheng, W Wu, C Chen, T Yang, S Zhu, J Shen… - ACM Computing …, 2023 - dl.acm.org
Human pose estimation aims to locate the human body parts and build human body
representation (eg, body skeleton) from input data such as images and videos. It has drawn …

State of the art on neural rendering

A Tewari, O Fried, J Thies, V Sitzmann… - Computer Graphics …, 2020 - Wiley Online Library
Efficient rendering of photo‐realistic virtual worlds is a long standing effort of computer
graphics. Modern graphics techniques have succeeded in synthesizing photo‐realistic …

Zero-1-to-3: Zero-shot one image to 3d object

R Liu, R Wu, B Van Hoorick… - Proceedings of the …, 2023 - openaccess.thecvf.com
Abstract We introduce Zero-1-to-3, a framework for changing the camera viewpoint of an
object given just a single RGB image. To perform novel view synthesis in this …

One-2-3-45: Any single image to 3d mesh in 45 seconds without per-shape optimization

M Liu, C Xu, H **, L Chen… - Advances in Neural …, 2024 - proceedings.neurips.cc
Single image 3D reconstruction is an important but challenging task that requires extensive
knowledge of our natural world. Many existing methods solve this problem by optimizing a …

Generative novel view synthesis with 3d-aware diffusion models

ER Chan, K Nagano, MA Chan… - Proceedings of the …, 2023 - openaccess.thecvf.com
We present a diffusion-based model for 3D-aware generative novel view synthesis from as
few as a single input image. Our model samples from the distribution of possible renderings …

Surroundocc: Multi-camera 3d occupancy prediction for autonomous driving

Y Wei, L Zhao, W Zheng, Z Zhu… - Proceedings of the …, 2023 - openaccess.thecvf.com
Abstract 3D scene understanding plays a vital role in vision-based autonomous driving.
While most existing methods focus on 3D object detection, they have difficulty describing …

Implicit diffusion models for continuous super-resolution

S Gao, X Liu, B Zeng, S Xu, Y Li… - Proceedings of the …, 2023 - openaccess.thecvf.com
Image super-resolution (SR) has attracted increasing attention due to its wide applications.
However, current SR methods generally suffer from over-smoothing and artifacts, and most …

Omniobject3d: Large-vocabulary 3d object dataset for realistic perception, reconstruction and generation

T Wu, J Zhang, X Fu, Y Wang, J Ren… - Proceedings of the …, 2023 - openaccess.thecvf.com
Recent advances in modeling 3D objects mostly rely on synthetic datasets due to the lack of
large-scale real-scanned 3D databases. To facilitate the development of 3D perception …

Dynibar: Neural dynamic image-based rendering

Z Li, Q Wang, F Cole, R Tucker… - Proceedings of the …, 2023 - openaccess.thecvf.com
We address the problem of synthesizing novel views from a monocular video depicting a
complex dynamic scene. State-of-the-art methods based on temporally varying Neural …

Econ: Explicit clothed humans optimized via normal integration

Y **u, J Yang, X Cao, D Tzionas… - Proceedings of the …, 2023 - openaccess.thecvf.com
The combination of deep learning, artist-curated scans, and Implicit Functions (IF), is
enabling the creation of detailed, clothed, 3D humans from images. However, existing …