A survey on generative diffusion models

H Cao, C Tan, Z Gao, Y Xu, G Chen… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Deep generative models have unlocked another profound realm of human creativity. By
capturing and generalizing patterns within data, we have entered the epoch of all …

Diffusion-based 3d human pose estimation with multi-hypothesis aggregation

W Shan, Z Liu, X Zhang, Z Wang… - Proceedings of the …, 2023 - openaccess.thecvf.com
In this paper, a novel Diffusion-based 3D Pose estimation (D3DP) method with Joint-wise
reProjection-based Multi-hypothesis Aggregation (JPMA) is proposed for probabilistic 3D …

Zigma: A dit-style zigzag mamba diffusion model

VT Hu, SA Baumann, M Gui, O Grebenkova… - … on Computer Vision, 2024 - Springer
The diffusion model has long been plagued by scalability and quadratic complexity issues,
especially within transformer-based structures. In this study, we aim to leverage the long …

Distribution-aligned diffusion for human mesh recovery

LG Foo, J Gong, H Rahmani… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Recovering a 3D human mesh from a single RGB image is a challenging task due to depth
ambiguity and self-occlusion, resulting in a high degree of uncertainty. Meanwhile, diffusion …

Diffusion-based image translation with label guidance for domain adaptive semantic segmentation

D Peng, P Hu, Q Ke, J Liu - Proceedings of the IEEE/CVF …, 2023 - openaccess.thecvf.com
Translating images from a source domain to a target domain for learning target models is
one of the most common strategies in domain adaptive semantic segmentation (DASS) …

Unified pose sequence modeling

LG Foo, T Li, H Rahmani, Q Ke… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Abstract We propose a Unified Pose Sequence Modeling approach to unify heterogeneous
human behavior understanding tasks based on pose data, eg, action recognition, 3D pose …

Skeleton-in-context: Unified skeleton sequence modeling with in-context learning

X Wang, Z Fang, X Li, X Li… - Proceedings of the …, 2024 - openaccess.thecvf.com
In-context learning provides a new perspective for multi-task modeling for vision and NLP.
Under this setting the model can perceive tasks from prompts and accomplish them without …

DiffHPE: Robust, coherent 3D human pose lifting with diffusion

C Rommel, E Valle, M Chen… - Proceedings of the …, 2023 - openaccess.thecvf.com
We present an innovative approach to 3D Human Pose Estimation (3D-HPE) by integrating
cutting-edge diffusion models, which have revolutionized diverse fields, but are relatively …

6d-diff: A keypoint diffusion framework for 6d object pose estimation

L Xu, H Qu, Y Cai, J Liu - … of the IEEE/CVF Conference on …, 2024 - openaccess.thecvf.com
Estimating the 6D object pose from a single RGB image often involves noise and
indeterminacy due to challenges such as occlusions and cluttered backgrounds. Meanwhile …

Back to optimization: Diffusion-based zero-shot 3d human pose estimation

Z Jiang, Z Zhou, L Li, W Chai… - Proceedings of the …, 2024 - openaccess.thecvf.com
Learning-based methods have dominated the 3D human pose estimation (HPE) tasks with
significantly better performance in most benchmarks than traditional optimization-based …