Ai-generated content (aigc) for various data modalities: A survey

LG Foo, H Rahmani, J Liu - arxiv preprint arxiv:2308.14177, 2023‏ - arxiv.org
AI-generated content (AIGC) methods aim to produce text, images, videos, 3D assets, and
other media using AI algorithms. Due to its wide range of applications and the demonstrated …

Diffusion-based 3d human pose estimation with multi-hypothesis aggregation

W Shan, Z Liu, X Zhang, Z Wang… - Proceedings of the …, 2023‏ - openaccess.thecvf.com
In this paper, a novel Diffusion-based 3D Pose estimation (D3DP) method with Joint-wise
reProjection-based Multi-hypothesis Aggregation (JPMA) is proposed for probabilistic 3D …

Deep learning for 3d human pose estimation and mesh recovery: A survey

Y Liu, C Qiu, Z Zhang - Neurocomputing, 2024‏ - Elsevier
Abstract 3D human pose estimation and mesh recovery have attracted widespread research
interest in many areas, such as computer vision, autonomous driving, and robotics. Deep …

Zigma: A dit-style zigzag mamba diffusion model

VT Hu, SA Baumann, M Gui, O Grebenkova… - … on Computer Vision, 2024‏ - Springer
The diffusion model has long been plagued by scalability and quadratic complexity issues,
especially within transformer-based structures. In this study, we aim to leverage the long …

Unified pose sequence modeling

LG Foo, T Li, H Rahmani, Q Ke… - Proceedings of the IEEE …, 2023‏ - openaccess.thecvf.com
Abstract We propose a Unified Pose Sequence Modeling approach to unify heterogeneous
human behavior understanding tasks based on pose data, eg, action recognition, 3D pose …

Ktpformer: Kinematics and trajectory prior knowledge-enhanced transformer for 3d human pose estimation

J Peng, Y Zhou, PY Mok - … of the IEEE/CVF Conference on …, 2024‏ - openaccess.thecvf.com
This paper presents a novel Kinematics and Trajectory Prior Knowledge-Enhanced
Transformer (KTPFormer) which overcomes the weakness in existing transformer-based …

Diffusion-based image translation with label guidance for domain adaptive semantic segmentation

D Peng, P Hu, Q Ke, J Liu - Proceedings of the IEEE/CVF …, 2023‏ - openaccess.thecvf.com
Translating images from a source domain to a target domain for learning target models is
one of the most common strategies in domain adaptive semantic segmentation (DASS) …

Finepose: Fine-grained prompt-driven 3d human pose estimation via diffusion models

J Xu, Y Guo, Y Peng - … of the IEEE/CVF Conference on …, 2024‏ - openaccess.thecvf.com
Abstract The 3D Human Pose Estimation (3D HPE) task uses 2D images or videos to predict
human joint coordinates in 3D space. Despite recent advancements in deep learning-based …

Monodiff: Monocular 3d object detection and pose estimation with diffusion models

Y Ranasinghe, D Hegde… - Proceedings of the IEEE …, 2024‏ - openaccess.thecvf.com
Abstract 3D object detection and pose estimation from a single-view image is challenging
due to the high uncertainty caused by the absence of 3D perception. As a solution recent …

Back to optimization: Diffusion-based zero-shot 3d human pose estimation

Z Jiang, Z Zhou, L Li, W Chai… - Proceedings of the …, 2024‏ - openaccess.thecvf.com
Learning-based methods have dominated the 3D human pose estimation (HPE) tasks with
significantly better performance in most benchmarks than traditional optimization-based …