Avatargen: a 3d generative model for animatable human avatars

J Zhang, Z Jiang, D Yang, H Xu, Y Shi, G Song… - … on Computer Vision, 2022 - Springer
Unsupervised generation of clothed virtual humans with various appearance and
animatable poses is important for creating 3D human avatars and other AR/VR applications …

Appearance and Pose-guided Human Generation: A Survey

F Liao, X Zou, W Wong - ACM Computing Surveys, 2024 - dl.acm.org
Appearance and pose-guided human generation is a burgeoning field that has captured
significant attention. This subject's primary objective is to transfer pose information from a …

PoseTriplet: Co-evolving 3D human pose estimation, imitation, and hallucination under self-supervision

K Gong, B Li, J Zhang, T Wang… - Proceedings of the …, 2022 - openaccess.thecvf.com
Existing self-supervised 3D human pose estimation schemes have largely relied on weak
supervisions like consistency loss to guide the learning, which, inevitably, leads to inferior …

Xagen: 3d expressive human avatars generation

Z Xu, J Zhang, JH Liew, J Feng… - Advances in Neural …, 2023 - proceedings.neurips.cc
Recent advances in 3D-aware GAN models have enabled the generation of realistic and
controllable human body images. However, existing methods focus on the control of major …

Geometry-guided progressive nerf for generalizable and efficient neural human rendering

M Chen, J Zhang, X Xu, L Liu, Y Cai, J Feng… - European Conference on …, 2022 - Springer
In this work we develop a generalizable and efficient Neural Radiance Field (NeRF) pipeline
for high-fidelity free-viewpoint human body synthesis under settings with sparse camera …

Human image generation: A comprehensive survey

Z Jia, Z Zhang, L Wang, T Tan - ACM Computing Surveys, 2024 - dl.acm.org
Image and video synthesis has become a blooming topic in computer vision and machine
learning communities along with the developments of deep generative models, due to its …

Learning to augment poses for 3D human pose estimation in images and videos

J Zhang, K Gong, X Wang… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Existing 3D human pose estimation methods often suffer inferior generalization performance
to new datasets, largely due to the limited diversity of 2D-3D pose pairs in the training data …

Humandiffusion: a coarse-to-fine alignment diffusion framework for controllable text-driven person image generation

K Zhang, M Sun, J Sun, B Zhao, K Zhang, Z Sun… - arxiv preprint arxiv …, 2022 - arxiv.org
Text-driven person image generation is an emerging and challenging task in cross-modality
image generation. Controllable person image generation promotes a wide range of …

CrossFormer: Cross-modal Representation Learning via Heterogeneous Graph Transformer

X Liang, E Yang, C Deng, Y Yang - ACM Transactions on Multimedia …, 2024 - dl.acm.org
Transformers have been recognized as powerful tools for various cross-modal tasks due to
their superior ability to perform representation learning through self-attention. Existing …

Texture-Aware Causal Feature Extraction Network for Multimodal Remote Sensing Data Classification

Z Xu, W Jiang, J Geng - IEEE Transactions on Geoscience and …, 2024 - ieeexplore.ieee.org
The pixel-level classification of multimodal remote sensing (RS) images plays a crucial role
in the intelligent interpretation of RS data. However, existing methods that mainly focus on …