MotionGPT: Human motion as a foreign language

B Jiang, X Chen, W Liu, J Yu, G Yu… - Advances in Neural …, 2023 - proceedings.neurips.cc
Though the advancement of pre-trained large language models unfolds, the exploration of
building a unified model for language and other multimodal data, such as motion, remains …

Executing your commands via motion diffusion in latent space

X Chen, B Jiang, W Liu, Z Huang… - Proceedings of the …, 2023 - openaccess.thecvf.com
We study a challenging task, conditional human motion generation, which produces
plausible human motion sequences according to various conditional inputs, such as action …

Learning an animatable detailed 3D face model from in-the-wild images

Y Feng, H Feng, MJ Black, T Bolkart - ACM Transactions on Graphics …, 2021 - dl.acm.org
While current monocular 3D face reconstruction methods can recover fine geometric details,
they suffer several limitations. Some methods produce faces that cannot be realistically …

Survey on 3D face reconstruction from uncalibrated images

A Morales, G Piella, FM Sukno - Computer Science Review, 2021 - Elsevier
Recently, a lot of attention has been focused on the incorporation of 3D data into face
analysis and its applications. Despite providing a more accurate representation of the face …

3D face reconstruction: the road to forensics

SM La Cava, G Orrù, M Drahansky, GL Marcialis… - ACM Computing …, 2023 - dl.acm.org
3D face reconstruction algorithms from images and videos are applied to many fields, from
plastic surgery to the entertainment sector, thanks to their advantageous features. However …

A hierarchical representation network for accurate and detailed face reconstruction from in-the-wild images

B Lei, J Ren, M Feng, M Cui… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Limited by the nature of the low-dimensional representational capacity of 3DMM, most of the
3DMM-based face reconstruction (FR) methods fail to recover high-frequency facial details …

MotionChain: Conversational motion controllers via multimodal prompts

B Jiang, X Chen, C Zhang, F Yin, Z Li, G Yu… - European Conference on …, 2024 - Springer
Recent advancements in language models have demonstrated their adeptness in
conducting multi-turn dialogues and retaining conversational context. However, this …

Photo-realistic facial details synthesis from single image

A Chen, Z Chen, G Zhang… - Proceedings of the …, 2019 - openaccess.thecvf.com
We present a single-image 3D face synthesis technique that can handle challenging facial
expressions while recovering fine geometric details. Our technique employs expression …

Relightable neural human assets from multi-view gradient illuminations

T Zhou, K He, D Wu, T Xu, Q Zhang… - Proceedings of the …, 2023 - openaccess.thecvf.com
Human modeling and relighting are two fundamental problems in computer vision and
graphics, where high-quality datasets can largely facilitate related research. However, most …

Knowledge-augmented deep learning and its applications: A survey

Z Cui, T Gao, K Talamadupula… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Deep learning models, though having achieved great success in many different fields over
the past years, are usually data-hungry, fail to perform well on unseen samples, and lack …