ChatPose: Chatting about 3D human pose

Y Feng, J Lin, SK Dwivedi, Y Sun… - Proceedings of the …, 2024 - openaccess.thecvf.com
We introduce ChatPose, a framework employing Large Language Models (LLMs) to
understand and reason about 3D human poses from images or textual descriptions. Our …

Make-An-Animation: Large-scale text-conditional 3D human motion generation

S Azadi, A Shah, T Hayes, D Parikh… - Proceedings of the …, 2023 - openaccess.thecvf.com
Text-guided human motion generation has drawn significant interest because of its impactful
applications spanning animation and robotics. Recently, application of diffusion models for …

PoseScript: 3D human poses from natural language

G Delmas, P Weinzaepfel, T Lucas… - … on Computer Vision, 2022 - Springer
Natural language is leveraged in many computer vision tasks such as image captioning,
cross-modal retrieval or visual question answering, to provide fine-grained semantic …

Trends in integration of vision and language research: A survey of tasks, datasets, and methods

A Mogadala, M Kalimuthu, D Klakow - Journal of Artificial Intelligence …, 2021 - jair.org
Interest in Artificial Intelligence (AI) and its applications has seen unprecedented
growth in the last few years. This success can be partly attributed to the advancements made …

TIPS: Text-induced pose synthesis

P Roy, S Ghosh, S Bhattacharya, U Pal… - … on Computer Vision, 2022 - Springer
In computer vision, human pose synthesis and transfer deal with probabilistic image
generation of a person in a previously unseen pose from an already available observation of …

Text-conditional contextualized avatars for zero-shot personalization

S Azadi, T Hayes, A Shah, G Pang, D Parikh… - arXiv preprint arXiv …, 2023 - arxiv.org
Recent large-scale text-to-image generation models have made significant improvements in
the quality, realism, and diversity of the synthesized images and enable users to control the …

PoseScript: Linking 3D human poses and natural language

G Delmas, P Weinzaepfel, T Lucas… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Natural language plays a critical role in many computer vision applications, such as image
captioning, visual question answering, and cross-modal retrieval, to provide fine-grained …