MotionGPT: Human motion as a foreign language

B Jiang, X Chen, W Liu, J Yu, G Yu… - Advances in Neural …, 2023 - proceedings.neurips.cc
Though the advancement of pre-trained large language models unfolds, the exploration of
building a unified model for language and other multimodal data, such as motion, remains …

Generating diverse and natural 3D human motions from text

C Guo, S Zou, X Zuo, S Wang, W Ji… - Proceedings of the …, 2022 - openaccess.thecvf.com
Automated generation of 3D human motions from text is a challenging problem. The
generated motions are expected to be sufficiently diverse to explore the text-grounded …

TEMOS: Generating Diverse Human Motions from Textual Descriptions

M Petrovich, MJ Black, G Varol - European Conference on Computer …, 2022 - Springer
We address the problem of generating diverse 3D human motions from textual descriptions.
This challenging task requires joint modeling of both modalities: understanding and …

MotionCLIP: Exposing human motion generation to CLIP space

G Tevet, B Gordon, A Hertz, AH Bermano… - … on Computer Vision, 2022 - Springer
We introduce MotionCLIP, a 3D human motion auto-encoder featuring a latent embedding
that is disentangled, well behaved, and supports highly semantic textual descriptions …

TM2T: Stochastic and tokenized modeling for the reciprocal generation of 3D human motions and texts

C Guo, X Zuo, S Wang, L Cheng - European Conference on Computer …, 2022 - Springer
Inspired by the strong ties between vision and language, the two intimate human sensing
and communication modalities, our paper aims to explore the generation of 3D human full …

Synthesis of compositional animations from textual descriptions

A Ghosh, N Cheema, C Oguz… - Proceedings of the …, 2021 - openaccess.thecvf.com
"How can we animate 3D-characters from a movie script or move robots by simply telling
them what we would like them to do?" How unstructured and complex can we make a …

Survey on frontiers of language and robotics

T Taniguchi, D Mochihashi, T Nagai, S Uchida… - Advanced …, 2019 - Taylor & Francis
The understanding and acquisition of a language in a real-world environment is an
important task for future robotics services. Natural language processing and cognitive …

Language2Pose: Natural language grounded pose forecasting

C Ahuja, LP Morency - 2019 International conference on 3D …, 2019 - ieeexplore.ieee.org
Generating animations from natural language sentences finds its applications in a number
of domains such as movie script visualization, virtual human animation, and robot motion …

3D human motion estimation via motion compression and refinement

Z Luo, SA Golestaneh… - Proceedings of the Asian …, 2020 - openaccess.thecvf.com
We develop a technique for generating smooth and accurate 3D human pose and motion
estimates from RGB video sequences. Our technique, which we call Motion Estimation via …

PoseScript: 3D human poses from natural language

G Delmas, P Weinzaepfel, T Lucas… - … on Computer Vision, 2022 - Springer
Natural language is leveraged in many computer vision tasks such as image captioning,
cross-modal retrieval or visual question answering, to provide fine-grained semantic …