[HTML][HTML] A survey of GPT-3 family large language models including ChatGPT and GPT-4

KS Kalyan - Natural Language Processing Journal, 2024 - Elsevier
Large language models (LLMs) are a special class of pretrained language models (PLMs)
obtained by scaling model size, pretraining corpus and computation. LLMs, because of their …

Intergen: Diffusion-based multi-human motion generation under complex interactions

H Liang, W Zhang, W Li, J Yu, L Xu - International Journal of Computer …, 2024 - Springer
We have recently seen tremendous progress in diffusion advances for generating realistic
human motions. Yet, they largely disregard the multi-human interactions. In this paper, we …

TMR: Text-to-motion retrieval using contrastive 3D human motion synthesis

M Petrovich, MJ Black, G Varol - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
In this paper, we present TMR, a simple yet effective approach for text to 3D human motion
retrieval. While previous work has only treated retrieval as a proxy evaluation metric, we …

SINC: Spatial composition of 3D human motions for simultaneous action generation

N Athanasiou, M Petrovich… - Proceedings of the …, 2023 - openaccess.thecvf.com
Our goal is to synthesize 3D human motions given textual inputs describing simultaneous
actions, for examplewaving hand'whilewalking'at the same time. We refer to generating such …

Cg-hoi: Contact-guided 3d human-object interaction generation

C Diller, A Dai - Proceedings of the IEEE/CVF Conference …, 2024 - openaccess.thecvf.com
We propose CG-HOI the first method to address the task of generating dynamic 3D human-
object interactions (HOIs) from text. We model the motion of both human and object in an …

Inter-x: Towards versatile human-human interaction analysis

L Xu, X Lv, Y Yan, X **, S Wu, C Xu… - Proceedings of the …, 2024 - openaccess.thecvf.com
The analysis of the ubiquitous human-human interactions is pivotal for understanding
humans as social beings. Existing human-human interaction datasets typically suffer from …

Vamos: Versatile action models for video understanding

S Wang, Q Zhao, MQ Do, N Agarwal, K Lee… - European Conference on …, 2024 - Springer
What makes good representations for video understanding, such as anticipating future
activities, or answering video-conditioned questions? While earlier approaches focus on …