A survey of GPT-3 family large language models including ChatGPT and GPT-4
KS Kalyan - Natural Language Processing Journal, 2024 - Elsevier
Large language models (LLMs) are a special class of pretrained language models (PLMs)
obtained by scaling model size, pretraining corpus and computation. LLMs, because of their …
Intergen: Diffusion-based multi-human motion generation under complex interactions
We have recently seen tremendous progress in diffusion advances for generating realistic
human motions. Yet, they largely disregard the multi-human interactions. In this paper, we …
TMR: Text-to-motion retrieval using contrastive 3D human motion synthesis
In this paper, we present TMR, a simple yet effective approach for text to 3D human motion
retrieval. While previous work has only treated retrieval as a proxy evaluation metric, we …
SINC: Spatial composition of 3D human motions for simultaneous action generation
Our goal is to synthesize 3D human motions given textual inputs describing simultaneous
actions, for example 'waving hand' while 'walking' at the same time. We refer to generating such …
Cg-hoi: Contact-guided 3d human-object interaction generation
We propose CG-HOI, the first method to address the task of generating dynamic 3D human-
object interactions (HOIs) from text. We model the motion of both human and object in an …
Inter-x: Towards versatile human-human interaction analysis
The analysis of the ubiquitous human-human interactions is pivotal for understanding
humans as social beings. Existing human-human interaction datasets typically suffer from …
Vamos: Versatile action models for video understanding
What makes good representations for video understanding, such as anticipating future
activities, or answering video-conditioned questions? While earlier approaches focus on …