Intergen: Diffusion-based multi-human motion generation under complex interactions

H Liang, W Zhang, W Li, J Yu, L Xu - International Journal of Computer …, 2024 - Springer
We have recently seen tremendous progress in diffusion advances for generating realistic
human motions. Yet, they largely disregard the multi-human interactions. In this paper, we …

Large motion model for unified multi-modal motion generation

M Zhang, D **, C Gu, F Hong, Z Cai, J Huang… - … on Computer Vision, 2024 - Springer
Human motion generation, a cornerstone technique in animation and video production, has
widespread applications in various tasks like text-to-motion and music-to-dance. Previous …

Generating human motion in 3D scenes from text descriptions

Z Cen, H Pi, S Peng, Z Shen, M Yang… - Proceedings of the …, 2024 - openaccess.thecvf.com
Generating human motions from textual descriptions has gained growing research interest
due to its wide range of applications. However only a few works consider human-scene …

Generative action description prompts for skeleton-based action recognition

W **ang, C Li, Y Zhou, B Wang… - Proceedings of the …, 2023 - openaccess.thecvf.com
Skeleton-based action recognition has recently received considerable attention. Current
approaches to skeleton-based action recognition are typically formulated as one-hot …

Inter-x: Towards versatile human-human interaction analysis

L Xu, X Lv, Y Yan, X **, S Wu, C Xu… - Proceedings of the …, 2024 - openaccess.thecvf.com
The analysis of the ubiquitous human-human interactions is pivotal for understanding
humans as social beings. Existing human-human interaction datasets typically suffer from …

Integer or floating point? new outlooks for low-bit quantization on large language models

Y Zhang, L Zhao, S Cao, S Zhang… - … on Multimedia and …, 2024 - ieeexplore.ieee.org
Efficient deployment of Large Language Models (LLMs) requires low-bit quantization to
reduce model size and inference cost. Besides low-bit integer formats (eg, INT8/INT4) used …

Motion-aware mask feature reconstruction for skeleton-based action recognition

X Zhu, X Shu, J Tang - … on Circuits and Systems for Video …, 2024 - ieeexplore.ieee.org
Despite recent advancements in masked skeleton modeling and visual-language pre-
training, no method has yet been proposed to explore capturing and utilizing the rich …

Leveraging the Large Language Model for Activity Recognition: A Comprehensive Review

MN Shoumi, S Inoue - International Journal of Activity and Behavior …, 2024 - jstage.jst.go.jp
In this paper, we are using comprehensively review the ways in which Large Language
Models (LLMs) advance activity recognition systems, discuss the challenges of …

Vision-language meets the skeleton: Progressively distillation with cross-modal knowledge for 3d action representation learning

Y Chen, T He, J Fu, L Wang, J Guo… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Skeleton-based action representation learning aims to interpret and understand human
behaviors by encoding the skeleton sequences, which can be categorized into two primary …

A review of deep learning-based approaches to sign language processing

S Tan, N Khan, Z An, Y Ando, R Kawakami… - Advanced …, 2024 - Taylor & Francis
Technology to support human communication by sign language may address a growing
social need and is interesting from an engineering perspective, considering multimodal …