Foundations & trends in multimodal machine learning: Principles, challenges, and open questions

PP Liang, A Zadeh, LP Morency - ACM Computing Surveys, 2024 - dl.acm.org
Multimodal machine learning is a vibrant multi-disciplinary research field that aims to design
computer agents with intelligent capabilities such as understanding, reasoning, and learning …

Digitizing intangible cultural heritage embodied: State of the art

Y Hou, S Kenderdine, D Picca, M Egloff… - Journal on Computing …, 2022 - dl.acm.org
Intangible cultural heritage (ICH) as a field of research and site for digital efforts has grown
significantly since the UNESCO 2003 Convention for the Safeguarding of Intangible …

Motionbert: A unified perspective on learning human motion representations

W Zhu, X Ma, Z Liu, L Liu, W Wu… - Proceedings of the …, 2023 - openaccess.thecvf.com
We present a unified perspective on tackling various human-centric video tasks by learning
human motion representations from large-scale and heterogeneous data resources …

Back to mlp: A simple baseline for human motion prediction

W Guo, Y Du, X Shen, V Lepetit… - Proceedings of the …, 2023 - openaccess.thecvf.com
This paper tackles the problem of human motion prediction, consisting in forecasting future
body poses from historically observed sequences. State-of-the-art approaches provide good …

Belfusion: Latent diffusion for behavior-driven human motion prediction

G Barquero, S Escalera… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Stochastic human motion prediction (HMP) has generally been tackled with generative
adversarial networks and variational autoencoders. Most prior works aim at predicting highly …

A comprehensive survey on deep learning methods in human activity recognition

M Kaseris, I Kostavelis, S Malassiotis - Machine Learning and Knowledge …, 2024 - mdpi.com
Human activity recognition (HAR) remains an essential field of research with increasing real-
world applications ranging from healthcare to industrial environments. As the volume of …

A comprehensive review of vision-based 3d reconstruction methods

L Zhou, G Wu, Y Zuo, X Chen, H Hu - Sensors, 2024 - mdpi.com
With the rapid development of 3D reconstruction, especially the emergence of algorithms
such as NeRF and 3DGS, 3D reconstruction has become a popular research topic in recent …

Flex: Full-body gras** without full-body grasps

P Tendulkar, D Surís… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Synthesizing 3D human avatars interacting realistically with a scene is an important problem
with applications in AR/VR, video games, and robotics. Towards this goal, we address the …

Unrealego: A new dataset for robust egocentric 3d human motion capture

H Akada, J Wang, S Shimada, M Takahashi… - … on Computer Vision, 2022 - Springer
We present UnrealEgo, ie, a new large-scale naturalistic dataset for egocentric 3D human
pose estimation. UnrealEgo is based on an advanced concept of eyeglasses equipped with …

Mocap everyone everywhere: Lightweight motion capture with smartwatches and a head-mounted camera

J Lee, H Joo - Proceedings of the IEEE/CVF conference on …, 2024 - openaccess.thecvf.com
We present a lightweight and affordable motion capture method based on two smartwatches
and a head-mounted camera. In contrast to the existing approaches that use six or more …