Foundations & trends in multimodal machine learning: Principles, challenges, and open questions

PP Liang, A Zadeh, LP Morency - ACM Computing Surveys, 2024 - dl.acm.org
Multimodal machine learning is a vibrant multi-disciplinary research field that aims to design
computer agents with intelligent capabilities such as understanding, reasoning, and learning …

Sign language recognition: A deep survey

R Rastgoo, K Kiani, S Escalera - Expert Systems with Applications, 2021 - Elsevier
Sign language, as a different form of the communication language, is important to large
groups of people in society. There are different signs in each sign language with variability …

HaGRID – HAnd Gesture Recognition Image Dataset

A Kapitanov, K Kvanchiani, A Nagaev… - Proceedings of the …, 2024 - openaccess.thecvf.com
This paper introduces an enormous dataset, HaGRID (HAnd Gesture Recognition Image
Dataset), to build a hand gesture recognition (HGR) system concentrating on interaction with …

Skeleton-based action recognition using spatio-temporal LSTM network with trust gates

J Liu, A Shahroudy, D Xu, AC Kot… - IEEE transactions on …, 2017 - ieeexplore.ieee.org
Skeleton-based human action recognition has attracted a lot of research attention during the
past few years. Recent works attempted to utilize recurrent neural networks to model the …

Modeling temporal dynamics and spatial configurations of actions using two-stream recurrent neural networks

H Wang, L Wang - Proceedings of the IEEE conference on …, 2017 - openaccess.thecvf.com
Recently, skeleton based action recognition gains more popularity due to cost-effective
depth sensors coupled with real-time skeleton estimation algorithms. Traditional approaches …

Flowing convnets for human pose estimation in videos

T Pfister, J Charles… - Proceedings of the IEEE …, 2015 - openaccess.thecvf.com
The objective of this work is human pose estimation in videos, where multiple frames are
available. We investigate a ConvNet architecture that is able to benefit from temporal context …

Modeling video evolution for action recognition

B Fernando, E Gavves, JM Oramas… - Proceedings of the …, 2015 - openaccess.thecvf.com
In this paper we present a method to capture video-wide temporal information for action
recognition. We postulate that a function capable of ordering the frames of a video …

Skeleton based action recognition with convolutional neural network

Y Du, Y Fu, L Wang - 2015 3rd IAPR Asian conference on …, 2015 - ieeexplore.ieee.org
Temporal dynamics of postures over time is crucial for sequence-based action recognition.
Human actions can be represented by the corresponding motions of articulated skeleton …

A review of state-of-the-art techniques for abnormal human activity recognition

C Dhiman, DK Vishwakarma - Engineering Applications of Artificial …, 2019 - Elsevier
The concept of intelligent visual identification of abnormal human activity has raised the
standards of surveillance systems, situation cognizance, homeland safety and smart …

Recent methods and databases in vision-based hand gesture recognition: A review

PK Pisharady, M Saerbeck - Computer Vision and Image Understanding, 2015 - Elsevier
Successful efforts in hand gesture recognition research within the last two decades paved
the path for natural human–computer interaction systems. Unresolved challenges such as …