Human action recognition from various data modalities: A review

Z Sun, Q Ke, H Rahmani, M Bennamoun… - IEEE transactions on …, 2022 - ieeexplore.ieee.org
Human Action Recognition (HAR) aims to understand human behavior and assign a label to
each action. It has a wide range of applications, and therefore has been attracting increasing …

A survey on video-based human action recognition: recent updates, datasets, challenges, and applications

P Pareek, A Thakkar - Artificial Intelligence Review, 2021 - Springer
Abstract Human Action Recognition (HAR) involves human activity monitoring task in
different areas of medical, education, entertainment, visual surveillance, video retrieval, as …

Memvit: Memory-augmented multiscale vision transformer for efficient long-term video recognition

CY Wu, Y Li, K Mangalam, H Fan… - Proceedings of the …, 2022 - openaccess.thecvf.com
While today's video recognition systems parse snapshots or short clips accurately, they
cannot connect the dots and reason across a longer range of time yet. Most existing video …

Vision-based human activity recognition: a survey

DR Beddiar, B Nini, M Sabokrou, A Hadid - Multimedia Tools and …, 2020 - Springer
Human activity recognition (HAR) systems attempt to automatically identify and analyze
human activities using acquired information from various types of sensors. Although several …

Autoad ii: The sequel-who, when, and what in movie audio description

T Han, M Bain, A Nagrani, G Varol… - Proceedings of the …, 2023 - openaccess.thecvf.com
Audio Description (AD) is the task of generating descriptions of visual content, at suitable
time intervals, for the benefit of visually impaired audiences. For movies, this presents …

A multimodal approach for human activity recognition based on skeleton and RGB data

A Franco, A Magnani, D Maio - Pattern Recognition Letters, 2020 - Elsevier
Human action recognition plays a fundamental role in the design of smart solution for home
environments, particularly in relation to ambient assisted living applications, where the …

Deep learning for deepfakes creation and detection: A survey

TT Nguyen, QVH Nguyen, DT Nguyen… - Computer Vision and …, 2022 - Elsevier
Deep learning has been successfully applied to solve various complex problems ranging
from big data analytics to computer vision and human-level control. Deep learning advances …

Slowfast networks for video recognition

C Feichtenhofer, H Fan, J Malik… - Proceedings of the IEEE …, 2019 - openaccess.thecvf.com
We present SlowFast networks for video recognition. Our model involves (i) a Slow pathway,
operating at low frame rate, to capture spatial semantics, and (ii) a Fast pathway, operating …

Self-supervised video representation learning by pace prediction

J Wang, J Jiao, YH Liu - Computer Vision–ECCV 2020: 16th European …, 2020 - Springer
This paper addresses the problem of self-supervised video representation learning from a
new perspective–by video pace prediction. It stems from the observation that human visual …

Word-level deep sign language recognition from video: A new large-scale dataset and methods comparison

D Li, C Rodriguez, X Yu, H Li - Proceedings of the IEEE …, 2020 - openaccess.thecvf.com
Vision-based sign language recognition aims at hel** the hearing-impaired people to
communicate with others. However, most existing sign language datasets are limited to a …