Human action recognition from various data modalities: A review
Human Action Recognition (HAR) aims to understand human behavior and assign a label to
each action. It has a wide range of applications, and therefore has been attracting increasing …
each action. It has a wide range of applications, and therefore has been attracting increasing …
A survey on video-based human action recognition: recent updates, datasets, challenges, and applications
Abstract Human Action Recognition (HAR) involves human activity monitoring task in
different areas of medical, education, entertainment, visual surveillance, video retrieval, as …
different areas of medical, education, entertainment, visual surveillance, video retrieval, as …
Memvit: Memory-augmented multiscale vision transformer for efficient long-term video recognition
While today's video recognition systems parse snapshots or short clips accurately, they
cannot connect the dots and reason across a longer range of time yet. Most existing video …
cannot connect the dots and reason across a longer range of time yet. Most existing video …
Tdn: Temporal difference networks for efficient action recognition
Temporal modeling still remains challenging for action recognition in videos. To mitigate this
issue, this paper presents a new video architecture, termed as Temporal Difference Network …
issue, this paper presents a new video architecture, termed as Temporal Difference Network …
Self-supervised visual feature learning with deep neural networks: A survey
Large-scale labeled data are generally required to train deep neural networks in order to
obtain better performance in visual feature learning from images or videos for computer …
obtain better performance in visual feature learning from images or videos for computer …
Bmn: Boundary-matching network for temporal action proposal generation
Temporal action proposal generation is an challenging and promising task which aims to
locate temporal regions in real-world videos where action or event may occur. Current …
locate temporal regions in real-world videos where action or event may occur. Current …
Temporal pyramid network for action recognition
Visual tempo characterizes the dynamics and the temporal scale of an action. Modeling
such visual tempos of different actions facilitates their recognition. Previous works often …
such visual tempos of different actions facilitates their recognition. Previous works often …
Word-level deep sign language recognition from video: A new large-scale dataset and methods comparison
Vision-based sign language recognition aims at hel** the hearing-impaired people to
communicate with others. However, most existing sign language datasets are limited to a …
communicate with others. However, most existing sign language datasets are limited to a …
Deep learning for spatio-temporal data mining: A survey
With the fast development of various positioning techniques such as Global Position System
(GPS), mobile devices and remote sensing, spatio-temporal data has become increasingly …
(GPS), mobile devices and remote sensing, spatio-temporal data has become increasingly …
Tcgl: Temporal contrastive graph for self-supervised video representation learning
Video self-supervised learning is a challenging task, which requires significant expressive
power from the model to leverage rich spatial-temporal knowledge and generate effective …
power from the model to leverage rich spatial-temporal knowledge and generate effective …