A survey of traditional and deep learning-based feature descriptors for high dimensional data in computer vision
Higher dimensional data such as video and 3D are the leading edge of multimedia retrieval
and computer vision research. In this survey, we give a comprehensive overview and key …
and computer vision research. In this survey, we give a comprehensive overview and key …
A comprehensive survey of human action recognition with spatio-temporal interest point (STIP) detector
Over the past two decades, human action recognition from video has been an important
area of research in computer vision. Its applications include surveillance systems, human …
area of research in computer vision. Its applications include surveillance systems, human …
Human action recognition and prediction: A survey
Derived from rapid advances in computer vision and machine learning, video analysis tasks
have been moving from inferring the present state to predicting the future state. Vision-based …
have been moving from inferring the present state to predicting the future state. Vision-based …
Online real-time multiple spatiotemporal action localisation and prediction
We present a deep-learning framework for real-time multiple spatio-temporal (S/T) action
localisation and classification. Current state-of-the-art approaches work offline, and are too …
localisation and classification. Current state-of-the-art approaches work offline, and are too …
Human activity prediction: Early recognition of ongoing activities from streaming videos
MS Ryoo - 2011 international conference on computer vision, 2011 - ieeexplore.ieee.org
In this paper, we present a novel approach of human activity prediction. Human activity
prediction is a probabilistic process of inferring ongoing activities from videos only …
prediction is a probabilistic process of inferring ongoing activities from videos only …
A survey on activity recognition and behavior understanding in video surveillance
This paper provides a comprehensive survey for activity recognition in video surveillance. It
starts with a description of simple and complex human activity, and various applications. The …
starts with a description of simple and complex human activity, and various applications. The …
Hierarchical long short-term concurrent memory for human interaction recognition
In this work, we aim to address the problem of human interaction recognition in videos by
exploring the long-term inter-related dynamics among multiple persons. Recently, Long …
exploring the long-term inter-related dynamics among multiple persons. Recently, Long …
Efficient feature extraction, encoding and classification for action recognition
Local video features provide state-of-the-art performance for action recognition. While the
accuracy of action recognition has been continuously improved over the recent years, the …
accuracy of action recognition has been continuously improved over the recent years, the …
Encouraging lstms to anticipate actions very early
In contrast to the widely studied problem of recognizing an action given a complete
sequence, action anticipation aims to identify the action from only partially available videos …
sequence, action anticipation aims to identify the action from only partially available videos …
Dynamic sampling networks for efficient action recognition in videos
YD Zheng, Z Liu, T Lu, L Wang - IEEE transactions on image …, 2020 - ieeexplore.ieee.org
The existing action recognition methods are mainly based on clip-level classifiers such as
two-stream CNNs or 3D CNNs, which are trained from the randomly selected clips and …
two-stream CNNs or 3D CNNs, which are trained from the randomly selected clips and …