Human action recognition from various data modalities: A review
Human Action Recognition (HAR) aims to understand human behavior and assign a label to
each action. It has a wide range of applications, and therefore has been attracting increasing …
each action. It has a wide range of applications, and therefore has been attracting increasing …
Temporal action segmentation: An analysis of modern techniques
Temporal action segmentation (TAS) in videos aims at densely identifying video frames in
minutes-long videos with multiple action classes. As a long-range video understanding task …
minutes-long videos with multiple action classes. As a long-range video understanding task …
Bedlam: A synthetic dataset of bodies exhibiting detailed lifelike animated motion
We show, for the first time, that neural networks trained only on synthetic data achieve state-
of-the-art accuracy on the problem of 3D human pose and shape (HPS) estimation from real …
of-the-art accuracy on the problem of 3D human pose and shape (HPS) estimation from real …
Google scanned objects: A high-quality dataset of 3d scanned household items
Interactive 3D simulations have enabled break-throughs in robotics and computer vision, but
simulating the broad diversity of environments needed for deep learning requires large …
simulating the broad diversity of environments needed for deep learning requires large …
Assembly101: A large-scale multi-view video dataset for understanding procedural activities
Assembly101 is a new procedural activity dataset featuring 4321 videos of people
assembling and disassembling 101" take-apart" toy vehicles. Participants work without fixed …
assembling and disassembling 101" take-apart" toy vehicles. Participants work without fixed …
Error detection in egocentric procedural task videos
We present a new egocentric procedural error dataset containing videos with various types
of errors as well as normal videos and propose a new framework for procedural error …
of errors as well as normal videos and propose a new framework for procedural error …
Weakly-supervised action segmentation and unseen error detection in anomalous instructional videos
We present a novel method for weakly-supervised action segmentation and unseen error
detection in anomalous instructional videos. In the absence of an appropriate dataset for this …
detection in anomalous instructional videos. In the absence of an appropriate dataset for this …
Novel motion patterns matter for practical skeleton-based action recognition
Most skeleton-based action recognition methods assume that the same type of action
samples in the training set and the test set share similar motion patterns. However, action …
samples in the training set and the test set share similar motion patterns. However, action …
Learning fine-grained view-invariant representations from unpaired ego-exo videos via temporal alignment
The egocentric and exocentric viewpoints of a human activity look dramatically different, yet
invariant representations to link them are essential for many potential applications in …
invariant representations to link them are essential for many potential applications in …
Motion stimulation for compositional action recognition
Recognizing the unseen combinations of action and different objects, namely (zero-shot)
compositional action recognition, is extremely challenging for conventional action …
compositional action recognition, is extremely challenging for conventional action …