Diversifying spatial-temporal perception for video domain generalization
Video domain generalization aims to learn generalizable video classification models for
unseen target domains by training in a source domain. A critical challenge of video domain …
unseen target domains by training in a source domain. A critical challenge of video domain …
[HTML][HTML] Meccano: A multimodal egocentric dataset for humans behavior understanding in the industrial-like domain
Wearable cameras allow to acquire images and videos from the user's perspective. These
data can be processed to understand humans behavior. Despite human behavior analysis …
data can be processed to understand humans behavior. Despite human behavior analysis …
Representing videos as discriminative sub-graphs for action recognition
Human actions are typically of combinatorial structures or patterns, ie, subjects, objects, plus
spatio-temporal interactions in between. Discovering such structures is therefore a …
spatio-temporal interactions in between. Discovering such structures is therefore a …
Human–robot interaction-oriented video understanding of human actions
This paper focuses on action recognition tasks oriented to the field of human–robot
interaction, which is one of the major challenges in the robotic video understanding field …
interaction, which is one of the major challenges in the robotic video understanding field …
Optimization planning for 3d convnets
It is not trivial to optimally learn a 3D Convolutional Neural Networks (3D ConvNets) due to
high complexity and various options of the training scheme. The most common hand-tuning …
high complexity and various options of the training scheme. The most common hand-tuning …
A Grammatical Compositional Model for Video Action Detection
Analysis of human actions in videos demands understanding complex human dynamics, as
well as the interaction between actors and context. However, these interaction relationships …
well as the interaction between actors and context. However, these interaction relationships …
[BUCH][B] Human Action Recognition and Detection in Human-Centric Scenes
L Liu - 2021 - search.proquest.com
This thesis studies the problem of human action recognition and detection in single images,
which are important steps towards the high-level semantic understanding of human-centric …
which are important steps towards the high-level semantic understanding of human-centric …
[PDF][PDF] NGO, Chong-wah; and MEI, Tao. Optimization planning for 3D ConvNets.(2021)
It is not trivial to optimally learn a 3D Convolutional Neural Networks (3D ConvNets) due to
high complexity and various options of the training scheme. The most common hand-tuning …
high complexity and various options of the training scheme. The most common hand-tuning …