Diversifying spatial-temporal perception for video domain generalization

KY Lin, JR Du, Y Gao, J Zhou… - Advances in Neural …, 2023 - proceedings.neurips.cc
Video domain generalization aims to learn generalizable video classification models for
unseen target domains by training in a source domain. A critical challenge of video domain …

Meccano: A multimodal egocentric dataset for humans behavior understanding in the industrial-like domain

F Ragusa, A Furnari, GM Farinella - Computer Vision and Image …, 2023 - Elsevier
Wearable cameras allow to acquire images and videos from the user's perspective. These
data can be processed to understand humans behavior. Despite human behavior analysis …

Representing videos as discriminative sub-graphs for action recognition

D Li, Z Qiu, Y Pan, T Yao, H Li… - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com
Human actions are typically of combinatorial structures or patterns, i.e., subjects, objects, plus
spatio-temporal interactions in between. Discovering such structures is therefore a …

Human–robot interaction-oriented video understanding of human actions

B Wang, F Chang, C Liu, W Wang - Engineering Applications of Artificial …, 2024 - Elsevier
This paper focuses on action recognition tasks oriented to the field of human–robot
interaction, which is one of the major challenges in the robotic video understanding field …

Optimization planning for 3d convnets

Z Qiu, T Yao, CW Ngo, T Mei - arXiv preprint arXiv:2201.04021, 2022 - arxiv.org
It is not trivial to optimally learn a 3D Convolutional Neural Networks (3D ConvNets) due to
high complexity and various options of the training scheme. The most common hand-tuning …

A Grammatical Compositional Model for Video Action Detection

Z Zhang, X Zou, J Zhou, S Zhong, Y Wu - arXiv preprint arXiv:2310.02887, 2023 - arxiv.org
Analysis of human actions in videos demands understanding complex human dynamics, as
well as the interaction between actors and context. However, these interaction relationships …

Human Action Recognition and Detection in Human-Centric Scenes

L Liu - 2021 - search.proquest.com
This thesis studies the problem of human action recognition and detection in single images,
which are important steps towards the high-level semantic understanding of human-centric …

Optimization planning for 3D ConvNets

Z Qiu, T Yao, CW Ngo, T Mei - … of the 38th International Conference on …, 2021 - ink.library.smu.edu.sg
It is not trivial to optimally learn a 3D Convolutional Neural Networks (3D ConvNets) due to
high complexity and various options of the training scheme. The most common hand-tuning …