Dynamic and static mutual fitting for action recognition
Action recognition is intended to classify a video into a certain category by aggregating and
summarizing its temporal and spatial information. Existing methods have achieved …
summarizing its temporal and spatial information. Existing methods have achieved …
Uncovering the Unseen: Discover Hidden Intentions by Micro-Behavior Graph Reasoning
This paper introduces a new and challenging Hidden Intention Discovery (HID) task. Unlike
existing intention recognition tasks, which are based on obvious visual representations to …
existing intention recognition tasks, which are based on obvious visual representations to …
Fragrant: frequency-auxiliary guided relational attention network for low-light action recognition
Video action recognition aims to classify actions within sequences of video frames, which
has important applications in computer vision fields. Existing methods have shown …
has important applications in computer vision fields. Existing methods have shown …
Bi-Causal: Group Activity Recognition via Bidirectional Causality
Abstract Current approaches in Group Activity Recognition (GAR) predominantly emphasize
Human Relations (HRs) while often neglecting the impact of Human-Object Interactions …
Human Relations (HRs) while often neglecting the impact of Human-Object Interactions …
Token-disentangling Mutual Transformer for multimodal emotion recognition
Multimodal emotion recognition presents a complex challenge, as it involves the
identification of human emotions using various modalities such as video, text, and audio …
identification of human emotions using various modalities such as video, text, and audio …
A lightweight CNN based information fusion for image denoising
Q Zhang, S **e, L Ji - Multimedia Tools and Applications, 2024 - Springer
Deep convolutional neural networks (CNNs) with strong learning abilities have obtained
good results for image denoising. However, the CNNs for image denoising have …
good results for image denoising. However, the CNNs for image denoising have …
TSHNN: Temporal-Spatial Hybrid Neural Network for Cognitive Wireless Human Activity Recognition
WiFi-based human activity recognition has gained ever-growing attention in the field of
wireless sensor networks. As a promising technology, it has large application potential for …
wireless sensor networks. As a promising technology, it has large application potential for …
See What You Seek: Semantic Contextual Integration for Cloth-Changing Person Re-Identification
Cloth-changing person re-identification (CC-ReID) aims to match individuals across multiple
surveillance cameras despite variations in clothing. Existing methods typically focus on …
surveillance cameras despite variations in clothing. Existing methods typically focus on …
Weakly Supervised Video Re-localization Through Multi-agent-reinforced Switchable Network
The objective of video re-localization (VRL) is to localize a successive sequence of frames,
namely, the target moment, from untrimmed reference videos that semantically correspond …
namely, the target moment, from untrimmed reference videos that semantically correspond …
Converting Artificial Neural Networks to Ultra-Low-Latency Spiking Neural Networks for Action Recognition
Spiking neural networks (SNNs) have garnered significant attention for their potential in ultra-
low-power event-driven neuromorphic hardware implementations. One effective strategy for …
low-power event-driven neuromorphic hardware implementations. One effective strategy for …