Vectorized evidential learning for weakly-supervised temporal action localization

J Gao, M Chen, C Xu - IEEE transactions on pattern analysis …, 2023 - ieeexplore.ieee.org
With the explosive growth of videos, weakly-supervised temporal action localization (WS-
TAL) task has become a promising research direction in pattern analysis and machine …

Taco: Benchmarking generalizable bimanual tool-action-object understanding

Y Liu, H Yang, X Si, L Liu, Z Li… - Proceedings of the …, 2024 - openaccess.thecvf.com
Humans commonly work with multiple objects in daily life and can intuitively transfer
manipulation skills to novel objects by understanding object functional regularities. However …

Motion stimulation for compositional action recognition

L Ma, Y Zheng, Z Zhang, Y Yao… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Recognizing the unseen combinations of action and different objects, namely (zero-shot)
compositional action recognition, is extremely challenging for conventional action …

Chop & learn: Recognizing and generating object-state compositions

N Saini, H Wang, A Swaminathan… - Proceedings of the …, 2023 - openaccess.thecvf.com
Recognizing and generating object-state compositions has been a challenging task,
especially when generalizing to unseen compositions. In this paper, we study the task of …

Improving the generalization of MAML in few-shot classification via bi-level constraint

Y Shao, W Wu, X You, C Gao… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Few-shot classification (FSC), which aims to identify novel classes in the presence of a few
labeled samples, has drawn vast attention in recent years. One of the representative few …

C2c: Component-to-composition learning for zero-shot compositional action recognition

R Li, Z Feng, T Xu, L Li, XJ Wu, M Awais, S Atito… - … on Computer Vision, 2024 - Springer
Compositional actions consist of dynamic (verbs) and static (objects) concepts. Humans can
easily recognize unseen compositions using the learned concepts. For machines, solving …

Hade: Exploiting human action recognition through fine-tuned deep learning methods

M Karim, S Khalid, A Aleryani, N Tairan, Z Ali… - IEEE Access, 2024 - ieeexplore.ieee.org
Human Action Recognition (HAR) is a vital area of computer vision with diverse applications
in security, healthcare, and human-computer interaction. Addressing the challenges of HAR …

Appearance-agnostic representation learning for compositional action recognition

P Huang, X Shu, R Yan, Z Tu… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
The discussion of compositional generalization in action recognition, ie., Compositional
Action Recognition (CAR), has recently received increasing attention. CAR challenges …

GPT4Ego: unleashing the potential of pre-trained models for zero-shot egocentric action recognition

G Dai, X Shu, W Wu, R Yan… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Vision-Language Models (VLMs), pre-trained on large-scale datasets, have shown
impressive performance in various visual recognition tasks. This advancement paves the …

MUP: multi-granularity unified perception for panoramic activity recognition

M Cao, R Yan, X Shu, J Zhang, J Wang… - Proceedings of the 31st …, 2023 - dl.acm.org
Panoramic activity recognition is required to jointly identify multi-granularity human
behaviors including individual actions, group activities, and global activities in multi-person …