Transfer learning and its extensive appositeness in human activity recognition: A survey

A Ray, MH Kolekar - Expert Systems with Applications, 2024 - Elsevier
In this competitive world, the supervision and monitoring of human resources are primary
and necessary tasks to drive context-aware applications. Advancement in sensor and …

Rethinking zero-shot video classification: End-to-end training for realistic applications

B Brattoli, J Tighe, F Zhdanov… - Proceedings of the …, 2020 - openaccess.thecvf.com
Trained on large datasets, deep learning (DL) can accurately classify videos into hundreds
of diverse classes. However, video data is expensive to annotate. Zero-shot learning (ZSL) …

Audio-visual generalised zero-shot learning with cross-modal attention and language

OB Mercea, L Riesch, A Koepke… - Proceedings of the …, 2022 - openaccess.thecvf.com
Learning to classify video data from classes not included in the training data, ie video-based
zero-shot learning, is challenging. We conjecture that the natural alignment between the …

Cross-modal representation learning for zero-shot action recognition

CC Lin, K Lin, L Wang, Z Liu… - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com
We present a cross-modal Transformer-based framework, which jointly encodes video data
and text labels for zero-shot action recognition (ZSAR). Our model employs a conceptually …

Spiking tucker fusion transformer for audio-visual zero-shot learning

W Li, P Wang, R **ong, X Fan - IEEE Transactions on Image …, 2024 - ieeexplore.ieee.org
The spiking neural networks (SNNs) that efficiently encode temporal sequences have shown
great potential in extracting audio-visual joint feature representations. However, coupling …

Motion-decoupled spiking transformer for audio-visual zero-shot learning

W Li, XL Zhao, Z Ma, X Wang, X Fan… - Proceedings of the 31st …, 2023 - dl.acm.org
Audio-visual zero-shot learning (ZSL) has attracted board attention, as it could classify video
data from classes that are not observed during training. However, most of the existing …

Zero-shot action recognition with transformer-based video semantic embedding

K Doshi, Y Yilmaz - … of the IEEE/CVF conference on …, 2023 - openaccess.thecvf.com
While video action recognition has been an active area of research for several years, zero-
shot action recognition has only recently started gaining traction. In this work, we propose a …

Actionbytes: Learning from trimmed videos to localize actions

M Jain, A Ghodrati, CGM Snoek - Proceedings of the IEEE …, 2020 - openaccess.thecvf.com
This paper tackles the problem of localizing actions in long untrimmed videos. Different from
existing works, which all use annotated untrimmed videos during training, we learn only from …

A new split for evaluating true zero-shot action recognition

SN Gowda, L Sevilla-Lara, K Kim, F Keller… - … German Conference on …, 2021 - Springer
Zero-shot action recognition is the task of classifying action categories that are not available
in the training set. In this setting, the standard evaluation protocol is to use existing action …

Zero-shot learning for action recognition using synthesized features

A Mishra, A Pandey, HA Murthy - Neurocomputing, 2020 - Elsevier
The major disadvantage of supervised methods for action recognition is the need for a large
amount of annotated data, where the data is matched to its label accurately. To address this …