- Academic Search

Z Sun, Q Ke, H Rahmani, M Bennamoun… - IEEE transactions on …, 2022 - ieeexplore.ieee.org

Human Action Recognition (HAR) aims to understand human behavior and assign a label to
each action. It has a wide range of applications, and therefore has been attracting increasing …

Cited by 639

Verbs in action: Improving verb understanding in video-language models

L Momeni, M Caron, A Nagrani… - Proceedings of the …, 2023 - openaccess.thecvf.com

Understanding verbs is crucial to modelling how people and objects interact with each other
and the environment through space and time. Recently, state-of-the-art video-language …

Cited by 72

Vectorized evidential learning for weakly-supervised temporal action localization

J Gao, M Chen, C Xu - IEEE transactions on pattern analysis …, 2023 - ieeexplore.ieee.org

With the explosive growth of videos, weakly-supervised temporal action localization (WS-
TAL) task has become a promising research direction in pattern analysis and machine …

Cited by 51

Learning state-aware visual representations from audible interactions

H Mittal, P Morgado, U Jain… - Advances in Neural …, 2022 - proceedings.neurips.cc

We propose a self-supervised algorithm to learn representations from egocentric video data.
Recently, significant efforts have been made to capture humans interacting with their own …

Cited by 32

Multi-task learning of object states and state-modifying actions from web videos

T Souček, JB Alayrac, A Miech, I Laptev… - IEEE Transactions on …, 2024 - computer.org

We aim to learn to temporally localize object state changes and the corresponding state-
modifying actions by observing people interacting with objects in long uncurated web …

Cited by 6

Learning action changes by measuring verb-adverb textual relationships

D Moltisanti, F Keller, H Bilen… - Proceedings of the …, 2023 - openaccess.thecvf.com

The goal of this work is to understand the way actions are performed in videos. That is, given
a video, we aim to predict an adverb indicating a modification applied to the action (e.g. cut" …

Cited by 8

Multi-task learning of object state changes from uncurated videos

T Souček, JB Alayrac, A Miech, I Laptev… - arXiv preprint arXiv …, 2022 - arxiv.org

We aim to learn to temporally localize object state changes and the corresponding state-
modifying actions by observing people interacting with objects in long uncurated web …

Cited by 9

Multi-task learning of object states and state-modifying actions from web videos

T Souček, JB Alayrac, A Miech… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org

We aim to learn to temporally localize object state changes and the corresponding state-
modifying actions by observing people interacting with objects in long uncurated web …

Cited by 4

Coarse or Fine? Recognising Action End States without Labels

D Moltisanti, H Bilen, L Sevilla-Lara… - Proceedings of the …, 2024 - openaccess.thecvf.com

We focus on the problem of recognising the end state of an action in an image which is
critical for understanding what action is performed and in which manner. We study this …

Video-adverb retrieval with compositional adverb-action embeddings

T Hummel, OB Mercea, A Koepke, Z Akata - arXiv preprint arXiv …, 2023 - arxiv.org

Retrieving adverbs that describe an action in a video poses a crucial step towards fine-
grained video understanding. We propose a framework for video-to-adverb retrieval (and …

Cited by 1

How do you do it? fine-grained action understanding with pseudo-adverbs

Human action recognition from various data modalities: A review
