Can't make an Omelette without Breaking some Eggs: Plausible Action Anticipation using Large Video-Language Models

H Mittal, N Agarwal, SY Lo… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
We introduce PlausiVL, a large video-language model for anticipating action sequences that
are plausible in the real-world. While significant efforts have been made towards anticipating …

AntGPT: Can large language models help long-term action anticipation from videos?

Q Zhao, S Wang, C Zhang, C Fu, MQ Do… - arXiv preprint arXiv …, 2023 - arxiv.org
Can we better anticipate an actor's future actions (e.g. mix eggs) by knowing what commonly
happens after his/her current action (e.g. crack eggs)? What if we also know the longer-term …

Uncertainty-aware action decoupling transformer for action anticipation

H Guo, N Agarwal, SY Lo, K Lee… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
Human action anticipation aims at predicting what people will do in the future based on past
observations. In this paper we introduce Uncertainty-aware Action Decoupling Transformer …

Interaction region visual transformer for egocentric action anticipation

D Roy, R Rajendiran… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
Human-object interaction (HOI) and temporal dynamics along the motion paths are the most
important visual cues for egocentric action anticipation. Especially, interaction regions …

Semantically guided representation learning for action anticipation

A Diko, D Avola, B Prenkaj, F Fontana… - European Conference on …, 2024 - Springer
Action anticipation is the task of forecasting future activity from a partially observed
sequence of events. However, this task is exposed to intrinsic future uncertainty and the …

Uncertainty-boosted robust video activity anticipation

Z Qi, S Wang, W Zhang, Q Huang - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Video activity anticipation aims to predict what will happen in the future, embracing a broad
application prospect ranging from robot vision to autonomous driving. Despite the recent …

A survey on deep learning techniques for action anticipation

Z Zhong, M Martin, M Voit, J Gall, J Beyerer - arXiv preprint arXiv …, 2023 - arxiv.org
The ability to anticipate possible future human actions is essential for a wide range of
applications, including autonomous driving and human-robot interaction. Consequently …

Predicting the next action by modeling the abstract goal

D Roy, B Fernando - International Conference on Pattern Recognition, 2025 - Springer
The problem of predicting human actions from observed videos is an inherently uncertain
one. We present an action anticipation model that leverages latent goal information to …

Learnable cube-based video encryption for privacy-preserving action recognition

Y Ishikawa, M Kondo… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
With the development of cloud services and machine learning, there has been an inevitable
need to enhance privacy and security when serving video recognition models. Although …

PEAR: Phrase-based hand-object interaction anticipation

Z Zhang, H Luo, W Zhai, Y Cao, Y Kang - arXiv preprint arXiv:2407.21510, 2024 - arxiv.org
First-person hand-object interaction anticipation aims to predict the interaction process over
a forthcoming period based on current scenes and prompts. This capability is crucial for …