IKEA furniture assembly environment for long-horizon complex manipulation tasks

Y Lee, ES Hu, JJ Lim - 2021 IEEE International Conference on …, 2021 - ieeexplore.ieee.org
The IKEA Furniture Assembly Environment is one of the first benchmarks for testing and
accelerating the automation of long-horizon and hierarchical manipulation tasks. The …

HT-Step: Aligning instructional articles with how-to videos

T Afouras, E Mavroudi, T Nagarajan… - Advances in …, 2024 - proceedings.neurips.cc
We introduce HT-Step, a large-scale dataset containing temporal annotations of instructional
article steps in cooking videos. It includes 122k segment-level annotations over 20k narrated …

IndustReal: A dataset for procedure step recognition handling execution errors in egocentric videos in an industrial-like setting

TJ Schoonbeek, T Houben, H Onvlee… - Proceedings of the …, 2024 - openaccess.thecvf.com
Although action recognition for procedural tasks has received notable attention, it has a
fundamental flaw in that no measure of success for actions is provided. This limits the …

Am I done? Predicting action progress in videos

F Becattini, T Uricchio, L Seidenari, L Ballan… - ACM Transactions on …, 2020 - dl.acm.org
In this article, we deal with the problem of predicting action progress in videos. We argue
that this is an extremely important task, since it can be valuable for a wide range of …

A comprehensive survey of procedural video datasets

HL Tan, H Zhu, JH Lim, C Tan - Computer Vision and Image …, 2021 - Elsevier
Procedural knowledge is crucial for understanding and performing concrete real-world
tasks. Yet, despite the importance of procedural knowledge, research into procedural …

Symmetric sub-graph spatio-temporal graph convolution and its application in complex activity recognition

P Das, A Ortega - … 2021-2021 IEEE International Conference on …, 2021 - ieeexplore.ieee.org
Understanding complex hand actions, such as assembly tasks or kitchen activities, from
hand skeleton data is an important yet challenging task. In this paper, we analyze hand …

Human Action Anticipation: A Survey

B Lai, S Toyer, T Nagarajan, R Girdhar, S Zha… - arXiv preprint arXiv …, 2024 - arxiv.org
Predicting future human behavior is an increasingly popular topic in computer vision, driven
by the interest in applications such as autonomous vehicles, digital assistants and human …

Action Progression Networks for Temporal Action Detection in Videos

C Lu, M Mak, R Li, Z Chi, H Fu - IEEE Access, 2024 - ieeexplore.ieee.org
This study introduces an innovative Temporal Action Detection (TAD) model that is
distinguished by its lightweight structure and capability for end-to-end training, delivering …

What and how? Jointly forecasting human action and pose

Y Zhu, D Doermann, Y Zhang, Q Liu… - 2020 25th …, 2021 - ieeexplore.ieee.org
Forecasting human actions and motion trajectories addresses the problem of predicting what a
person is going to do next and how they will perform it. This is crucial in a wide range of …

Learning representations for predicting future activities

M Zolfaghari, Ö Çiçek, SM Ali, F Mahdisoltani… - arXiv preprint arXiv …, 2019 - arxiv.org
Foreseeing the future is one of the key factors of intelligence. It involves understanding of the
past and current environment as well as decent experience of its possible dynamics. In this …