A review of multimodal human activity recognition with special emphasis on classification, applications, challenges and future directions

SK Yadav, K Tiwari, HM Pandey, SA Akbar - Knowledge-Based Systems, 2021 - Elsevier
Human activity recognition (HAR) is one of the most important and challenging problems in
the computer vision. It has critical application in wide variety of tasks including gaming …

Ego4d: Around the world in 3,000 hours of egocentric video

K Grauman, A Westbury, E Byrne… - Proceedings of the …, 2022 - openaccess.thecvf.com
We introduce Ego4D, a massive-scale egocentric video dataset and benchmark suite. It
offers 3,670 hours of daily-life activity video spanning hundreds of scenarios (household …

[HTML][HTML] Recognition of activities of daily living with egocentric vision: A review

THC Nguyen, JC Nebel, F Florez-Revuelta - Sensors, 2016 - mdpi.com
Video-based recognition of activities of daily living (ADLs) is being used in ambient assisted
living systems in order to support the independent living of older people. However, current …

Egoexolearn: A dataset for bridging asynchronous ego-and exo-centric view of procedural activities in real world

Y Huang, G Chen, J Xu, M Zhang… - Proceedings of the …, 2024 - openaccess.thecvf.com
Being able to map the activities of others into one's own point of view is one fundamental
human skill even from a very early age. Taking a step toward understanding this human …

H2o: Two hands manipulating objects for first person interaction recognition

T Kwon, B Tekin, J Stühmer, F Bogo… - Proceedings of the …, 2021 - openaccess.thecvf.com
We present a comprehensive framework for egocentric interaction recognition using
markerless 3D annotations of two hands manipulating objects. To this end, we propose a …

Advancing high fidelity identity swap** for forgery detection

L Li, J Bao, H Yang, D Chen… - Proceedings of the IEEE …, 2020 - openaccess.thecvf.com
In this work, we study various existing benchmarks for deepfake detection researches. In
particular, we examine a novel two-stage face swap** algorithm, called FaceShifter, for …

In the eye of beholder: Joint learning of gaze and actions in first person video

Y Li, M Liu, JM Rehg - Proceedings of the European …, 2018 - openaccess.thecvf.com
We address the task of jointly determining what a person is doing and where they are
looking based on the analysis of video captured by a headworn camera. We propose a …

Gaze prediction in dynamic 360 immersive videos

Y Xu, Y Dong, J Wu, Z Sun, Z Shi… - proceedings of the …, 2018 - openaccess.thecvf.com
This paper explores gaze prediction in dynamic $360^ circ $ immersive videos, emph {ie},
based on the history scan path and VR contents, we predict where a viewer will look at an …

A survey on activity detection and classification using wearable sensors

M Cornacchia, K Ozcan, Y Zheng… - IEEE Sensors …, 2016 - ieeexplore.ieee.org
Activity detection and classification are very important for autonomous monitoring of humans
for applications, including assistive living, rehabilitation, and surveillance. Wearable sensors …