Incorporating physics into data-driven computer vision

A Kadambi, C de Melo, CJ Hsieh… - Nature Machine …, 2023 - nature.com
Many computer vision techniques infer properties of our physical world from images.
Although images are formed through the physics of light and mechanics, computer vision …

Visual affordance and function understanding: A survey

M Hassanin, S Khan, M Tahtali - ACM Computing Surveys (CSUR), 2021 - dl.acm.org
Nowadays, robots are dominating the manufacturing, entertainment, and healthcare
industries. Robot vision aims to equip robots with the capabilities to discover information …

Affordances from human videos as a versatile representation for robotics

S Bahl, R Mendonca, L Chen… - Proceedings of the …, 2023 - openaccess.thecvf.com
Building a robot that can understand and learn to interact by watching humans has inspired
several vision problems. However, despite some successful results on static datasets, it …

DECO: Dense estimation of 3D human-scene contact in the wild

S Tripathi, A Chatterjee, JC Passy… - Proceedings of the …, 2023 - openaccess.thecvf.com
Understanding how humans use physical contact to interact with the world is key to enabling
human-centric artificial intelligence. While inferring 3D contact is crucial for modeling …

Populating 3D scenes by learning human-scene interaction

M Hassan, P Ghosh, J Tesch… - Proceedings of the …, 2021 - openaccess.thecvf.com
Humans live within a 3D space and constantly interact with it to perform tasks. Such
interactions involve physical contact between surfaces that is semantically meaningful. Our …

Detecting human-object contact in images

Y Chen, SK Dwivedi, MJ Black… - Proceedings of the …, 2023 - openaccess.thecvf.com
Humans constantly contact objects to move and perform tasks. Thus, detecting human-
object contact is important for building human-centered artificial intelligence. However, there …

Synthesizing long-term 3d human motion and interaction in 3d scenes

J Wang, H Xu, J Xu, S Liu… - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com
Synthesizing 3D human motion plays an important role in many graphics applications as
well as understanding human activity. While many efforts have been made on generating …

Dark, beyond deep: A paradigm shift to cognitive ai with humanlike common sense

Y Zhu, T Gao, L Fan, S Huang, M Edmonds, H Liu… - Engineering, 2020 - Elsevier
Recent progress in deep learning is essentially based on a “big data for small tasks”
paradigm, under which massive amounts of data are used to train a classifier for a single …

Human-centric indoor scene synthesis using stochastic grammar

S Qi, Y Zhu, S Huang, C Jiang… - Proceedings of the …, 2018 - openaccess.thecvf.com
We present a human-centric method to sample and synthesize 3D room layouts and 2D
images thereof, for the purpose of obtaining large-scale 2D/3D image data with the perfect …

Grounded human-object interaction hotspots from video

T Nagarajan, C Feichtenhofer… - Proceedings of the …, 2019 - openaccess.thecvf.com
Learning how to interact with objects is an important step towards embodied visual
intelligence, but existing techniques suffer from heavy supervision or sensing requirements …