Ganhand: Predicting human grasp affordances in multi-object scenes

E Corona, A Pumarola, G Alenya… - Proceedings of the …, 2020 - openaccess.thecvf.com
The rise of deep learning has brought remarkable progress in estimating hand geometry
from images where the hands are part of the scene. This paper focuses on a new problem …

Naq: Leveraging narrations as queries to supervise episodic memory

SK Ramakrishnan, Z Al-Halah… - Proceedings of the …, 2023 - openaccess.thecvf.com
Searching long egocentric videos with natural language queries (NLQ) has compelling
applications in augmented reality and robotics, where a fluid index into everything that a …

Learning state-aware visual representations from audible interactions

H Mittal, P Morgado, U Jain… - Advances in Neural …, 2022 - proceedings.neurips.cc
We propose a self-supervised algorithm to learn representations from egocentric video data.
Recently, significant efforts have been made to capture humans interacting with their own …

Spotem: Efficient video search for episodic memory

SK Ramakrishnan, Z Al-Halah… - … on Machine Learning, 2023 - proceedings.mlr.press
The goal in episodic memory (EM) is to search a long egocentric video to answer a natural
language query (eg,“where did I leave my purse?”). Existing EM methods exhaustively …

Hands holding clues for object recognition in teachable machines

K Lee, H Kacorri - Proceedings of the 2019 CHI conference on human …, 2019 - dl.acm.org
Camera manipulation confounds the use of object recognition applications by blind people.
This is exacerbated when photos from this population are also used to train models, as with …

Crowdsourcing the perception of machine teaching

J Hong, K Lee, J Xu, H Kacorri - … of the 2020 CHI Conference on Human …, 2020 - dl.acm.org
Teachable interfaces can empower end-users to attune machine learning systems to their
idiosyncratic characteristics and environment by explicitly providing pertinent training …

[PDF][PDF] Hand--object interaction recognition based on visual attention using multiscopic cyber-physical-social system.

ARA Besari, AA Saputra, WH Chin… - International Journal of …, 2023 - core.ac.uk
Rehabilitation is required for patients recovering from neurological illnesses, particularly
hand stroke. However, roughly two-thirds of hand stroke survivors have visual impairments …

Grasp-type recognition leveraging object affordance

N Wake, K Sasabuchi, K Ikeuchi - ar**
object intuitively. First, we propose a system based on the stereo infra-red image as a sensor …

Transferring the semantic constraints in human manipulation behaviors to robots

C Li, G Tian - Applied Intelligence, 2020 - Springer
In this study, we aim to help robots manipulate objects with the guidance of semantic
constraints (the grasp location, grasp type, approaching way, trajectory constraint, grasp …