Contrastive representation learning: A framework and review

PH Le-Khac, G Healy, AF Smeaton - Ieee Access, 2020 - ieeexplore.ieee.org
Contrastive Learning has recently received interest due to its success in self-supervised
representation learning in the computer vision domain. However, the origins of Contrastive …

Human-to-robot imitation in the wild

S Bahl, A Gupta, D Pathak - arxiv preprint arxiv:2207.09450, 2022 - arxiv.org
We approach the problem of learning by watching humans in the wild. While traditional
approaches in Imitation and Reinforcement Learning are promising for learning in the real …

Space-time correspondence as a contrastive random walk

A Jabri, A Owens, A Efros - Advances in neural information …, 2020 - proceedings.neurips.cc
This paper proposes a simple self-supervised approach for learning a representation for
visual correspondence from raw video. We cast correspondence as prediction of links in a …

Language conditioned imitation learning over unstructured data

C Lynch, P Sermanet - arxiv preprint arxiv:2005.07648, 2020 - arxiv.org
Natural language is perhaps the most flexible and intuitive way for humans to communicate
tasks to a robot. Prior work in imitation learning typically requires each task be specified with …

Vid2robot: End-to-end video-conditioned policy learning with cross-attention transformers

V Jain, M Attarian, NJ Joshi, A Wahid, D Driess… - arxiv preprint arxiv …, 2024 - arxiv.org
Large-scale multi-task robotic manipulation systems often rely on text to specify the task. In
this work, we explore whether a robot can learn by observing humans. To do so, the robot …

[PDF][PDF] Grounding language in play

C Lynch, P Sermanet - arxiv preprint arxiv:2005.07648, 2020 - academia.edu
Natural language is perhaps the most versatile and intuitive way for humans to communicate
tasks to a robot. Prior work on Learning from Play (LfP)(Lynch et al., 2019) provides a simple …

Object-aware contrastive learning for debiased scene representation

S Mo, H Kang, K Sohn, CL Li… - Advances in Neural …, 2021 - proceedings.neurips.cc
Contrastive self-supervised learning has shown impressive results in learning visual
representations from unlabeled images by enforcing invariance against different data …

Interactron: Embodied adaptive object detection

K Kotar, R Mottaghi - … of the IEEE/CVF conference on …, 2022 - openaccess.thecvf.com
Over the years various methods have been proposed for the problem of object detection.
Recently, we have witnessed great strides in this domain owing to the emergence of …

Learning video-conditioned policies for unseen manipulation tasks

E Chane-Sane, C Schmid… - 2023 IEEE International …, 2023 - ieeexplore.ieee.org
The ability to specify robot commands by a non-expert user is critical for building generalist
agents capable of solving a large variety of tasks. One convenient way to specify the …

DynaMo: In-Domain Dynamics Pretraining for Visuo-Motor Control

Z Cui, H Pan, A Iyer, S Haldar… - Advances in Neural …, 2025 - proceedings.neurips.cc
Imitation learning has proven to be a powerful tool for training complex visuo-motor policies.
However, current methods often require hundreds to thousands of expert demonstrations to …