Genesis-v2: Inferring unordered object representations without iterative refinement

M Engelcke, O Parker Jones… - Advances in Neural …, 2021 - proceedings.neurips.cc
Advances in unsupervised learning of object-representations have culminated in the
development of a broad range of methods for unsupervised object segmentation and …

Provably learning object-centric representations

J Brady, RS Zimmermann, Y Sharma… - International …, 2023 - proceedings.mlr.press
Learning structured representations of the visual world in terms of objects promises to
significantly improve the generalization abilities of current machine learning models. While …

Slot-vae: Object-centric scene generation with slot attention

Y Wang, L Liu, J Dauwels - International Conference on …, 2023 - proceedings.mlr.press
Slot attention has shown remarkable object-centric representation learning performance in
computer vision tasks without requiring any supervision. Despite its object-centric binding …

Recasting generic pretrained vision transformers as object-centric scene encoders for manipulation policies

J Qian, A Panagopoulos… - 2024 IEEE International …, 2024 - ieeexplore.ieee.org
Generic re-usable pre-trained image representation encoders have become a standard
component of methods for many computer vision tasks. As visual representations for robots …

Test-time adaptation with slot-centric models

M Prabhudesai, A Goyal, S Paul… - International …, 2023 - proceedings.mlr.press
Current visual detectors, though impressive within their training distribution, often fail to
parse out-of-distribution scenes into their constituent entities. Recent test-time adaptation …

Composing Pre-Trained Object-Centric Representations for Robotics From" What" and" Where" Foundation Models

J Shi, J Qian, YJ Ma… - 2024 IEEE International …, 2024 - ieeexplore.ieee.org
There have recently been large advances both in pre-training visual representations for
robotic control and segmenting unknown category objects in general images. To leverage …

[PDF][PDF] Plug-and-play object-centric representations from “what” and “where” foundation models

J Shi, J Qian, YJ Ma, D Jayaraman - ICRA, 2024 - mit-spark.github.io
There have recently been large advances in the problem of segmenting unknown category
objects in general images. To leverage these for improved robot learning, we propose a new …

Discovering deformable keypoint pyramids

J Qian, A Panagopoulos, D Jayaraman - European Conference on …, 2022 - Springer
The locations of objects and their associated landmark keypoints can serve as versatile and
semantically meaningful image representations. In natural scenes, these keypoints are often …

Variational Inference for Scalable 3D Object-centric Learning

T Wang, KS Ng, M Liu - arxiv preprint arxiv:2309.14010, 2023 - arxiv.org
We tackle the task of scalable unsupervised object-centric representation learning on 3D
scenes. Existing approaches to object-centric representation learning show limitations in …