Object-centric slot diffusion

J Jiang, F Deng, G Singh, S Ahn - arxiv preprint arxiv:2303.10834, 2023 - arxiv.org
The recent success of transformer-based image generative models in object-centric learning
highlights the importance of powerful image generators for handling complex scenes …

Slotdiffusion: Object-centric generative modeling with diffusion models

Z Wu, J Hu, W Lu, I Gilitschenski… - Advances in Neural …, 2023 - proceedings.neurips.cc
Object-centric learning aims to represent visual data with a set of object entities (aka slots),
providing structured representations that enable systematic generalization. Leveraging …

Zero-shot object-centric representation learning

A Didolkar, A Zadaianchuk, A Goyal, M Mozer… - arxiv preprint arxiv …, 2024 - arxiv.org
The goal of object-centric representation learning is to decompose visual scenes into a
structured representation that isolates the entities. Recent successes have shown that object …

SPOT: Self-Training with Patch-Order Permutation for Object-Centric Learning with Autoregressive Transformers

I Kakogeorgiou, S Gidaris… - Proceedings of the …, 2024 - openaccess.thecvf.com
Unsupervised object-centric learning aims to decompose scenes into interpretable object
entities termed slots. Slot-based auto-encoders stand out as a prominent method for this …

Betrayed by attention: A simple yet effective approach for self-supervised video object segmentation

S Ding, R Qian, H Xu, D Lin, H **ong - European Conference on Computer …, 2024 - Springer
In this paper, we propose a simple yet effective approach for self-supervised video object
segmentation (VOS). Previous self-supervised VOS techniques majorly resort to auxiliary …

SlotLifter: Slot-Guided Feature Lifting for Learning Object-Centric Radiance Fields

Y Liu, B Jia, Y Chen, S Huang - European Conference on Computer …, 2024 - Springer
The ability to distill object-centric abstractions from intricate visual scenes underpins human-
level generalization. Despite the significant progress in object-centric learning methods …

DIOD: Self-Distillation Meets Object Discovery

S Kara, H Ammar, J Denize… - Proceedings of the …, 2024 - openaccess.thecvf.com
Instance segmentation demands substantial labeling resources. This has prompted
increased interest to explore the object discovery task as an unsupervised alternative. In …

Next state prediction gives rise to entangled, yet compositional representations of objects

T Saanum, LMS Buschoff, P Dayan… - arxiv preprint arxiv …, 2024 - arxiv.org
Compositional representations are thought to enable humans to generalize across
combinatorially vast state spaces. Models with learnable object slots, which encode …

Object-Centric Temporal Consistency via Conditional Autoregressive Inductive Biases

C Meo, A Nakano, M Lică, A Didolkar, M Suzuki… - arxiv preprint arxiv …, 2024 - arxiv.org
Unsupervised object-centric learning from videos is a promising approach towards learning
compositional representations that can be applied to various downstream tasks, such as …

Parallelized Spatiotemporal Slot Binding for Videos

G Singh, Y Wang, J Yang, B Ivanovic, S Ahn… - … on Machine Learning, 2024 - openreview.net
While modern best practices advocate for scalable architectures that support long-range
interactions, object-centric models are yet to fully embrace these architectures. In particular …