Disentangled representation learning

X Wang, H Chen, Z Wu, W Zhu - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Disentangled Representation Learning (DRL) aims to learn a model capable of identifying
and disentangling the underlying factors hidden in the observable data in representation …

Unsupervised object localization: Observing the background to discover objects

O Siméoni, C Sekkat, G Puy… - Proceedings of the …, 2023 - openaccess.thecvf.com
Recent advances in self-supervised visual representation learning have paved the way for
unsupervised methods tackling tasks such as object discovery and instance segmentation …

Object-centric learning for real-world videos by predicting temporal feature similarities

A Zadaianchuk, M Seitzer… - Advances in Neural …, 2023 - proceedings.neurips.cc
Unsupervised video-based object-centric learning is a promising avenue to learn structured
representations from large, unlabeled video collections, but previous approaches have only …

Self-supervised object-centric learning for videos

G Aydemir, W **e, F Guney - Advances in Neural …, 2023 - proceedings.neurips.cc
Unsupervised multi-object segmentation has shown impressive results on images by
utilizing powerful semantics learned from self-supervised pretraining. An additional modality …

Slotdiffusion: Object-centric generative modeling with diffusion models

Z Wu, J Hu, W Lu, I Gilitschenski… - Advances in Neural …, 2023 - proceedings.neurips.cc
Object-centric learning aims to represent visual data with a set of object entities (aka slots),
providing structured representations that enable systematic generalization. Leveraging …

Provably learning object-centric representations

J Brady, RS Zimmermann, Y Sharma… - International …, 2023 - proceedings.mlr.press
Learning structured representations of the visual world in terms of objects promises to
significantly improve the generalization abilities of current machine learning models. While …

Object-centric slot diffusion

J Jiang, F Deng, G Singh, S Ahn - arxiv preprint arxiv:2303.10834, 2023 - arxiv.org
The recent success of transformer-based image generative models in object-centric learning
highlights the importance of powerful image generators for handling complex scenes …

On permutation-invariant neural networks

M Kimura, R Shimizu, Y Hirakawa, R Goto… - arxiv preprint arxiv …, 2024 - arxiv.org
Conventional machine learning algorithms have traditionally been designed under the
assumption that input data follows a vector-based format, with an emphasis on vector-centric …

Shatter and gather: Learning referring image segmentation with text supervision

D Kim, N Kim, C Lan, S Kwak - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Referring image segmentation, the task of segmenting any arbitrary entities described in free-
form texts, opens up a variety of vision applications. However, manual labeling of training …

Eagle: Eigen aggregation learning for object-centric unsupervised semantic segmentation

C Kim, W Han, D Ju, SJ Hwang - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
Semantic segmentation has innately relied on extensive pixel-level annotated data leading
to the emergence of unsupervised methodologies. Among them leveraging self-supervised …