ClusterFormer: clustering as a universal visual learner

J Liang, Y Cui, Q Wang, T Geng… - Advances in neural …, 2023 - proceedings.neurips.cc
This paper presents ClusterFormer, a universal vision model that is based on the Clustering
paradigm with TransFormer. It comprises two novel designs: 1) recurrent cross-attention …

Learning equivariant segmentation with instance-unique querying

W Wang, J Liang, D Liu - Advances in Neural Information …, 2022 - proceedings.neurips.cc
Prevalent state-of-the-art instance segmentation methods fall into a query-based scheme, in
which instance masks are derived by querying the image feature using a set of instance …

Tf-blender: Temporal feature blender for video object detection

Y Cui, L Yan, Z Cao, D Liu - Proceedings of the IEEE/CVF …, 2021 - openaccess.thecvf.com
Video object detection is a challenging task because isolated video frames may
encounter appearance deterioration, which introduces great confusion for detection. One of …

Clustseg: Clustering for universal segmentation

J Liang, T Zhou, D Liu, W Wang - arXiv preprint arXiv:2305.02187, 2023 - arxiv.org
We present CLUSTSEG, a general, transformer-based framework that tackles different
image segmentation tasks (i.e., superpixel, semantic, instance, and panoptic) through a …

Physical attack on monocular depth estimation with optimal adversarial patches

Z Cheng, J Liang, H Choi, G Tao, Z Cao, D Liu… - European conference on …, 2022 - Springer
Deep learning has substantially boosted the performance of Monocular Depth Estimation
(MDE), a critical component in fully vision-based autonomous driving (AD) systems (e.g. …

Eigenplaces: Training viewpoint robust models for visual place recognition

G Berton, G Trivigno, B Caputo… - Proceedings of the …, 2023 - openaccess.thecvf.com
Visual Place Recognition is a task that aims to predict the place of an image (called
query) based solely on its visual features. This is typically done through image retrieval …

Deep unsupervised part-whole relational visual saliency

Y Liu, X Dong, D Zhang, S Xu - Neurocomputing, 2024 - Elsevier
Deep Supervised Salient Object Detection (SSOD) excessively relies on large-scale
annotated pixel-level labels which consume intensive labour acquiring high quality …

Where is your place, visual place recognition?

S Garg, T Fischer, M Milford - IJCAI, 2021 - ijcai.org
Visual Place Recognition (VPR) is often characterized as being able to recognize
the same place despite significant changes in appearance and viewpoint. VPR is a key …

Deep visual geo-localization benchmark

G Berton, R Mereu, G Trivigno… - Proceedings of the …, 2022 - openaccess.thecvf.com
In this paper, we propose a new open-source benchmarking framework for Visual
Geo-localization (VG) that allows users to build, train, and test a wide range of commonly used …

A survey on map-based localization techniques for autonomous vehicles

A Chalvatzaras, I Pratikakis… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Autonomous vehicles integrate complex software stacks for realizing the necessary iterative
perception, planning, and action operations. One of the foundational layers of such stacks is …