- Academic Search

J Liang, Y Cui, Q Wang, T Geng… - Advances in neural …, 2023 - proceedings.neurips.cc

This paper presents ClusterFormer, a universal vision model that is based on the Clustering
paradigm with TransFormer. It comprises two novel designs: 1) recurrent cross-attention …

Save Cite Cited by 59 Related articles All 5 versions Free GPT-4 DeepSeek View as HTML

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Learning equivariant segmentation with instance-unique querying

W Wang, J Liang, D Liu - Advances in Neural Information …, 2022 - proceedings.neurips.cc

Prevalent state-of-the-art instance segmentation methods fall into a query-based scheme, in
which instance masks are derived by querying the image feature using a set of instance …

Save Cite Cited by 93 Related articles All 5 versions Free GPT-4 DeepSeek View as HTML

[Free GPT-4]
[DeepSeek]

[PDF] thecvf.com

Tf-blender: Temporal feature blender for video object detection

Y Cui, L Yan, Z Cao, D Liu - Proceedings of the IEEE/CVF …, 2021 - openaccess.thecvf.com

Video objection detection is a challenging task because isolated video frames may
encounter appearance deterioration, which introduces great confusion for detection. One of …

Save Cite Cited by 194 Related articles All 6 versions Free GPT-4 DeepSeek View as HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Clustseg: Clustering for universal segmentation

J Liang, T Zhou, D Liu, W Wang - arxiv preprint arxiv:2305.02187, 2023 - arxiv.org

We present CLUSTSEG, a general, transformer-based framework that tackles different
image segmentation tasks (ie, superpixel, semantic, instance, and panoptic) through a …

Save Cite Cited by 93 Related articles All 5 versions Free GPT-4 DeepSeek View as HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Physical attack on monocular depth estimation with optimal adversarial patches

Z Cheng, J Liang, H Choi, G Tao, Z Cao, D Liu… - European conference on …, 2022 - Springer

Deep learning has substantially boosted the performance of Monocular Depth Estimation
(MDE), a critical component in fully vision-based autonomous driving (AD) systems (eg …

Save Cite Cited by 113 Related articles All 9 versions Free GPT-4 DeepSeek

[Free GPT-4]
[DeepSeek]

[PDF] thecvf.com

Eigenplaces: Training viewpoint robust models for visual place recognition

G Berton, G Trivigno, B Caputo… - Proceedings of the …, 2023 - openaccess.thecvf.com

Abstract Visual Place Recognition is a task that aims to predict the place of an image (called
query) based solely on its visual features. This is typically done through image retrieval …

Save Cite Cited by 64 Related articles All 7 versions Free GPT-4 DeepSeek View as HTML

Deep unsupervised part-whole relational visual saliency

Y Liu, X Dong, D Zhang, S Xu - Neurocomputing, 2024 - Elsevier

Abstract Deep Supervised Salient Object Detection (SSOD) excessively relies on large-
scale annotated pixel-level labels which consume intensive labour acquiring high quality …

Save Cite Cited by 46 Related articles All 2 versions Free GPT-4 DeepSeek

[Free GPT-4]
[DeepSeek]

[PDF] ijcai.org

[PDF][PDF] Where is your place, visual place recognition?

S Garg, T Fischer, M Milford - IJCAI, 2021 - ijcai.org

Abstract Visual Place Recognition (VPR) is often characterized as being able to recognize
the same place despite significant changes in appearance and viewpoint. VPR is a key …

Save Cite Cited by 132 Related articles All 6 versions Free GPT-4 DeepSeek View as HTML

[Free GPT-4]
[DeepSeek]

[PDF] thecvf.com

Deep visual geo-localization benchmark

G Berton, R Mereu, G Trivigno… - Proceedings of the …, 2022 - openaccess.thecvf.com

In this paper, we propose a new open-source benchmarking framework for Visual Geo-
localization (VG) that allows to build, train, and test a wide range of commonly used …

Save Cite Cited by 85 Related articles All 7 versions Free GPT-4 DeepSeek View as HTML

A survey on map-based localization techniques for autonomous vehicles

A Chalvatzaras, I Pratikakis… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org

Autonomous vehicles integrate complex software stacks for realizing the necessary iterative
perception, planning, and action operations. One of the foundational layers of such stacks is …

Save Cite Cited by 98 Related articles All 2 versions Free GPT-4 DeepSeek

Create alert

Cite

Advanced search

Saved to My library

Densernet: Weakly supervised visual localization using multi-scale feature aggregation

Clusterfomer: clustering as a universal visual learner

Learning equivariant segmentation with instance-unique querying

Tf-blender: Temporal feature blender for video object detection

Clustseg: Clustering for universal segmentation

Physical attack on monocular depth estimation with optimal adversarial patches

Eigenplaces: Training viewpoint robust models for visual place recognition

Deep unsupervised part-whole relational visual saliency

[PDF][PDF] Where is your place, visual place recognition?

Deep visual geo-localization benchmark

A survey on map-based localization techniques for autonomous vehicles