Lisa: Reasoning segmentation via large language model

X Lai, Z Tian, Y Chen, Y Li, Y Yuan… - Proceedings of the …, 2024 - openaccess.thecvf.com
Although perception systems have made remarkable advancements in recent years they still
rely on explicit human instruction or pre-defined categories to identify the target objects …

Spherical transformer for lidar-based 3d recognition

X Lai, Y Chen, F Lu, J Liu, J Jia - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
LiDAR-based 3D point cloud recognition has benefited various applications. Without
specially considering the LiDAR point distribution, most current methods suffer from …

Hierarchical dense correlation distillation for few-shot segmentation

B Peng, Z Tian, X Wu, C Wang… - Proceedings of the …, 2023 - openaccess.thecvf.com
Few-shot semantic segmentation (FSS) aims to form class-agnostic models segmenting
unseen classes with only a handful of annotations. Previous methods limited to the semantic …

Oa-cnns: Omni-adaptive sparse cnns for 3d semantic segmentation

B Peng, X Wu, L Jiang, Y Chen… - Proceedings of the …, 2024 - openaccess.thecvf.com
The booming of 3D recognition in the 2020s began with the introduction of point cloud
transformers. They quickly overwhelmed sparse CNNs and became state-of-the-art models …

When object detection meets knowledge distillation: A survey

Z Li, P Xu, X Chang, L Yang, Y Zhang… - … on Pattern Analysis …, 2023 - ieeexplore.ieee.org
Object detection (OD) is a crucial computer vision task that has seen the development of
many algorithms and models over the years. While the performance of current OD models …

Unified language-driven zero-shot domain adaptation

S Yang, Z Tian, L Jiang, J Jia - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
Abstract This paper introduces Unified Language-driven Zero-shot Domain Adaptation
(ULDA) a novel task setting that enables a single model to adapt to diverse target domains …

Decoupled kullback-leibler divergence loss

J Cui, Z Tian, Z Zhong, X Qi, B Yu… - Advances in Neural …, 2025 - proceedings.neurips.cc
In this paper, we delve deeper into the Kullback–Leibler (KL) Divergence loss and
mathematically prove that it is equivalent to the Decoupled Kullback-Leibler (DKL) …

Removing anomalies as noises for industrial defect localization

F Lu, X Yao, CW Fu, J Jia - Proceedings of the IEEE/CVF …, 2023 - openaccess.thecvf.com
Unsupervised anomaly detection aims to train models with only anomaly-free images to
detect and localize unseen anomalies. Previous reconstruction-based methods have been …

Understanding imbalanced semantic segmentation through neural collapse

Z Zhong, J Cui, Y Yang, X Wu, X Qi… - Proceedings of the …, 2023 - openaccess.thecvf.com
A recent study has shown a phenomenon called neural collapse in that the within-class
means of features and the classifier weight vectors converge to the vertices of a simplex …

See say and segment: Teaching lmms to overcome false premises

TH Wu, G Biamby, D Chan, L Dunlap… - Proceedings of the …, 2024 - openaccess.thecvf.com
Abstract Current open-source Large Multimodal Models (LMMs) excel at tasks such as open-
vocabulary language grounding and segmentation but can suffer under false premises …