Object detection in 20 years: A survey

Z Zou, K Chen, Z Shi, Y Guo, J Ye - Proceedings of the IEEE, 2023 - ieeexplore.ieee.org
Object detection, as one of the most fundamental and challenging problems in computer
vision, has received great attention in recent years. Over the past two decades, we have …

Change detection methods for remote sensing in the last decade: A comprehensive review

G Cheng, Y Huang, X Li, S Lyu, Z Xu, H Zhao, Q Zhao… - Remote Sensing, 2024 - mdpi.com
Change detection is an essential and widely utilized task in remote sensing that aims to
detect and analyze changes occurring in the same geographical area over time, which has …

Transformer-based visual segmentation: A survey

X Li, H Ding, H Yuan, W Zhang, J Pang… - IEEE transactions on …, 2024 - ieeexplore.ieee.org
Visual segmentation seeks to partition images, video frames, or point clouds into multiple
segments or groups. This technique has numerous real-world applications, such as …

Towards open vocabulary learning: A survey

J Wu, X Li, S Xu, H Yuan, H Ding… - … on Pattern Analysis …, 2024 - ieeexplore.ieee.org
In the field of visual scene understanding, deep neural networks have made impressive
advancements in various core tasks like segmentation, tracking, and detection. However …

OMG-LLaVA: Bridging image-level, object-level, pixel-level reasoning and understanding

T Zhang, X Li, H Fei, H Yuan, S Wu… - Advances in …, 2025 - proceedings.neurips.cc
Current universal segmentation methods demonstrate strong capabilities in pixel-level
image and video understanding. However, they lack reasoning abilities and cannot be …

Tube-Link: A flexible cross tube framework for universal video segmentation

X Li, H Yuan, W Zhang, G Cheng… - Proceedings of the …, 2023 - openaccess.thecvf.com
Video segmentation aims to segment and track every pixel in diverse scenarios accurately.
In this paper, we present Tube-Link, a versatile framework that addresses multiple core tasks …

BA-SAM: Scalable bias-mode attention mask for segment anything model

Y Song, Q Zhou, X Li, DP Fan… - Proceedings of the …, 2024 - openaccess.thecvf.com
In this paper, we address the challenge of image resolution variation for the Segment
Anything Model (SAM). SAM, known for its zero-shot generalizability, exhibits a performance …

Tracking objects as pixel-wise distributions

Z Zhao, Z Wu, Y Zhuang, B Li, J Jia - European Conference on Computer …, 2022 - Springer
Multi-object tracking (MOT) requires detecting and associating objects through frames.
Unlike tracking via detected bounding boxes or center points, we propose tracking objects …

Betrayed by captions: Joint caption grounding and generation for open vocabulary instance segmentation

J Wu, X Li, H Ding, X Li, G Cheng… - Proceedings of the …, 2023 - openaccess.thecvf.com
In this work, we focus on open vocabulary instance segmentation to expand a segmentation
model to classify and segment instance-level novel categories. Previous approaches have …

RingMo-sense: Remote sensing foundation model for spatiotemporal prediction via spatiotemporal evolution disentangling

F Yao, W Lu, H Yang, L Xu, C Liu, L Hu… - … on Geoscience and …, 2023 - ieeexplore.ieee.org
Remote sensing (RS) spatiotemporal prediction aims to infer future trends from historical
spatiotemporal data, e.g., videos and time-series images, which has a broad application …