Transformer for object re-identification: A survey

M Ye, S Chen, C Li, WS Zheng, D Crandall… - International Journal of …, 2024 - Springer
Abstract Object Re-identification (Re-ID) aims to identify specific objects across different
times and scenes, which is a widely researched task in computer vision. For a prolonged …

Clip-driven fine-grained text-image person re-identification

S Yan, N Dong, L Zhang, J Tang - IEEE Transactions on Image …, 2023 - ieeexplore.ieee.org
Text-Image Person Re-identification (TIReID) aims to retrieve the image corresponding to
the given text query from a pool of candidate images. Existing methods employ prior …

Towards cognition-augmented human-centric assembly: A visual computation perspective

J Pang, P Zheng, J Fan, T Liu - Robotics and Computer-Integrated …, 2025 - Elsevier
Human-centric assembly is emerging as a promising paradigm for achieving mass
personalization in the context of Industry 5.0, as it fully capitalizes on the advantages of …

Dynamic aggregated network for gait recognition

K Ma, Y Fu, D Zheng, C Cao, X Hu… - Proceedings of the …, 2023 - openaccess.thecvf.com
Gait recognition is beneficial for a variety of applications, including video surveillance, crime
scene investigation, and social security, to mention a few. However, gait recognition often …

PHA: Patch-wise high-frequency augmentation for transformer-based person re-identification

G Zhang, Y Zhang, T Zhang, B Li… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Although recent studies empirically show that injecting Convolutional Neural Networks
(CNNs) into Vision Transformers (ViTs) can improve the performance of person re …

Aaformer: Auto-aligned transformer for person re-identification

K Zhu, H Guo, S Zhang, Y Wang, J Liu… - … on Neural Networks …, 2023 - ieeexplore.ieee.org
In person re-identification (re-ID), extracting part-level features from person images has
been verified to be crucial to offer fine-grained information. Most of the existing CNN-based …

Dual guidance enabled fuzzy inference for enhanced fine-grained recognition

Q Chen, F He, G Wang, X Bai… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
In the field of fine-grained visual recognition (FGVR), the ability to resolve minute and often
subtle differences between highly similar object categories is paramount. The advent of …

CLIP-ReID: exploiting vision-language model for image re-identification without concrete text labels

S Li, L Sun, Q Li - Proceedings of the AAAI Conference on Artificial …, 2023 - ojs.aaai.org
Pre-trained vision-language models like CLIP have recently shown superior performances
on various downstream tasks, including image classification and segmentation. However, in …

Noisy-correspondence learning for text-to-image person re-identification

Y Qin, Y Chen, D Peng, X Peng… - Proceedings of the …, 2024 - openaccess.thecvf.com
Text-to-image person re-identification (TIReID) is a compelling topic in the cross-modal
community which aims to retrieve the target person based on a textual query. Although …

CrossFuse: A novel cross attention mechanism based infrared and visible image fusion approach

H Li, XJ Wu - Information Fusion, 2024 - Elsevier
Multimodal visual information fusion aims to integrate the multi-sensor data into a single
image which contains more complementary information and less redundant features …