Google Scholar

Transformer for object re-identification: A survey

M Ye, S Chen, C Li, WS Zheng, D Crandall… - International Journal of …, 2024 - Springer

Object Re-identification (Re-ID) aims to identify specific objects across different
times and scenes, which is a widely researched task in computer vision. For a prolonged …

Cited by 11 · All 2 versions

[PDF] arxiv.org

CLIP-driven fine-grained text-image person re-identification

S Yan, N Dong, L Zhang, J Tang - IEEE Transactions on Image …, 2023 - ieeexplore.ieee.org

Text-Image Person Re-identification (TIReID) aims to retrieve the image corresponding to
the given text query from a pool of candidate images. Existing methods employ prior …

Cited by 149 · All 7 versions

Towards cognition-augmented human-centric assembly: A visual computation perspective

J Pang, P Zheng, J Fan, T Liu - Robotics and Computer-Integrated …, 2025 - Elsevier

Human-centric assembly is emerging as a promising paradigm for achieving mass
personalization in the context of Industry 5.0, as it fully capitalizes on the advantages of …

Cited by 2 · All 3 versions

[PDF] thecvf.com

Dynamic aggregated network for gait recognition

K Ma, Y Fu, D Zheng, C Cao, X Hu… - Proceedings of the …, 2023 - openaccess.thecvf.com

Gait recognition is beneficial for a variety of applications, including video surveillance, crime
scene investigation, and social security, to mention a few. However, gait recognition often …

Cited by 46 · All 4 versions

[PDF] thecvf.com

PHA: Patch-wise high-frequency augmentation for transformer-based person re-identification

G Zhang, Y Zhang, T Zhang, B Li… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com

Although recent studies empirically show that injecting Convolutional Neural Networks
(CNNs) into Vision Transformers (ViTs) can improve the performance of person re …

Cited by 47 · All 3 versions

[PDF] arxiv.org

AAformer: Auto-aligned transformer for person re-identification

K Zhu, H Guo, S Zhang, Y Wang, J Liu… - … on Neural Networks …, 2023 - ieeexplore.ieee.org

In person re-identification (re-ID), extracting part-level features from person images has
been verified to be crucial to offer fine-grained information. Most of the existing CNN-based …

Cited by 142 · All 5 versions

Dual guidance enabled fuzzy inference for enhanced fine-grained recognition

Q Chen, F He, G Wang, X Bai… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org

In the field of fine-grained visual recognition (FGVR), the ability to resolve minute and often
subtle differences between highly similar object categories is paramount. The advent of …

Cited by 18 · All 2 versions

[PDF] aaai.org

CLIP-ReID: exploiting vision-language model for image re-identification without concrete text labels

S Li, L Sun, Q Li - Proceedings of the AAAI Conference on Artificial …, 2023 - ojs.aaai.org

Pre-trained vision-language models like CLIP have recently shown superior performances
on various downstream tasks, including image classification and segmentation. However, in …

Cited by 119 · All 4 versions

[PDF] thecvf.com

Noisy-correspondence learning for text-to-image person re-identification

Y Qin, Y Chen, D Peng, X Peng… - Proceedings of the …, 2024 - openaccess.thecvf.com

Text-to-image person re-identification (TIReID) is a compelling topic in the cross-modal
community which aims to retrieve the target person based on a textual query. Although …

Cited by 39 · All 3 versions

[PDF] arxiv.org

CrossFuse: A novel cross attention mechanism based infrared and visible image fusion approach

H Li, XJ Wu - Information Fusion, 2024 - Elsevier

Multimodal visual information fusion aims to integrate the multi-sensor data into a single
image which contains more complementary information and less redundant features …

Cited by 66 · All 3 versions
