Dual cross-attention learning for fine-grained visual categorization and object re-identification

H Zhu, W Ke, D Li, J Liu, L Tian… - Proceedings of the …, 2022 - openaccess.thecvf.com
Recently, self-attention mechanisms have shown impressive performance in various NLP
and CV tasks, which can help capture sequential characteristics and derive global …

Behavioral intention prediction in driving scenes: A survey

J Fang, F Wang, J Xue, TS Chua - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
In driving scenes, road agents often engage in frequent interaction and strive to understand
their surroundings. Ego-agent (each road agent itself) predicts what behavior will be …

TBE-Net: A three-branch embedding network with part-aware ability and feature complementary learning for vehicle re-identification

W Sun, G Dai, X Zhang, X He… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
Vehicle re-identification (Re-ID) is one of the promising applications in the field of computer
vision. Existing vehicle Re-ID methods mainly focus on global appearance features or pre …

Latent image animator: Learning to animate images via latent space navigation

Y Wang, D Yang, F Bremond, A Dantcheva - arxiv preprint arxiv …, 2022 - arxiv.org
Due to the remarkable progress of deep generative models, animating images has become
increasingly efficient, whereas associated results have become increasingly realistic …

The 7th ai city challenge

M Naphade, S Wang, DC Anastasiu… - Proceedings of the …, 2023 - openaccess.thecvf.com
Abstract The AI City Challenge's seventh edition emphasizes two domains at the intersection
of computer vision and artificial intelligence-retail business and Intelligent Traffic Systems …

Align and tell: Boosting text-video retrieval with local alignment and fine-grained supervision

X Wang, L Zhu, Z Zheng, M Xu… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Text-video retrieval is one of the basic tasks for multimodal research and has been widely
harnessed in many real-world systems. Most existing approaches directly compare the …

Gan-siamese network for cross-domain vehicle re-identification in intelligent transport systems

Z Zhou, Y Li, J Li, K Yu, G Kou, M Wang… - IEEE transactions on …, 2022 - ieeexplore.ieee.org
The vehicle re-identification (Re-ID) has become one of most important techniques for
tracking vehicles in intelligent transport system. Vehicle Re-ID aims at matching identical …

Clothing status awareness for long-term person re-identification

Y Huang, Q Wu, JS Xu, Y Zhong… - Proceedings of the …, 2021 - openaccess.thecvf.com
Long-Term person re-identification (LT-reID) exposes extreme challenges because of the
longer time gaps between two recording footages where a person is likely to change …

Pedestrian-specific bipartite-aware similarity learning for text-based person retrieval

F Shen, X Shu, X Du, J Tang - Proceedings of the 31st ACM International …, 2023 - dl.acm.org
Text-based person retrieval is a challenging task that aims to search pedestrian images with
the same identity according to language descriptions. Current methods usually …

Each part matters: Local patterns facilitate cross-view geo-localization

T Wang, Z Zheng, C Yan, J Zhang… - … on Circuits and …, 2021 - ieeexplore.ieee.org
Cross-view geo-localization is to spot images of the same geographic target from different
platforms, eg, drone-view cameras and satellites. It is challenging in the large visual …