A memorizing and generalizing framework for lifelong person re-identification

N Pu, Z Zhong, N Sebe, MS Lew - IEEE Transactions on Pattern …, 2023 - ieeexplore.ieee.org
In this paper, we introduce a challenging yet practical setting for person re-identification
(ReID) task, named lifelong person re-identification (LReID), which aims to continuously …

Model behavior preserving for class-incremental learning

Y Liu, X Hong, X Tao, S Dong, J Shi… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Deep models have shown to be vulnerable to catastrophic forgetting, a phenomenon that
the recognition performance on old data degrades when a pre-trained model is fine-tuned …

Exploring optical-flow-guided motion and detection-based appearance for temporal sentence grounding

D Liu, X Fang, W Hu, P Zhou - IEEE Transactions on Multimedia, 2023 - ieeexplore.ieee.org
Temporal sentence grounding aims to localize a target segment in an untrimmed video
semantically according to a given sentence query. Most previous works focus on learning …

CNDesc: Cross normalization for local descriptors learning

C Wang, R Xu, S Xu, W Meng… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
For a long time, the local descriptors learning benefited from the use of L2 normalization,
which projects the descriptor space onto the hypersphere. However, there is no free lunch in …

Computation-efficient deep learning for computer vision: A survey

Y Wang, Y Han, C Wang, S Song… - Cybernetics and …, 2024 - ieeexplore.ieee.org
Over the past decade, deep learning models have exhibited considerable advancements,
reaching or even exceeding human-level performance in a range of visual perception tasks …

Incloud: Incremental learning for point cloud place recognition

J Knights, P Moghadam, M Ramezani… - 2022 IEEE/RSJ …, 2022 - ieeexplore.ieee.org
Place recognition is a fundamental component of robotics, and has seen tremendous
improvements through the use of deep learning models in recent years. Networks can …

Unsupervised knowledge representation of panoramic dental X-ray images using SVG image-and-object clustering

K Salameh, FE Akoum, J Tekli - Multimedia Systems, 2023 - Springer
Given that the meaning of an image is rarely self-evident using traditional keyword and/or
content-based descriptions, the general goal of this study is to convert, with minimal human …

Cross-Modal Alternating Learning with Task-Aware Representations for Continual Learning

W Li, BB Gao, B **a, J Wang, J Liu, Y Liu… - IEEE Transactions …, 2023 - ieeexplore.ieee.org
Continual learning is a research field of artificial neural networks to simulate human lifelong
learning ability. Although a surge of investigations has achieved considerable performance …

Deep learning for weakly-supervised object detection and object localization: A survey

F Shao, L Chen, J Shao, W Ji, S **ao, L Ye… - arxiv preprint arxiv …, 2021 - arxiv.org
Weakly-Supervised Object Detection (WSOD) and Localization (WSOL), ie, detecting
multiple and single instances with bounding boxes in an image using image-level labels …

CM-MaskSD: Cross-Modality Masked Self-Distillation for Referring Image Segmentation

W Wang, X He, Y Zhang, L Guo… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Referring image segmentation (RIS) is a fundamental vision-language task that intends to
segment a desired object from an image based on a given natural language expression …