Ufinebench: Towards text-based person retrieval with ultra-fine granularity

J Zuo, H Zhou, Y Nie, F Zhang, T Guo… - Proceedings of the …, 2024 - openaccess.thecvf.com
Existing text-based person retrieval datasets often have relatively coarse-grained text
annotations. This hinders the model to comprehend the fine-grained semantics of query …

An AI pipeline for garment price projection using computer vision

R Rico Gómez, J Lorentz, T Hartmann, A Goknil… - Neural Computing and …, 2024 - Springer
The fashion industry's traditional price-setting methods, based on historical sales and
Fashion Week trends, are inadequate in the digital era. Rapid changes in collections and …

Prompt-guided transformers for end-to-end open-vocabulary object detection

H Song, J Bang - arxiv preprint arxiv:2303.14386, 2023 - arxiv.org
Prompt-OVD is an efficient and effective framework for open-vocabulary object detection that
utilizes class embeddings from CLIP as prompts, guiding the Transformer decoder to detect …

Prompt-Guided DETR with RoI-pruned masked attention for open-vocabulary object detection

H Song, J Bang - Pattern Recognition, 2024 - Elsevier
Prompt-OVD is an efficient and effective DETR-based framework for open-vocabulary object
detection that utilizes class embeddings from CLIP as prompts, guiding the Transformer …

Videoadviser: Video knowledge distillation for multimodal transfer learning

Y Wang, D Zeng, S Wada, S Kurihara - IEEE Access, 2023 - ieeexplore.ieee.org
Multimodal transfer learning aims to transform pretrained representations of diverse
modalities into a common domain space for effective multimodal fusion. However …

AdvDenoise: Fast Generation Framework of Universal and Robust Adversarial Patches Using Denoise

J Li, Z Wang, J Li - … of the IEEE/CVF Conference on …, 2024 - openaccess.thecvf.com
Adversarial patch attacks which can mislead deep learning models and the human eye in
both the digital and physical domains have led to a trust crisis. Traditional approaches to …

[ЦИТАТА][C] An AI pipeline for garment price projection using computer vision

RR Gomez, J Lorentz, T Hartmann, A Goknil, IP Singh…