A review of modern fashion recommender systems

Y Deldjoo, F Nazary, A Ramisa, J McAuley… - ACM Computing …, 2023 - dl.acm.org
The textile and apparel industries have grown tremendously over the past few years.
Customers no longer have to visit many stores, stand in long queues, or try on garments in …

Discriminative probing and tuning for text-to-image generation

L Qu, W Wang, Y Li, H Zhang, L Nie… - Proceedings of the …, 2024 - openaccess.thecvf.com
Despite advancements in text-to-image generation (T2I), prior methods often face text-image
misalignment problems such as relation confusion in generated images. Existing solutions …

Mutual-enhanced incongruity learning network for multi-modal sarcasm detection

Y Qiao, L Jing, X Song, X Chen, L Zhu… - Proceedings of the AAAI …, 2023 - ojs.aaai.org
Sarcasm is a sophisticated linguistic phenomenon that is prevalent on today's social media
platforms. Multi-modal sarcasm detection aims to identify whether a given sample with multi …

Target-guided composed image retrieval

H Wen, X Zhang, X Song, Y Wei, L Nie - Proceedings of the 31st ACM …, 2023 - dl.acm.org
Composed image retrieval (CIR) is a new and flexible image retrieval paradigm, which can
retrieve the target image for a multimodal query, including a reference image and its …

Composed image retrieval with text feedback via multi-grained uncertainty regularization

Y Chen, Z Zheng, W Ji, L Qu, TS Chua - arXiv preprint arXiv:2211.07394, 2022 - arxiv.org
We investigate composed image retrieval with text feedback. Users gradually look for the
target of interest by moving from coarse to fine-grained feedback. However, existing …

Fine-grained textual inversion network for zero-shot composed image retrieval

H Lin, H Wen, X Song, M Liu, Y Hu, L Nie - Proceedings of the 47th …, 2024 - dl.acm.org
Composed Image Retrieval (CIR) allows users to search target images with a multimodal
query, comprising a reference image and a modification text that describes the user's …

Composed image retrieval using contrastive learning and task-oriented CLIP-based features

A Baldrati, M Bertini, T Uricchio… - ACM Transactions on …, 2023 - dl.acm.org
Given a query composed of a reference image and a relative caption, the Composed Image
Retrieval goal is to retrieve images visually similar to the reference one that integrates the …

Multi-grained attention network with mutual exclusion for composed query-based image retrieval

S Li, X Xu, X Jiang, F Shen, X Liu… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
The Composed Query-Based Image Retrieval (CQBIR) task aims to precisely obtain the
preserved and modified parts, based on the multi-grained semantics learned from the …

Cross-modal feature alignment and fusion for composed image retrieval

Y Wan, W Wang, G Zou… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
Composed Image Retrieval (CIR) presents challenges in expressing search intent
through hybrid-modality queries where users search for a target image using another image …

Composed image retrieval via cross relation network with hierarchical aggregation transformer

Q Yang, M Ye, Z Cai, K Su, B Du - IEEE Transactions on Image …, 2023 - ieeexplore.ieee.org
Composing Text and Image to Image Retrieval (CTI-IR) aims at finding the target image,
which matches the query image visually along with the query text semantically. However …