Študovňa Google

HV Pham, S Qian, J Wang, T Lutellier… - Proceedings of the 35th …, 2020 - dl.acm.org

Deep learning (DL) training algorithms utilize nondeterminism to improve models' accuracy
and training efficiency. Hence, multiple identical training runs (eg, identical training data …

Uložiť Citovať Citované 162-krát Súvisiace články Všetky verzie 9

[Free GPT-4]
[DeepSeek]

[PDF] sciencedirect.com

Multimodal research in vision and language: A review of current and emerging trends

S Uppal, S Bhagat, D Hazarika, N Majumder, S Poria… - Information …, 2022 - Elsevier

Deep Learning and its applications have cascaded impactful research and development
with a diverse range of modalities present in the real-world data. More recently, this has …

Uložiť Citovať Citované 107-krát Súvisiace články Všetky verzie 5

[Free GPT-4]
[DeepSeek]

[PDF] thecvf.com

Pic2word: Map** pictures to words for zero-shot composed image retrieval

K Saito, K Sohn, X Zhang, CL Li… - Proceedings of the …, 2023 - openaccess.thecvf.com

Abstract In Composed Image Retrieval (CIR), a user combines a query image with text to
describe their intended target. Existing methods rely on supervised learning of CIR models …

Uložiť Citovať Citované 110-krát Súvisiace články Všetky verzie 11 HTML verzia

[Free GPT-4]
[DeepSeek]

[PDF] thecvf.com

Zero-shot composed image retrieval with textual inversion

A Baldrati, L Agnolucci, M Bertini… - Proceedings of the …, 2023 - openaccess.thecvf.com

Abstract Composed Image Retrieval (CIR) aims to retrieve a target image based on a query
composed of a reference image and a relative caption that describes the difference between …

Uložiť Citovať Citované 100-krát Súvisiace články Všetky verzie 9 HTML verzia

[Free GPT-4]
[DeepSeek]

[PDF] springer.com

A survey on deep multimodal learning for computer vision: advances, trends, applications, and datasets

K Bayoudh, R Knani, F Hamdaoui, A Mtibaa - The Visual Computer, 2022 - Springer

The research progress in multimodal learning has grown rapidly over the last decade in
several areas, especially in computer vision. The growing potential of multimodal data …

Uložiť Citovať Citované 356-krát Súvisiace články Všetky verzie 8

[Free GPT-4]
[DeepSeek]

[PDF] thecvf.com

Effective conditioned and composed image retrieval combining clip-based features

A Baldrati, M Bertini, T Uricchio… - Proceedings of the …, 2022 - openaccess.thecvf.com

Conditioned and composed image retrieval extend CBIR systems by combining a query
image with an additional text that expresses the intent of the user, describing additional …

Uložiť Citovať Citované 156-krát Súvisiace články Všetky verzie 5 HTML verzia

[Free GPT-4]
[DeepSeek]

[PDF] thecvf.com

Image retrieval on real-life images with pre-trained vision-and-language models

Z Liu, C Rodriguez-Opazo… - Proceedings of the …, 2021 - openaccess.thecvf.com

We extend the task of composed image retrieval, where an input query consists of an image
and short textual description of how to modify the image. Existing methods have only been …

Uložiť Citovať Citované 201-krát Súvisiace články Všetky verzie 8 HTML verzia

[Free GPT-4]
[DeepSeek]

[PDF] thecvf.com

Language-only training of zero-shot composed image retrieval

G Gu, S Chun, W Kim, Y Kang… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com

Composed image retrieval (CIR) task takes a composed query of image and text aiming to
search relative images for both conditions. Conventional CIR approaches need a training …

Uložiť Citovať Citované 33-krát Súvisiace články Všetky verzie 8 HTML verzia

[Free GPT-4]
[DeepSeek]

[PDF] thecvf.com

Knowledge-enhanced dual-stream zero-shot composed image retrieval

Y Suo, F Ma, L Zhu, Y Yang - Proceedings of the IEEE/CVF …, 2024 - openaccess.thecvf.com

We study the zero-shot Composed Image Retrieval (ZS-CIR) task which is to retrieve the
target image given a reference image and a description without training on the triplet …

Uložiť Citovať Citované 17-krát Súvisiace články Všetky verzie 6 HTML verzia

[Free GPT-4]
[DeepSeek]

[PDF] thecvf.com

Fashionvlp: Vision language transformer for fashion retrieval with feedback

S Goenka, Z Zheng, A Jaiswal… - Proceedings of the …, 2022 - openaccess.thecvf.com

Fashion image retrieval based on a query pair of reference image and natural language
feedback is a challenging task that requires models to assess fashion related information …

Uložiť Citovať Citované 102-krát Súvisiace články Všetky verzie 5 HTML verzia

Vytvoriť upozornenie

Citovať

Rozšírené vyhľadávanie

Uložené do mojej knižnice

Composing text and image for image retrieval-an empirical odyssey

Problems and opportunities in training deep learning software systems: An analysis of variance

Multimodal research in vision and language: A review of current and emerging trends

Pic2word: Map** pictures to words for zero-shot composed image retrieval

Zero-shot composed image retrieval with textual inversion

A survey on deep multimodal learning for computer vision: advances, trends, applications, and datasets

Effective conditioned and composed image retrieval combining clip-based features

Image retrieval on real-life images with pre-trained vision-and-language models

Language-only training of zero-shot composed image retrieval

Knowledge-enhanced dual-stream zero-shot composed image retrieval

Fashionvlp: Vision language transformer for fashion retrieval with feedback