Text-to-Image Diffusion Models are Great Sketch-Photo Matchmakers

S Koley, AK Bhunia, A Sain… - Proceedings of the …, 2024 - openaccess.thecvf.com
This paper for the first time explores text-to-image diffusion models for Zero-Shot Sketch-
based Image Retrieval (ZS-SBIR). We highlight a pivotal discovery: the capacity of text-to …

You'll Never Walk Alone: A Sketch and Text Duet for Fine-Grained Image Retrieval

S Koley, AK Bhunia, A Sain… - Proceedings of the …, 2024 - openaccess.thecvf.com
Two primary input modalities prevail in image retrieval: sketch and text. While text is widely
used for inter-category retrieval tasks, sketches have been established as the sole preferred …

It's All About Your Sketch: Democratising Sketch Control in Diffusion Models

S Koley, AK Bhunia, D Sekhri, A Sain… - Proceedings of the …, 2024 - openaccess.thecvf.com
This paper unravels the potential of sketches for diffusion models, addressing the deceptive
promise of direct sketch control in generative AI. We importantly democratise the process …

Doodle Your 3D: From Abstract Freehand Sketches to Precise 3D Shapes

H Bandyopadhyay, S Koley, A Das… - Proceedings of the …, 2024 - openaccess.thecvf.com
In this paper, we democratise 3D content creation, enabling precise generation of 3D shapes
from abstract sketches while overcoming limitations tied to drawing skills. We introduce a …

FreestyleRet: Retrieving Images from Style-Diversified Queries

H Li, Y Jia, P Jin, Z Cheng, K Li, J Sui, C Liu… - European Conference on …, 2024 - Springer
Image Retrieval aims to retrieve corresponding images based on a given query. In
application scenarios, users intend to express their retrieval intent through various query …

3D VR Sketch Guided 3D Shape Prototyping and Exploration

L Luo, PN Chowdhury, T Xiang… - Proceedings of the …, 2023 - openaccess.thecvf.com
3D shape modeling is labor-intensive, time-consuming, and requires years of
expertise. To facilitate 3D shape modeling, we propose a 3D shape generation network that …

SceneTrilogy: On Human Scene-Sketch and its Complementarity with Photo and Text

PN Chowdhury, AK Bhunia, A Sain… - Proceedings of the …, 2023 - openaccess.thecvf.com
In this paper, we extend scene understanding to include that of human sketch. The result is a
complete trilogy of scene representation from three diverse and complementary modalities …

FArMARe: a Furniture-Aware Multi-task methodology for Recommending Apartments based on the user interests

A Abdari, A Falcon, G Serra - Proceedings of the IEEE/CVF …, 2023 - openaccess.thecvf.com
Nowadays, many people frequently have to search for new accommodation options.
Searching for a suitable apartment is a time-consuming process, especially because visiting …

TypeDance: Creating Semantic Typographic Logos from Image through Personalized Generation

S Xiao, L Wang, X Ma, W Zeng - Proceedings of the CHI Conference on …, 2024 - dl.acm.org
Semantic typographic logos harmoniously blend typeface and imagery to represent
semantic concepts while maintaining legibility. Conventional methods using spatial …

Universal Vision-Language Dense Retrieval: Learning a Unified Representation Space for Multi-Modal Retrieval

Z Liu, C Xiong, Y Lv, Z Liu, G Yu - arXiv preprint arXiv:2209.00179, 2022 - arxiv.org
This paper presents Universal Vision-Language Dense Retrieval (UniVL-DR), which builds
a unified model for multi-modal retrieval. UniVL-DR encodes queries and multi-modality …