Encoder-based domain tuning for fast personalization of text-to-image models

R Gal, M Arar, Y Atzmon, AH Bermano… - ACM Transactions on …, 2023 - dl.acm.org
Text-to-image personalization aims to teach a pre-trained diffusion model to reason about
novel, user provided concepts, embedding them into new scenes guided by natural …

Sparsectrl: Adding sparse controls to text-to-video diffusion models

Y Guo, C Yang, A Rao, M Agrawala, D Lin… - European Conference on …, 2024 - Springer
The development of text-to-video (T2V), ie, generating videos with a given text prompt, has
been significantly advanced in recent years. However, relying solely on text prompts often …

Diffsketcher: Text guided vector sketch synthesis through latent diffusion models

X **ng, C Wang, H Zhou, J Zhang… - Advances in Neural …, 2023 - proceedings.neurips.cc
Even though trained mainly on images, we discover that pretrained diffusion models show
impressive power in guiding sketch synthesis. In this paper, we present DiffSketcher, an …

Word-as-image for semantic typography

S Iluz, Y Vinker, A Hertz, D Berio, D Cohen-Or… - ACM Transactions on …, 2023 - dl.acm.org
A word-as-image is a semantic typography technique where a word illustration presents a
visualization of the meaning of the word, while also preserving its readability. We present a …

Concept decomposition for visual exploration and inspiration

Y Vinker, A Voynov, D Cohen-Or, A Shamir - ACM Transactions on …, 2023 - dl.acm.org
A creative idea is often born from transforming, combining, and modifying ideas from existing
visual examples capturing various concepts. However, one cannot simply copy the concept …

SVGDreamer: Text guided SVG generation with diffusion model

X **ng, H Zhou, C Wang, J Zhang… - Proceedings of the …, 2024 - openaccess.thecvf.com
Recently text-guided scalable vector graphics (SVGs) synthesis has shown promise in
domains such as iconography and sketch. However existing text-to-SVG generation …

Sketchinr: A first look into sketches as implicit neural representations

H Bandyopadhyay, AK Bhunia… - Proceedings of the …, 2024 - openaccess.thecvf.com
We propose SketchINR to advance the representation of vector sketches with implicit neural
models. A variable length vector sketch is compressed into a latent space of fixed dimension …

Convnet vs transformer, supervised vs clip: Beyond imagenet accuracy

K Vishniakov, Z Shen, Z Liu - arxiv preprint arxiv:2311.09215, 2023 - arxiv.org
Modern computer vision offers a great variety of models to practitioners, and selecting a
model from multiple options for specific applications can be challenging. Conventionally …

Emergent communication in interactive sketch question answering

Z Lei, Y Zhang, Y **ong, S Chen - Advances in Neural …, 2024 - proceedings.neurips.cc
Vision-based emergent communication (EC) aims to learn to communicate through sketches
and demystify the evolution of human communication. Ironically, previous works neglect …

SketchAgent: Language-Driven Sequential Sketch Generation

Y Vinker, TR Shaham, K Zheng, A Zhao, JE Fan… - arxiv preprint arxiv …, 2024 - arxiv.org
Sketching serves as a versatile tool for externalizing ideas, enabling rapid exploration and
visual communication that spans various disciplines. While artificial systems have driven …