Encoder-based domain tuning for fast personalization of text-to-image models
Text-to-image personalization aims to teach a pre-trained diffusion model to reason about
novel, user provided concepts, embedding them into new scenes guided by natural …
novel, user provided concepts, embedding them into new scenes guided by natural …
Sparsectrl: Adding sparse controls to text-to-video diffusion models
The development of text-to-video (T2V), ie, generating videos with a given text prompt, has
been significantly advanced in recent years. However, relying solely on text prompts often …
been significantly advanced in recent years. However, relying solely on text prompts often …
Diffsketcher: Text guided vector sketch synthesis through latent diffusion models
Even though trained mainly on images, we discover that pretrained diffusion models show
impressive power in guiding sketch synthesis. In this paper, we present DiffSketcher, an …
impressive power in guiding sketch synthesis. In this paper, we present DiffSketcher, an …
Word-as-image for semantic typography
A word-as-image is a semantic typography technique where a word illustration presents a
visualization of the meaning of the word, while also preserving its readability. We present a …
visualization of the meaning of the word, while also preserving its readability. We present a …
Concept decomposition for visual exploration and inspiration
A creative idea is often born from transforming, combining, and modifying ideas from existing
visual examples capturing various concepts. However, one cannot simply copy the concept …
visual examples capturing various concepts. However, one cannot simply copy the concept …
SVGDreamer: Text guided SVG generation with diffusion model
Recently text-guided scalable vector graphics (SVGs) synthesis has shown promise in
domains such as iconography and sketch. However existing text-to-SVG generation …
domains such as iconography and sketch. However existing text-to-SVG generation …
Sketchinr: A first look into sketches as implicit neural representations
We propose SketchINR to advance the representation of vector sketches with implicit neural
models. A variable length vector sketch is compressed into a latent space of fixed dimension …
models. A variable length vector sketch is compressed into a latent space of fixed dimension …
Convnet vs transformer, supervised vs clip: Beyond imagenet accuracy
Modern computer vision offers a great variety of models to practitioners, and selecting a
model from multiple options for specific applications can be challenging. Conventionally …
model from multiple options for specific applications can be challenging. Conventionally …
Emergent communication in interactive sketch question answering
Vision-based emergent communication (EC) aims to learn to communicate through sketches
and demystify the evolution of human communication. Ironically, previous works neglect …
and demystify the evolution of human communication. Ironically, previous works neglect …
SketchAgent: Language-Driven Sequential Sketch Generation
Sketching serves as a versatile tool for externalizing ideas, enabling rapid exploration and
visual communication that spans various disciplines. While artificial systems have driven …
visual communication that spans various disciplines. While artificial systems have driven …