[HTML][HTML] Generative AI for visualization: State of the art and future directions

Y Ye, J Hao, Y Hou, Z Wang, S **ao, Y Luo, W Zeng - Visual Informatics, 2024 - Elsevier
Generative AI (GenAI) has witnessed remarkable progress in recent years and
demonstrated impressive performance in various generation tasks in different domains such …

Textdiffuser-2: Unleashing the power of language models for text rendering

J Chen, Y Huang, T Lv, L Cui, Q Chen, F Wei - European Conference on …, 2024 - Springer
The diffusion model has been proven a powerful generative model in recent years, yet it
remains a challenge in generating visual text. Although existing work has endeavored to …

Svgdreamer: Text guided svg generation with diffusion model

X **ng, H Zhou, C Wang, J Zhang… - Proceedings of the …, 2024 - openaccess.thecvf.com
Recently text-guided scalable vector graphics (SVGs) synthesis has shown promise in
domains such as iconography and sketch. However existing text-to-SVG generation …

Kosmos-2.5: A multimodal literate model

T Lv, Y Huang, J Chen, Y Zhao, Y Jia, L Cui… - arxiv preprint arxiv …, 2023 - arxiv.org
The automatic reading of text-intensive images represents a significant advancement toward
achieving Artificial General Intelligence (AGI). In this paper we present KOSMOS-2.5, a …

Genartist: Multimodal llm as an agent for unified image generation and editing

Z Wang, A Li, Z Li, X Liu - Advances in Neural Information …, 2025 - proceedings.neurips.cc
Despite the success achieved by existing image generation and editing methods, current
models still struggle with complex problems including intricate text prompts, and the …

FineMatch: Aspect-Based Fine-Grained Image and Text Mismatch Detection and Correction

H Hua, J Shi, K Kafle, S Jenni, D Zhang… - … on Computer Vision, 2024 - Springer
Recent progress in large-scale pre-training has led to the development of advanced vision-
language models (VLMs) with remarkable proficiency in comprehending and generating …

[PDF][PDF] Intelligent Artistic Typography: A Comprehensive Review of Artistic Text Design and Generation

Y Bai, Z Huang, W Gao, S Yang… - APSIPA Transactions on …, 2024 - nowpublishers.com
Artistic text generation aims to amplify the aesthetic qualities of text while maintaining
readability. It can make the text more attractive and better convey its expression, thus …

Autodir: Automatic all-in-one image restoration with latent diffusion

Y Jiang, Z Zhang, T Xue, J Gu - European Conference on Computer Vision, 2024 - Springer
We present AutoDIR, an innovative all-in-one image restoration system incorporating latent
diffusion. AutoDIR excels in its ability to automatically identify and restore images suffering …

Choose what you need: Disentangled representation learning for scene text recognition removal and editing

B Zhang, H **e, Z Gao, Y Wang - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
Scene text images contain not only style information (font background) but also content
information (character texture). Different scene text tasks need different information but …

Glyph-byt5: A customized text encoder for accurate visual text rendering

Z Liu, W Liang, Z Liang, C Luo, J Li, G Huang… - … on Computer Vision, 2024 - Springer
Visual text rendering poses a fundamental challenge for contemporary text-to-image
generation models, with the core problem lying in text encoder deficiencies. To achieve …