Svdqunat: Absorbing outliers by low-rank components for 4-bit diffusion models

M Li, Y Lin, Z Zhang, T Cai, X Li, J Guo, E **e… - arxiv preprint arxiv …, 2024 - arxiv.org
Diffusion models have been proven highly effective at generating high-quality images.
However, as these models grow larger, they require significantly more memory and suffer …

Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient

Z Chen, X Ma, G Fang, X Wang - arxiv preprint arxiv:2411.17787, 2024 - arxiv.org
In the rapidly advancing field of image generation, Visual Auto-Regressive (VAR) modeling
has garnered considerable attention for its innovative next-scale prediction approach. This …

Staleness-Centric Optimizations for Efficient Diffusion MoE Inference

J Luo, L Luo, J Xu, J Song, R Lu, C Tang… - arxiv preprint arxiv …, 2024 - arxiv.org
Mixture-of-experts-based (MoE-based) diffusion models have shown their scalability and
ability to generate high-quality images, making them a promising choice for efficient model …

[PDF][PDF] Diffusion Models at Scale: Techniques, Applications, and Challenges

RD Sa'dia Abul-Fazl, H Fairuza - 2025 - preprints.org
Diffusion models have emerged as a powerful class of generative models, offering state-
ofthe-art performance across various domains such as image synthesis, audio generation …

[PDF][PDF] Diffusion Models in Generative AI: Principles, Applications, and Future Directions

K Hadiyya, R Dina, B Mokhtar - 2025 - preprints.org
Diffusion models have emerged as a powerful class of generative models, offering state-
ofthe-art performance across a wide range of applications in artificial intelligence. Rooted in …

Highlight Diffusion: Training-Free Attention Guided Acceleration for Text-to-Image Models

K Nam, Y Kim, J Park - openreview.net
Diffusion models have achieved exceptional results in image synthesis, yet their sequential
processing nature imposes significant computational demands and latency, posing …