Freeinit: Bridging initialization gap in video diffusion models

T Wu, C Si, Y Jiang, Z Huang, Z Liu - European Conference on Computer …, 2024 - Springer
Though diffusion-based video generation has witnessed rapid progress, the inference
results of existing models still exhibit unsatisfactory temporal consistency and unnatural …

Stablenormal: Reducing diffusion variance for stable and sharp normal

C Ye, L Qiu, X Gu, Q Zuo, Y Wu, Z Dong, L Bo… - ACM Transactions on …, 2024 - dl.acm.org
This work addresses the challenge of high-quality surface normal estimation from monocular
colored inputs (ie, images and videos), a field which has recently been revolutionized by …

Dream: Diffusion rectification and estimation-adaptive models

J Zhou, T Ding, T Chen, J Jiang… - Proceedings of the …, 2024 - openaccess.thecvf.com
We present DREAM a novel training framework representing Diffusion Rectification and
Estimation-Adaptive Models requiring minimal code changes (just three lines) yet …

Generative diffusion for regional surrogate models from sea‐ice simulations

TS Finn, C Durand, A Farchi, M Bocquet… - Journal of Advances …, 2024 - Wiley Online Library
We introduce deep generative diffusion for multivariate and regional surrogate modeling
learned from sea‐ice simulations. Given initial conditions and atmospheric forcings, the …

Flame diffuser: Wildfire image synthesis using mask guided diffusion

H Wang, SPH Boroujeni, X Chen… - … Conference on Big …, 2024 - ieeexplore.ieee.org
Wildfires are a significant threat to ecosystems and human infrastructure, leading to
widespread destruction and environmental degradation. Recent advancements in deep …

FLAME Diffuser: Wildfire Image Synthesis using Mask Guided Diffusion

H Wang, SPH Boroujeni, X Chen, A Bastola… - ar** Stone for Text-to-Video Generation
X Guo, J Liu, M Cui, D Huang - arxiv preprint arxiv:2406.02230, 2024 - arxiv.org
Text-to-video generation has lagged behind text-to-image synthesis in quality and diversity
due to the complexity of spatio-temporal modeling and limited video-text datasets. This …