FreeInit: Bridging initialization gap in video diffusion models
Though diffusion-based video generation has witnessed rapid progress, the inference
results of existing models still exhibit unsatisfactory temporal consistency and unnatural …
StableNormal: Reducing diffusion variance for stable and sharp normal
This work addresses the challenge of high-quality surface normal estimation from monocular
colored inputs (i.e., images and videos), a field which has recently been revolutionized by …
DREAM: Diffusion rectification and estimation-adaptive models
We present DREAM, a novel training framework representing Diffusion Rectification and
Estimation-Adaptive Models, requiring minimal code changes (just three lines) yet …
Generative diffusion for regional surrogate models from sea‐ice simulations
We introduce deep generative diffusion for multivariate and regional surrogate modeling
learned from sea‐ice simulations. Given initial conditions and atmospheric forcings, the …
Flame diffuser: Wildfire image synthesis using mask guided diffusion
Wildfires are a significant threat to ecosystems and human infrastructure, leading to
widespread destruction and environmental degradation. Recent advancements in deep …
H Wang, SPH Boroujeni, X Chen, A Bastola… - ar…
… Stone for Text-to-Video Generation
Text-to-video generation has lagged behind text-to-image synthesis in quality and diversity
due to the complexity of spatio-temporal modeling and limited video-text datasets. This …