Google Učenjak

T Wu, YJ Yuan, LX Zhang, J Yang, YP Cao… - Computational Visual …, 2024 - Springer

The emergence of 3D Gaussian splatting (3DGS) has greatly accelerated rendering in novel
view synthesis. Unlike neural implicit representations like neural radiance fields (NeRFs) …

Shrani Navedi Navedeno v 69 virih Sorodni članki Vse različice: 8

[免费ChatGPT] [DeepSeek可用网址] [PDF] arxiv.org

Adversarial diffusion distillation

A Sauer, D Lorenz, A Blattmann… - European Conference on …, 2024 - Springer

Abstract We introduce Adversarial Diffusion Distillation (ADD), a novel training approach that
efficiently samples large-scale foundational image diffusion models in just 1–4 steps while …

Shrani Navedi Navedeno v 312 virih Sorodni članki Vse različice: 6

[免费ChatGPT] [DeepSeek可用网址] [PDF] neurips.cc

Visual autoregressive modeling: Scalable image generation via next-scale prediction

K Tian, Y Jiang, Z Yuan, B Peng… - Advances in neural …, 2025 - proceedings.neurips.cc

Abstract We present Visual AutoRegressive modeling (VAR), a new generation paradigm
that redefines the autoregressive learning on images as coarse-to-fine" next-scale …

Shrani Navedi Navedeno v 168 virih Sorodni članki Vse različice: 5 V obliki HTML

[免费ChatGPT] [DeepSeek可用网址] [PDF] arxiv.org

Lavie: High-quality video generation with cascaded latent diffusion models

Y Wang, X Chen, X Ma, S Zhou, Z Huang… - International Journal of …, 2024 - Springer

This work aims to learn a high-quality text-to-video (T2V) generative model by leveraging a
pre-trained text-to-image (T2I) model as a basis. It is a highly desirable yet challenging task …

Shrani Navedi Navedeno v 232 virih Sorodni članki Vse različice: 4

[免费ChatGPT] [DeepSeek可用网址] [PDF] thecvf.com

Diffusion model alignment using direct preference optimization

B Wallace, M Dang, R Rafailov… - Proceedings of the …, 2024 - openaccess.thecvf.com

Large language models (LLMs) are fine-tuned using human comparison data with
Reinforcement Learning from Human Feedback (RLHF) methods to make them better …

Shrani Navedi Navedeno v 147 virih Sorodni članki Vse različice: 6 V obliki HTML

[免费ChatGPT] [DeepSeek可用网址] [PDF] thecvf.com

Align your gaussians: Text-to-4d with dynamic 3d gaussians and composed diffusion models

H Ling, SW Kim, A Torralba… - Proceedings of the …, 2024 - openaccess.thecvf.com

Text-guided diffusion models have revolutionized image and video generation and have
also been successfully used for optimization-based 3D object synthesis. Here we instead …

Shrani Navedi Navedeno v 95 virih Sorodni članki Vse različice: 6 V obliki HTML

[免费ChatGPT] [DeepSeek可用网址] [PDF] thecvf.com

Distrifusion: Distributed parallel inference for high-resolution diffusion models

M Li, T Cai, J Cao, Q Zhang, H Cai… - Proceedings of the …, 2024 - openaccess.thecvf.com

Diffusion models have achieved great success in synthesizing high-quality images.
However generating high-resolution images with diffusion models is still challenging due to …

Shrani Navedi Navedeno v 47 virih Sorodni članki Vse različice: 8 V obliki HTML

[免费ChatGPT] [DeepSeek可用网址] [PDF] thecvf.com

Emu edit: Precise image editing via recognition and generation tasks

S Sheynin, A Polyak, U Singer… - Proceedings of the …, 2024 - openaccess.thecvf.com

Instruction-based image editing holds immense potential for a variety of applications as it
enables users to perform any editing operation using a natural language instruction …

Shrani Navedi Navedeno v 92 virih Sorodni članki Vse različice: 7 V obliki HTML

[免费ChatGPT] [DeepSeek可用网址] [PDF] arxiv.org

Fast high-resolution image synthesis with latent adversarial diffusion distillation

A Sauer, F Boesel, T Dockhorn, A Blattmann… - SIGGRAPH Asia 2024 …, 2024 - dl.acm.org

Diffusion models are the main driver of progress in image and video synthesis, but suffer
from slow inference speed. Distillation methods, like the recently introduced adversarial …

Shrani Navedi Navedeno v 85 virih Sorodni članki Vse različice: 3

[免费ChatGPT] [DeepSeek可用网址] [PDF] neurips.cc

Vista: A generalizable driving world model with high fidelity and versatile controllability

S Gao, J Yang, L Chen, K Chitta… - Advances in …, 2025 - proceedings.neurips.cc

World models can foresee the outcomes of different actions, which is of paramount
importance for autonomous driving. Nevertheless, existing driving world models still have …

Shrani Navedi Navedeno v 44 virih Sorodni članki Vse različice: 6 V obliki HTML

Ustvari opozorilo

Navedi

Napredno iskanje

Shranjeno v Mojo knjižnico

Emu: Enhancing image generation models using photogenic needles in a haystack

Recent advances in 3d gaussian splatting

Adversarial diffusion distillation

Visual autoregressive modeling: Scalable image generation via next-scale prediction

Lavie: High-quality video generation with cascaded latent diffusion models

Diffusion model alignment using direct preference optimization

Align your gaussians: Text-to-4d with dynamic 3d gaussians and composed diffusion models

Distrifusion: Distributed parallel inference for high-resolution diffusion models

Emu edit: Precise image editing via recognition and generation tasks

Fast high-resolution image synthesis with latent adversarial diffusion distillation

Vista: A generalizable driving world model with high fidelity and versatile controllability