State of the art on diffusion models for visual computing

R Po, W Yifan, V Golyanik, K Aberman… - Computer Graphics …, 2024‏ - Wiley Online Library
The field of visual computing is rapidly advancing due to the emergence of generative
artificial intelligence (AI), which unlocks unprecedented capabilities for the generation …

Recent advances in 3d gaussian splatting

T Wu, YJ Yuan, LX Zhang, J Yang, YP Cao… - Computational Visual …, 2024‏ - Springer
The emergence of 3D Gaussian splatting (3DGS) has greatly accelerated rendering in novel
view synthesis. Unlike neural implicit representations like neural radiance fields (NeRFs) …

Identifying and mitigating vulnerabilities in llm-integrated applications

F Jiang - 2024‏ - search.proquest.com
Large language models (LLMs) are increasingly deployed as the backend for various
applications, including code completion tools and AI-powered search engines. Unlike …

Stable video diffusion: Scaling latent video diffusion models to large datasets

A Blattmann, T Dockhorn, S Kulal… - arxiv preprint arxiv …, 2023‏ - arxiv.org
We present Stable Video Diffusion-a latent video diffusion model for high-resolution, state-of-
the-art text-to-video and image-to-video generation. Recently, latent diffusion models trained …

Wonder3d: Single image to 3d using cross-domain diffusion

X Long, YC Guo, C Lin, Y Liu, Z Dou… - Proceedings of the …, 2024‏ - openaccess.thecvf.com
In this work we introduce Wonder3D a novel method for generating high-fidelity textured
meshes from single-view images with remarkable efficiency. Recent methods based on the …

Lgm: Large multi-view gaussian model for high-resolution 3d content creation

J Tang, Z Chen, X Chen, T Wang, G Zeng… - European Conference on …, 2024‏ - Springer
Abstract 3D content creation has achieved significant progress in terms of both quality and
speed. Although current feed-forward models can produce 3D objects in seconds, their …

Text-to-3d using gaussian splatting

Z Chen, F Wang, Y Wang, H Liu - Proceedings of the IEEE …, 2024‏ - openaccess.thecvf.com
Automatic text-to-3D generation that combines Score Distillation Sampling (SDS) with the
optimization of volume rendering has achieved remarkable progress in synthesizing realistic …

One-2-3-45++: Fast single image to 3d objects with consistent multi-view generation and 3d diffusion

M Liu, R Shi, L Chen, Z Zhang, C Xu… - Proceedings of the …, 2024‏ - openaccess.thecvf.com
Recent advancements in open-world 3D object generation have been remarkable with
image-to-3D methods offering superior fine-grained control over their text-to-3D …

Splatter image: Ultra-fast single-view 3d reconstruction

S Szymanowicz, C Rupprecht… - Proceedings of the …, 2024‏ - openaccess.thecvf.com
Abstract We introduce the Splatter Image an ultra-efficient approach for monocular 3D object
reconstruction. Splatter Image is based on Gaussian Splatting which allows fast and high …

Sv3d: Novel multi-view synthesis and 3d generation from a single image using latent video diffusion

V Voleti, CH Yao, M Boss, A Letts, D Pankratz… - … on Computer Vision, 2024‏ - Springer
Abstract We present Stable Video 3D (SV3D)—a latent video diffusion model for high-
resolution, image-to-multi-view generation of orbital videos around a 3D object. Recent …