A complete survey on generative ai (aigc): Is chatgpt from gpt-4 to gpt-5 all you need?
As ChatGPT goes viral, generative AI (AIGC, aka AI-generated content) has made headlines
everywhere because of its ability to analyze and create text, images, and beyond. With such …
everywhere because of its ability to analyze and create text, images, and beyond. With such …
[HTML][HTML] Data augmentation: A comprehensive survey of modern approaches
A Mumuni, F Mumuni - Array, 2022 - Elsevier
To ensure good performance, modern machine learning models typically require large
amounts of quality annotated data. Meanwhile, the data collection and annotation processes …
amounts of quality annotated data. Meanwhile, the data collection and annotation processes …
Lgm: Large multi-view gaussian model for high-resolution 3d content creation
Abstract 3D content creation has achieved significant progress in terms of both quality and
speed. Although current feed-forward models can produce 3D objects in seconds, their …
speed. Although current feed-forward models can produce 3D objects in seconds, their …
K-planes: Explicit radiance fields in space, time, and appearance
We introduce k-planes, a white-box model for radiance fields in arbitrary dimensions. Our
model uses d-choose-2 planes to represent a d-dimensional scene, providing a seamless …
model uses d-choose-2 planes to represent a d-dimensional scene, providing a seamless …
Realfusion: 360deg reconstruction of any object from a single image
We consider the problem of reconstructing a full 360deg photographic model of an object
from a single image of it. We do so by fitting a neural radiance field to the image, but find this …
from a single image of it. We do so by fitting a neural radiance field to the image, but find this …
Hexplane: A fast representation for dynamic scenes
Modeling and re-rendering dynamic 3D scenes is a challenging task in 3D vision. Prior
approaches build on NeRF and rely on implicit representations. This is slow since it requires …
approaches build on NeRF and rely on implicit representations. This is slow since it requires …
Score jacobian chaining: Lifting pretrained 2d diffusion models for 3d generation
A diffusion model learns to predict a vector field of gradients. We propose to apply chain rule
on the learned gradients, and back-propagate the score of a diffusion model through the …
on the learned gradients, and back-propagate the score of a diffusion model through the …
Magic3d: High-resolution text-to-3d content creation
Recently, DreamFusion demonstrated the utility of a pretrained text-to-image diffusion model
to optimize Neural Radiance Fields (NeRF), achieving remarkable text-to-3D synthesis …
to optimize Neural Radiance Fields (NeRF), achieving remarkable text-to-3D synthesis …
Mvdream: Multi-view diffusion for 3d generation
We propose MVDream, a multi-view diffusion model that is able to generate geometrically
consistent multi-view images from a given text prompt. By leveraging image diffusion models …
consistent multi-view images from a given text prompt. By leveraging image diffusion models …
Dreambooth3d: Subject-driven text-to-3d generation
We present DreamBooth3D, an approach to personalize text-to-3D generative models from
as few as 3-6 casually captured images of a subject. Our approach combines recent …
as few as 3-6 casually captured images of a subject. Our approach combines recent …