A complete survey on generative ai (aigc): Is chatgpt from gpt-4 to gpt-5 all you need?

C Zhang, C Zhang, S Zheng, Y Qiao, C Li… - arxiv preprint arxiv …, 2023 - arxiv.org
As ChatGPT goes viral, generative AI (AIGC, aka AI-generated content) has made headlines
everywhere because of its ability to analyze and create text, images, and beyond. With such …

[HTML][HTML] Data augmentation: A comprehensive survey of modern approaches

A Mumuni, F Mumuni - Array, 2022 - Elsevier
To ensure good performance, modern machine learning models typically require large
amounts of quality annotated data. Meanwhile, the data collection and annotation processes …

Lgm: Large multi-view gaussian model for high-resolution 3d content creation

J Tang, Z Chen, X Chen, T Wang, G Zeng… - European Conference on …, 2024 - Springer
Abstract 3D content creation has achieved significant progress in terms of both quality and
speed. Although current feed-forward models can produce 3D objects in seconds, their …

K-planes: Explicit radiance fields in space, time, and appearance

S Fridovich-Keil, G Meanti… - Proceedings of the …, 2023 - openaccess.thecvf.com
We introduce k-planes, a white-box model for radiance fields in arbitrary dimensions. Our
model uses d-choose-2 planes to represent a d-dimensional scene, providing a seamless …

Realfusion: 360deg reconstruction of any object from a single image

L Melas-Kyriazi, I Laina… - Proceedings of the …, 2023 - openaccess.thecvf.com
We consider the problem of reconstructing a full 360deg photographic model of an object
from a single image of it. We do so by fitting a neural radiance field to the image, but find this …

Hexplane: A fast representation for dynamic scenes

A Cao, J Johnson - … of the IEEE/CVF Conference on …, 2023 - openaccess.thecvf.com
Modeling and re-rendering dynamic 3D scenes is a challenging task in 3D vision. Prior
approaches build on NeRF and rely on implicit representations. This is slow since it requires …

Score jacobian chaining: Lifting pretrained 2d diffusion models for 3d generation

H Wang, X Du, J Li, RA Yeh… - Proceedings of the …, 2023 - openaccess.thecvf.com
A diffusion model learns to predict a vector field of gradients. We propose to apply chain rule
on the learned gradients, and back-propagate the score of a diffusion model through the …

Magic3d: High-resolution text-to-3d content creation

CH Lin, J Gao, L Tang, T Takikawa… - Proceedings of the …, 2023 - openaccess.thecvf.com
Recently, DreamFusion demonstrated the utility of a pretrained text-to-image diffusion model
to optimize Neural Radiance Fields (NeRF), achieving remarkable text-to-3D synthesis …

Mvdream: Multi-view diffusion for 3d generation

Y Shi, P Wang, J Ye, M Long, K Li, X Yang - arxiv preprint arxiv …, 2023 - arxiv.org
We propose MVDream, a multi-view diffusion model that is able to generate geometrically
consistent multi-view images from a given text prompt. By leveraging image diffusion models …

Dreambooth3d: Subject-driven text-to-3d generation

A Raj, S Kaza, B Poole, M Niemeyer… - Proceedings of the …, 2023 - openaccess.thecvf.com
We present DreamBooth3D, an approach to personalize text-to-3D generative models from
as few as 3-6 casually captured images of a subject. Our approach combines recent …