[HTML][HTML] Data augmentation: A comprehensive survey of modern approaches

A Mumuni, F Mumuni - Array, 2022 - Elsevier
To ensure good performance, modern machine learning models typically require large
amounts of quality annotated data. Meanwhile, the data collection and annotation processes …

Zero-1-to-3: Zero-shot one image to 3d object

R Liu, R Wu, B Van Hoorick… - Proceedings of the …, 2023 - openaccess.thecvf.com
Abstract We introduce Zero-1-to-3, a framework for changing the camera viewpoint of an
object given just a single RGB image. To perform novel view synthesis in this …

Mvimgnet: A large-scale dataset of multi-view images

X Yu, M Xu, Y Zhang, H Liu, C Ye… - Proceedings of the …, 2023 - openaccess.thecvf.com
Being data-driven is one of the most iconic properties of deep learning algorithms. The birth
of ImageNet drives a remarkable trend of" learning from large-scale data" in computer vision …

Variable bitrate neural fields

T Takikawa, A Evans, J Tremblay, T Müller… - ACM SIGGRAPH 2022 …, 2022 - dl.acm.org
Neural approximations of scalar-and vector fields, such as signed distance functions and
radiance fields, have emerged as accurate, high-quality representations. State-of-the-art …

Removing objects from neural radiance fields

S Weder, G Garcia-Hernando… - Proceedings of the …, 2023 - openaccess.thecvf.com
Abstract Neural Radiance Fields (NeRFs) are emerging as a ubiquitous scene
representation that allows for novel view synthesis. Increasingly, NeRFs will be shareable …

Neo 360: Neural fields for sparse view synthesis of outdoor scenes

MZ Irshad, S Zakharov, K Liu… - Proceedings of the …, 2023 - openaccess.thecvf.com
Recent implicit neural representations have shown great results for novel view synthesis.
However, existing methods require expensive per-scene optimization from many views …

Eschernet: A generative model for scalable view synthesis

X Kong, S Liu, X Lyu, M Taher, X Qi… - Proceedings of the …, 2024 - openaccess.thecvf.com
We introduce EscherNet a multi-view conditioned diffusion model for view synthesis.
EscherNet learns implicit and generative 3D representations coupled with a specialised …

Shacira: Scalable hash-grid compression for implicit neural representations

S Girish, A Shrivastava, K Gupta - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Abstract Implicit Neural Representations (INR) or neural fields have emerged as a popular
framework to encode multimedia signals such as images and radiance fields while retaining …

A survey of synthetic data augmentation methods in machine vision

A Mumuni, F Mumuni, NK Gerrar - Machine Intelligence Research, 2024 - Springer
The standard approach to tackling computer vision problems is to train deep convolutional
neural network (CNN) models using large-scale image datasets that are representative of …

Consistent123: Improve consistency for one image to 3d object synthesis

H Weng, T Yang, J Wang, Y Li, T Zhang… - arxiv preprint arxiv …, 2023 - arxiv.org
Large image diffusion models enable novel view synthesis with high quality and excellent
zero-shot capability. However, such models based on image-to-image translation have no …