A complete survey on generative ai (aigc): Is chatgpt from gpt-4 to gpt-5 all you need?

C Zhang, C Zhang, S Zheng, Y Qiao, C Li… - arxiv preprint arxiv …, 2023 - arxiv.org
As ChatGPT goes viral, generative AI (AIGC, aka AI-generated content) has made headlines
everywhere because of its ability to analyze and create text, images, and beyond. With such …

Robustness-aware 3d object detection in autonomous driving: A review and outlook

Z Song, L Liu, F Jia, Y Luo, C Jia… - IEEE Transactions …, 2024 - ieeexplore.ieee.org
In the realm of modern autonomous driving, the perception system is indispensable for
accurately assessing the state of the surrounding environment, thereby enabling informed …

Dust3r: Geometric 3d vision made easy

S Wang, V Leroy, Y Cabon… - Proceedings of the …, 2024 - openaccess.thecvf.com
Multi-view stereo reconstruction (MVS) in the wild requires to first estimate the camera
intrinsic and extrinsic parameters. These are usually tedious and cumbersome to obtain yet …

Locally attentional sdf diffusion for controllable 3d shape generation

XY Zheng, H Pan, PS Wang, X Tong, Y Liu… - ACM Transactions on …, 2023 - dl.acm.org
Although the recent rapid evolution of 3D generative neural networks greatly improves 3D
shape generation, it is still not convenient for ordinary users to create 3D shapes and control …

A survey of visual transformers

Y Liu, Y Zhang, Y Wang, F Hou, J Yuan… - … on Neural Networks …, 2023 - ieeexplore.ieee.org
Transformer, an attention-based encoder–decoder model, has already revolutionized the
field of natural language processing (NLP). Inspired by such significant achievements, some …

Vision transformer for nerf-based view synthesis from a single input image

KE Lin, YC Lin, WS Lai, TY Lin… - Proceedings of the …, 2023 - openaccess.thecvf.com
Although neural radiance fields (NeRF) have shown impressive advances in novel view
synthesis, most methods require multiple input images of the same scene with accurate …

Multiview compressive coding for 3D reconstruction

CY Wu, J Johnson, J Malik… - Proceedings of the …, 2023 - openaccess.thecvf.com
A central goal of visual recognition is to understand objects and scenes from a single image.
2D recognition has witnessed tremendous progress thanks to large-scale learning and …

Multi-view aggregation network for dichotomous image segmentation

Q Yu, X Zhao, Y Pang, L Zhang… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
Abstract Dichotomous Image Segmentation (DIS) has recently emerged towards high-
precision object segmentation from high-resolution natural images. When designing an …

Sketch and text guided diffusion model for colored point cloud generation

Z Wu, Y Wang, M Feng, H **e… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Diffusion probabilistic models have achieved remarkable success in text guided image
generation. However, generating 3D shapes is still challenging due to the lack of sufficient …

Deep learning-based 3D reconstruction: a survey

T Samavati, M Soryani - Artificial Intelligence Review, 2023 - Springer
Image-based 3D reconstruction is a long-established, ill-posed problem defined within the
scope of computer vision and graphics. The purpose of image-based 3D reconstruction is to …