Video super-resolution based on deep learning: a comprehensive survey

H Liu, Z Ruan, P Zhao, C Dong, F Shang, Y Liu… - Artificial Intelligence …, 2022 - Springer
Video super-resolution (VSR) is reconstructing high-resolution videos from low resolution
ones. Recently, the VSR methods based on deep neural networks have made great …

Guided depth map super-resolution: A survey

Z Zhong, X Liu, J Jiang, D Zhao, X Ji - ACM Computing Surveys, 2023 - dl.acm.org
Guided depth map super-resolution (GDSR), which aims to reconstruct a high-resolution
depth map from a low-resolution observation with the help of a paired high-resolution color …

Tri-perspective view for vision-based 3d semantic occupancy prediction

Y Huang, W Zheng, Y Zhang… - Proceedings of the …, 2023 - openaccess.thecvf.com
Modern methods for vision-centric autonomous driving perception widely adopt the bird's-
eye-view (BEV) representation to describe a 3D scene. Despite its better efficiency than …

Implicit diffusion models for continuous super-resolution

S Gao, X Liu, B Zeng, S Xu, Y Li… - Proceedings of the …, 2023 - openaccess.thecvf.com
Image super-resolution (SR) has attracted increasing attention due to its wide applications.
However, current SR methods generally suffer from over-smoothing and artifacts, and most …

3d neural field generation using triplane diffusion

JR Shue, ER Chan, R Po, Z Ankner… - Proceedings of the …, 2023 - openaccess.thecvf.com
Diffusion models have emerged as the state-of-the-art for image generation, among other
tasks. Here, we present an efficient diffusion-based model for 3D-aware generation of neural …

Petr: Position embedding transformation for multi-view 3d object detection

Y Liu, T Wang, X Zhang, J Sun - European Conference on Computer …, 2022 - Springer
In this paper, we develop position embedding transformation (PETR) for multi-view 3D
object detection. PETR encodes the position information of 3D coordinates into image …

Codef: Content deformation fields for temporally consistent video processing

H Ouyang, Q Wang, Y **ao, Q Bai… - Proceedings of the …, 2024 - openaccess.thecvf.com
We present the content deformation field (CoDeF) as a new type of video representation
which consists of a canonical content field aggregating the static contents in the entire video …

Efficient geometry-aware 3d generative adversarial networks

ER Chan, CZ Lin, MA Chan… - Proceedings of the …, 2022 - openaccess.thecvf.com
Unsupervised generation of high-quality multi-view-consistent images and 3D shapes using
only collections of single-view 2D photographs has been a long-standing challenge …

Alias-free generative adversarial networks

T Karras, M Aittala, S Laine… - Advances in neural …, 2021 - proceedings.neurips.cc
We observe that despite their hierarchical convolutional nature, the synthesis process of
typical generative adversarial networks depends on absolute pixel coordinates in an …

Scale-mae: A scale-aware masked autoencoder for multiscale geospatial representation learning

CJ Reed, R Gupta, S Li, S Brockman… - Proceedings of the …, 2023 - openaccess.thecvf.com
Large, pretrained models are commonly finetuned with imagery that is heavily augmented to
mimic different conditions and scales, with the resulting models used for various tasks with …