Video super-resolution based on deep learning: a comprehensive survey
Video super-resolution (VSR) is reconstructing high-resolution videos from low resolution
ones. Recently, the VSR methods based on deep neural networks have made great …
ones. Recently, the VSR methods based on deep neural networks have made great …
Guided depth map super-resolution: A survey
Guided depth map super-resolution (GDSR), which aims to reconstruct a high-resolution
depth map from a low-resolution observation with the help of a paired high-resolution color …
depth map from a low-resolution observation with the help of a paired high-resolution color …
Tri-perspective view for vision-based 3d semantic occupancy prediction
Modern methods for vision-centric autonomous driving perception widely adopt the bird's-
eye-view (BEV) representation to describe a 3D scene. Despite its better efficiency than …
eye-view (BEV) representation to describe a 3D scene. Despite its better efficiency than …
Implicit diffusion models for continuous super-resolution
Image super-resolution (SR) has attracted increasing attention due to its wide applications.
However, current SR methods generally suffer from over-smoothing and artifacts, and most …
However, current SR methods generally suffer from over-smoothing and artifacts, and most …
3d neural field generation using triplane diffusion
Diffusion models have emerged as the state-of-the-art for image generation, among other
tasks. Here, we present an efficient diffusion-based model for 3D-aware generation of neural …
tasks. Here, we present an efficient diffusion-based model for 3D-aware generation of neural …
Petr: Position embedding transformation for multi-view 3d object detection
In this paper, we develop position embedding transformation (PETR) for multi-view 3D
object detection. PETR encodes the position information of 3D coordinates into image …
object detection. PETR encodes the position information of 3D coordinates into image …
Codef: Content deformation fields for temporally consistent video processing
We present the content deformation field (CoDeF) as a new type of video representation
which consists of a canonical content field aggregating the static contents in the entire video …
which consists of a canonical content field aggregating the static contents in the entire video …
Efficient geometry-aware 3d generative adversarial networks
Unsupervised generation of high-quality multi-view-consistent images and 3D shapes using
only collections of single-view 2D photographs has been a long-standing challenge …
only collections of single-view 2D photographs has been a long-standing challenge …
Alias-free generative adversarial networks
We observe that despite their hierarchical convolutional nature, the synthesis process of
typical generative adversarial networks depends on absolute pixel coordinates in an …
typical generative adversarial networks depends on absolute pixel coordinates in an …
Scale-mae: A scale-aware masked autoencoder for multiscale geospatial representation learning
Large, pretrained models are commonly finetuned with imagery that is heavily augmented to
mimic different conditions and scales, with the resulting models used for various tasks with …
mimic different conditions and scales, with the resulting models used for various tasks with …