VmambaIR: Visual state space model for image restoration
Image restoration is a critical task in low-level computer vision, aiming to restore high-quality
images from degraded inputs. Various models, such as convolutional neural networks …
Unlimited-size diffusion restoration
Recently, using diffusion models for zero-shot image restoration (IR) has become a new hot
paradigm. This type of method only needs to use the pre-trained off-the-shelf diffusion …
OPDN: Omnidirectional position-aware deformable network for omnidirectional image super-resolution
360° omnidirectional images have gained research attention due to their
immersive and interactive experience, particularly in AR/VR applications. However, they …
PVASS-MDD: predictive visual-audio alignment self-supervision for multimodal deepfake detection
Deepfake techniques can forge the visual or audio signals in the video, which leads to
inconsistencies between visual and audio (VA) signals. Therefore, multimodal detection …
Blind face restoration for under-display camera via dictionary guided transformer
By hiding the front-facing camera below the display panel, Under-Display Camera (UDC)
provides users with a full-screen experience. However, due to the characteristics of the …
Towards real-world blind face restoration with generative diffusion prior
Blind face restoration is an important task in computer vision and has gained significant
attention due to its wide-range applications. Previous works mainly exploit facial priors to …
Aganet: Attention-guided generative adversarial network for corn hyperspectral images augmentation
Hyperspectral imaging represents a spectral technique that facilitates the non-destructive
detection of corn seeds. However, the application of deep learning techniques often …
Privacy-preserving remote heart rate estimation from facial videos
Remote Photoplethysmography (rPPG) is the process of estimating PPG from facial videos.
While this approach benefits from contactless interaction, it is reliant on videos of faces …
CLIP2GAN: Towards bridging text with the latent space of GANs
In this work, we are dedicated to text-guided image generation and propose a novel
framework, i.e., CLIP2GAN, by leveraging CLIP model and StyleGAN. The key idea of our …
Overcoming False Illusions in Real-World Face Restoration with Multi-Modal Guided Diffusion Model
We introduce a novel Multi-modal Guided Real-World Face Restoration (MGFR) technique
designed to improve the quality of facial image restoration from low-quality inputs …