VmambaIR: Visual state space model for image restoration

Y Shi, B **a, X **, X Wang, T Zhao… - … on Circuits and …, 2025 - ieeexplore.ieee.org
Image restoration is a critical task in low-level computer vision, aiming to restore high-quality
images from degraded inputs. Various models, such as convolutional neural networks …

Unlimited-size diffusion restoration

Y Wang, J Yu, R Yu, J Zhang - Proceedings of the IEEE/CVF …, 2023 - openaccess.thecvf.com
Recently, using diffusion models for zero-shot image restoration (IR) has become a new hot
paradigm. This type of method only needs to use the pre-trained off-the-shelf diffusion …

OPDN: Omnidirectional position-aware deformable network for omnidirectional image super-resolution

X Sun, W Li, Z Zhang, Q Ma, X Sheng… - Proceedings of the …, 2023 - openaccess.thecvf.com
360° omnidirectional images have gained research attention due to their
immersive and interactive experience, particularly in AR/VR applications. However, they …

PVASS-MDD: predictive visual-audio alignment self-supervision for multimodal deepfake detection

Y Yu, X Liu, R Ni, S Yang, Y Zhao… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Deepfake techniques can forge the visual or audio signals in the video, which leads to
inconsistencies between visual and audio (VA) signals. Therefore, multimodal detection …

Blind face restoration for under-display camera via dictionary guided transformer

J Tan, X Chen, T Wang, K Zhang… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
By hiding the front-facing camera below the display panel, Under-Display Camera (UDC)
provides users with a full-screen experience. However, due to the characteristics of the …

Towards real-world blind face restoration with generative diffusion prior

X Chen, J Tan, T Wang, K Zhang… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Blind face restoration is an important task in computer vision and has gained significant
attention due to its wide-range applications. Previous works mainly exploit facial priors to …

Aganet: Attention-guided generative adversarial network for corn hyperspectral images augmentation

W Zhang, Z Li, G Li, L Zhou, W Zhao… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Hyperspectral imaging represents a spectral technique that facilitates the non-destructive
detection of corn seeds. However, the application of deep learning techniques often …

Privacy-preserving remote heart rate estimation from facial videos

D Gupta, A Etemad - … on Systems, Man, and Cybernetics (SMC), 2023 - ieeexplore.ieee.org
Remote Photoplethysmography (rPPG) is the process of estimating PPG from facial videos.
While this approach benefits from contactless interaction, it is reliant on videos of faces …

CLIP2GAN: Towards bridging text with the latent space of GANs

Y Wang, W Zhou, J Bao, W Wang… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
In this work, we are dedicated to text-guided image generation and propose a novel
framework, i.e., CLIP2GAN, by leveraging CLIP model and StyleGAN. The key idea of our …

Overcoming False Illusions in Real-World Face Restoration with Multi-Modal Guided Diffusion Model

K Tao, J Gu, Y Zhang, X Wang, N Cheng - arXiv preprint arXiv:2410.04161, 2024 - arxiv.org
We introduce a novel Multi-modal Guided Real-World Face Restoration (MGFR) technique
designed to improve the quality of facial image restoration from low-quality inputs …