Adversarial attacks and defenses on text-to-image diffusion models: A survey
Recently, the text-to-image diffusion model has gained considerable attention from the
community due to its exceptional image generation capability. A representative model …
community due to its exceptional image generation capability. A representative model …
PromptFusion: Harmonized semantic prompt learning for infrared and visible image fusion
The goal of infrared and visible image fusion (TVIF) is to integrate the unique advantages of
both modalities to achieve a more comprehensive understanding of a scene. However …
both modalities to achieve a more comprehensive understanding of a scene. However …
An infrared and visible image fusion using knowledge measures for intuitionistic fuzzy sets and Swin Transformer
The objectives of infrared and visible image fusion are to generate a single image that
includes significant objects and rich texture information. However, the current deep-learning …
includes significant objects and rich texture information. However, the current deep-learning …
LG-Diff: Learning to follow local class-regional guidance for nearshore image cross-modality high-quality translation
A major obstacle for nearshore cross-modality visual tasks is the large-scale cross-modality
data collection and the long-tail distribution in special scenes. One of the most common …
data collection and the long-tail distribution in special scenes. One of the most common …
Terf: Text-driven and region-aware flexible visible and infrared image fusion
The fusion of visible and infrared images aims to produce high-quality fusion images with
rich textures and salient target information. Existing methods lack interactivity and flexibility …
rich textures and salient target information. Existing methods lack interactivity and flexibility …
TFDet: Target-aware fusion for RGB-T pedestrian detection
X Zhang, X Zhang, J Wang, J Ying… - … on Neural Networks …, 2024 - ieeexplore.ieee.org
Pedestrian detection plays a critical role in computer vision as it contributes to ensuring
traffic safety. Existing methods that rely solely on RGB images suffer from performance …
traffic safety. Existing methods that rely solely on RGB images suffer from performance …
DiffusionLSTM: a framework for image sequence generation and its application to oil spill monitoring and prediction
Oil remains the most important energy source in the world today, and tankers are its main
modes of transportation. However, there is a high risk of oil spills, which can cause serious …
modes of transportation. However, there is a high risk of oil spills, which can cause serious …
Inter-frame feature fusion enhanced spatio-temporal consistent video inpainting with sample-based techniques and adaptive local search
R Han, S Liao, S Fu, Y Zhou, Y Li, H Han - Journal of Computational and …, 2025 - Elsevier
Video inpainting is an ill-posed inverse problem in image processing focused on eliminating
unwanted objects and restoring damaged or corrupted regions within a video sequence …
unwanted objects and restoring damaged or corrupted regions within a video sequence …
Text-DiFuse: An Interactive Multi-Modal Image Fusion Framework based on Text-modulated Diffusion Model
Existing multi-modal image fusion methods fail to address the compound degradations
presented in source images, resulting in fusion images plagued by noise, color bias …
presented in source images, resulting in fusion images plagued by noise, color bias …
DGGI: Deep Generative Gradient Inversion with diffusion model
Federated learning is a privacy-preserving distributed framework that facilitates information
fusion and sharing among different clients, enabling the training of a global model without …
fusion and sharing among different clients, enabling the training of a global model without …