Adversarial attacks and defenses on text-to-image diffusion models: A survey

C Zhang, M Hu, W Li, L Wang - Information Fusion, 2024 - Elsevier
Recently, the text-to-image diffusion model has gained considerable attention from the
community due to its exceptional image generation capability. A representative model …

PromptFusion: Harmonized semantic prompt learning for infrared and visible image fusion

J Liu, X Li, Z Wang, Z Jiang, W Zhong… - IEEE/CAA Journal of …, 2024 - ieeexplore.ieee.org
The goal of infrared and visible image fusion (TVIF) is to integrate the unique advantages of
both modalities to achieve a more comprehensive understanding of a scene. However …

An infrared and visible image fusion using knowledge measures for intuitionistic fuzzy sets and Swin Transformer

MJ Khan, S Jiang, W Ding, J Huang, H Wang - Information Sciences, 2024 - Elsevier
The objectives of infrared and visible image fusion are to generate a single image that
includes significant objects and rich texture information. However, the current deep-learning …

LG-Diff: Learning to follow local class-regional guidance for nearshore image cross-modality high-quality translation

J Ding, Y Du, W Li, L Pei, N Cui - Information Fusion, 2025 - Elsevier
A major obstacle for nearshore cross-modality visual tasks is the large-scale cross-modality
data collection and the long-tail distribution in special scenes. One of the most common …

Terf: Text-driven and region-aware flexible visible and infrared image fusion

H Wang, H Zhang, X Yi, X **ang, L Fang… - Proceedings of the 32nd …, 2024 - dl.acm.org
The fusion of visible and infrared images aims to produce high-quality fusion images with
rich textures and salient target information. Existing methods lack interactivity and flexibility …

TFDet: Target-aware fusion for RGB-T pedestrian detection

X Zhang, X Zhang, J Wang, J Ying… - … on Neural Networks …, 2024 - ieeexplore.ieee.org
Pedestrian detection plays a critical role in computer vision as it contributes to ensuring
traffic safety. Existing methods that rely solely on RGB images suffer from performance …

DiffusionLSTM: a framework for image sequence generation and its application to oil spill monitoring and prediction

X Lyu, H Han, P Ren, C Grecos - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Oil remains the most important energy source in the world today, and tankers are its main
modes of transportation. However, there is a high risk of oil spills, which can cause serious …

Inter-frame feature fusion enhanced spatio-temporal consistent video inpainting with sample-based techniques and adaptive local search

R Han, S Liao, S Fu, Y Zhou, Y Li, H Han - Journal of Computational and …, 2025 - Elsevier
Video inpainting is an ill-posed inverse problem in image processing focused on eliminating
unwanted objects and restoring damaged or corrupted regions within a video sequence …

Text-DiFuse: An Interactive Multi-Modal Image Fusion Framework based on Text-modulated Diffusion Model

H Zhang, L Cao, J Ma - Advances in Neural Information …, 2025 - proceedings.neurips.cc
Existing multi-modal image fusion methods fail to address the compound degradations
presented in source images, resulting in fusion images plagued by noise, color bias …

DGGI: Deep Generative Gradient Inversion with diffusion model

L Wu, Z Liu, B Pu, K Wei, H Cao, S Yao - Information Fusion, 2025 - Elsevier
Federated learning is a privacy-preserving distributed framework that facilitates information
fusion and sharing among different clients, enabling the training of a global model without …