Mind the interference: Retaining pre-trained knowledge in parameter efficient continual learning of vision-language models

L Tang, Z Tian, K Li, C He, H Zhou, H Zhao, X Li… - … on Computer Vision, 2024 - Springer
This study addresses the Domain-Class Incremental Learning problem, a realistic but
challenging continual learning scenario where both the domain distribution and target …

Spiking wavelet transformer

Y Fang, Z Wang, L Zhang, J Cao, H Chen… - European Conference on …, 2024 - Springer
Spiking neural networks (SNNs) offer an energy-efficient alternative to conventional deep
learning by emulating the event-driven processing manner of the brain. Incorporating …

Instantswap: Fast customized concept swap** across sharp shape differences

C Zhu, K Li, Y Ma, L Tang, C Fang, C Chen… - ar** (CCS) enable a text-to-image model to
swap a concept in the source image with a customized target concept. However, the existing …

A survey of camouflaged object detection and beyond

F **ao, S Hu, Y Shen, C Fang, J Huang, C He… - arxiv preprint arxiv …, 2024 - arxiv.org
Camouflaged Object Detection (COD) refers to the task of identifying and segmenting
objects that blend seamlessly into their surroundings, posing a significant challenge for …

Real-world Image Dehazing with Coherence-based Pseudo Labeling and Cooperative Unfolding Network

C Fang, C He, F **ao, Y Zhang, L Tang… - The Thirty-eighth …, 2024 - openreview.net
Real-world Image Dehazing (RID) aims to alleviate haze-induced degradation in real-world
settings. This task remains challenging due to the complexities in accurately modeling real …

PrideDiff: Physics-Regularized Generalized Diffusion Model for CT Reconstruction

Z Lu, Q Gao, T Wang, Z Yang, Z Wang… - … on Radiation and …, 2024 - ieeexplore.ieee.org
Achieving a lower radiation dose and a faster imaging speed is a pivotal objective of
computed tomography (CT) reconstruction. However, these often come at the cost of …

Enat: Rethinking spatial-temporal interactions in token-based image synthesis

Z Ni, Y Wang, R Zhou, Y Han, J Guo, Z Liu… - arxiv preprint arxiv …, 2024 - arxiv.org
Recently, token-based generation have demonstrated their effectiveness in image synthesis.
As a representative example, non-autoregressive Transformers (NATs) can generate decent …

Will the Inclusion of Generated Data Amplify Bias Across Generations in Future Image Classification Models?

Z Zhang, X Liang, M Feng, S Liang, C Xu - arxiv preprint arxiv:2410.10160, 2024 - arxiv.org
As the demand for high-quality training data escalates, researchers have increasingly turned
to generative models to create synthetic data, addressing data scarcity and enabling …

Densely Connected Parameter-Efficient Tuning for Referring Image Segmentation

J Huang, Z Xu, T Liu, Y Liu, H Han, K Yuan… - arxiv preprint arxiv …, 2025 - arxiv.org
In the domain of computer vision, Parameter-Efficient Tuning (PET) is increasingly replacing
the traditional paradigm of pre-training followed by full fine-tuning. PET is particularly …

Resfusion: Denoising Diffusion Probabilistic Models for Image Restoration Based on Prior Residual Noise

Z Shi, H Zheng, C Xu, C Dong, B Pan, A He… - The Thirty-eighth …, 2023 - openreview.net
Recently, research on denoising diffusion models has expanded its application to the field of
image restoration. Traditional diffusion-based image restoration methods utilize degraded …