A unified continual learning framework with general parameter-efficient tuning

Q Gao, C Zhao, Y Sun, T **, G Zhang… - Proceedings of the …, 2023 - openaccess.thecvf.com
The" pre-training-downstream adaptation" presents both new opportunities and challenges
for Continual Learning (CL). Although the recent state-of-the-art in CL is achieved through …

Modular deep learning

J Pfeiffer, S Ruder, I Vulić, EM Ponti - arxiv preprint arxiv:2302.11529, 2023 - arxiv.org
Transfer learning has recently become the dominant paradigm of machine learning. Pre-
trained models fine-tuned for downstream tasks achieve better performance with fewer …

Toward the unification of generative and discriminative visual foundation model: A survey

X Liu, T Zhou, C Wang, Y Wang, Y Wang, Q Cao… - The Visual …, 2024 - Springer
The advent of foundation models, which are pre-trained on vast datasets, has ushered in a
new era of computer vision, characterized by their robustness and remarkable zero-shot …

Large scale foundation models for intelligent manufacturing applications: a survey

H Zhang, SD Semujju, Z Wang, X Lv, K Xu… - Journal of Intelligent …, 2025 - Springer
Although the applications of artificial intelligence especially deep learning have greatly
improved various aspects of intelligent manufacturing, they still face challenges for broader …

A survey on unsupervised anomaly detection algorithms for industrial images

Y Cui, Z Liu, S Lian - IEEE Access, 2023 - ieeexplore.ieee.org
In line with the development of Industry 4.0, surface defect detection/anomaly detection
becomes a topical subject in the industry field. Improving efficiency as well as saving labor …

Open-transmind: A new baseline and benchmark for 1st foundation model challenge of intelligent transportation

Y Shi, F Lv, X Wang, C **a, S Li… - Proceedings of the …, 2023 - openaccess.thecvf.com
With the continuous improvement of computing power and deep learning algorithms in
recent years, the foundation model has grown in popularity. Because of its powerful …

Mdl-nas: A joint multi-domain learning framework for vision transformer

S Wang, T **e, J Cheng, X Zhang… - Proceedings of the …, 2023 - openaccess.thecvf.com
In this work, we introduce MDL-NAS, a unified framework that integrates multiple vision tasks
into a manageable supernet and optimizes these tasks collectively under diverse dataset …

De-fine: De composing and re fin ing visual programs with auto-feedback

M Gao, J Li, H Fei, L Pang, W Ji, G Wang, Z Lv… - Proceedings of the …, 2024 - dl.acm.org
Visual programming, a modular paradigm, integrates different modules and Python
operators to solve various vision-language tasks. Unlike end-to-end models that need task …

Perfhd: Efficient vit architecture performance ranking using hyperdimensional computing

D Ma, P Zhao, X Jiao - … of the IEEE/CVF Conference on …, 2023 - openaccess.thecvf.com
Abstract Neural Architecture Search (NAS) aims at identifying the optimal network
architecture for a specific need in an automated manner, which serves as an alternative to …

Large occluded human image completion via image-prior cooperating

H Zhao, Y Zeng, H Lu, L Wang - … of the AAAI Conference on Artificial …, 2024 - ojs.aaai.org
The completion of large occluded human body images poses a unique challenge for
general image completion methods. The complex shape variations of human bodies make it …