Visual tuning

BXB Yu, J Chang, H Wang, L Liu, S Wang… - ACM Computing …, 2024 - dl.acm.org
Fine-tuning visual models has widely shown promising performance on many
downstream visual tasks. With the surprising development of pre-trained visual foundation …

One prompt word is enough to boost adversarial robustness for pre-trained vision-language models

L Li, H Guan, J Qiu, M Spratling - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
Large pre-trained Vision-Language Models (VLMs) like CLIP, despite having
remarkable generalization ability, are highly vulnerable to adversarial examples. This work …

AGD-GAN: Adaptive Gradient-Guided and Depth-supervised generative adversarial networks for ancient mural sketch extraction

Z Yu, S Peng, S Qu, Q Zhang, J Wang… - Expert Systems with …, 2024 - Elsevier
To address the overlooked issues of multi-scale detail feature extraction and disease noise
suppression in mural sketch extraction, we proposed a novel generative adversarial network …

Enhancing object coherence in layout-to-image synthesis

Y Wang, H Xu, C Zhou, W Zhang, C … - ar…
… function to transform images into
different styles or domains while preserving their key structures. Typically, I2I models require …

InstaFormer++: Multi-Domain Instance-Aware Image-to-Image Translation with Transformer

S Kim, J Baek, J Park, E Ha, H Jung, T Lee… - International Journal of …, 2024 - Springer
We present a novel Transformer-based network architecture for instance-aware image-to-image
translation, dubbed InstaFormer, to effectively integrate global- and instance-level …

Empowering LLMs for Multi-Page Layout Generation via Consistency-Oriented In-Context Learning

M Chen, X Zhang, J Zhang, Q Li, T Liu - Proceedings of the 33rd ACM …, 2024 - dl.acm.org
Document layout generation, a burgeoning field of document intelligence, entails positioning
and sizing various elements within given constraints. While significant strides have been …