[HTML][HTML] Data augmentation: A comprehensive survey of modern approaches

A Mumuni, F Mumuni - Array, 2022 - Elsevier
To ensure good performance, modern machine learning models typically require large
amounts of quality annotated data. Meanwhile, the data collection and annotation processes …

A complete survey on generative ai (aigc): Is chatgpt from gpt-4 to gpt-5 all you need?

C Zhang, C Zhang, S Zheng, Y Qiao, C Li… - arxiv preprint arxiv …, 2023 - arxiv.org
As ChatGPT goes viral, generative AI (AIGC, aka AI-generated content) has made headlines
everywhere because of its ability to analyze and create text, images, and beyond. With such …

Instructpix2pix: Learning to follow image editing instructions

T Brooks, A Holynski, AA Efros - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
We propose a method for editing images from human instructions: given an input image and
a written instruction that tells the model what to do, our model follows these instructions to …

Large-scale photonic chiplet Taichi empowers 160-TOPS/W artificial general intelligence

Z Xu, T Zhou, M Ma, CC Deng, Q Dai, L Fang - Science, 2024 - science.org
The pursuit of artificial general intelligence (AGI) continuously demands higher computing
performance. Despite the superior processing speed and efficiency of integrated photonic …

Deep learning for image inpainting: A survey

H **ang, Q Zou, MA Nawaz, X Huang, F Zhang, H Yu - Pattern Recognition, 2023 - Elsevier
Image inpainting has been widely exploited in the field of computer vision and image
processing. The main purpose of image inpainting is to produce visually plausible structure …

Pixel-aware stable diffusion for realistic image super-resolution and personalized stylization

T Yang, R Wu, P Ren, X **e, L Zhang - European Conference on Computer …, 2024 - Springer
Diffusion models have demonstrated impressive performance in various image generation,
editing, enhancement and translation tasks. In particular, the pre-trained text-to-image stable …

Design guidelines for prompt engineering text-to-image generative models

V Liu, LB Chilton - Proceedings of the 2022 CHI conference on human …, 2022 - dl.acm.org
Text-to-image generative models are a new and powerful way to generate visual artwork.
However, the open-ended nature of text as interaction is double-edged; while users can …

Dreamsim: Learning new dimensions of human visual similarity using synthetic data

S Fu, N Tamir, S Sundaram, L Chai, R Zhang… - arxiv preprint arxiv …, 2023 - arxiv.org
Current perceptual similarity metrics operate at the level of pixels and patches. These
metrics compare images in terms of their low-level colors and textures, but fail to capture mid …

Learning what not to segment: A new perspective on few-shot segmentation

C Lang, G Cheng, B Tu, J Han - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com
Recently few-shot segmentation (FSS) has been extensively developed. Most previous
works strive to achieve generalization through the meta-learning framework derived from …

Underwater dam crack image generation based on unsupervised image-to-image translation

B Huang, F Kang, X Li, S Zhu - Automation in Construction, 2024 - Elsevier
Underwater crack detection is necessary for the safe operation of concrete dams. Current
deep-learning-based underwater crack detection methods rely heavily on a large number of …