A survey of mix-based data augmentation: Taxonomy, methods, applications, and explainability

C Cao, F Zhou, Y Dai, J Wang, K Zhang - ACM Computing Surveys, 2024 - dl.acm.org
Data augmentation (DA) is indispensable in modern machine learning and deep neural
networks. The basic idea of DA is to construct new training data to improve the model's …

Lift: Language-interfaced fine-tuning for non-language machine learning tasks

T Dinh, Y Zeng, R Zhang, Z Lin… - Advances in …, 2022 - proceedings.neurips.cc
Fine-tuning pretrained language models (LMs) without making any architectural changes
has become a norm for learning various language downstream tasks. However, for non …

A unified analysis of mixed sample data augmentation: A loss function perspective

C Park, S Yun, S Chun - Advances in neural information …, 2022 - proceedings.neurips.cc
We propose the first unified theoretical analysis of mixed sample data augmentation
(MSDA), such as Mixup and CutMix. Our theoretical results show that regardless of the …

What makes a" good" data augmentation in knowledge distillation-a statistical perspective

H Wang, S Lohit, MN Jones… - Advances in Neural …, 2022 - proceedings.neurips.cc
Abstract Knowledge distillation (KD) is a general neural network training approach that uses
a teacher model to guide the student model. Existing works mainly study KD from the …

Over-training with mixup may hurt generalization

Z Liu, Z Wang, H Guo, Y Mao - arxiv preprint arxiv:2303.01475, 2023 - arxiv.org
Mixup, which creates synthetic training instances by linearly interpolating random sample
pairs, is a simple and yet effective regularization technique to boost the performance of deep …

A dynamic semantic segmentation algorithm with encoder-crossor-decoder structure for pixel-level building cracks

Y Chen, S Dong, B Hu, Q Liu, Y Qu - Measurement Science and …, 2023 - iopscience.iop.org
A large number of newly built infrastructures as well as those constructed in the early stage
are faced with the problems of detection and maintenance. However, it is difficult to detect …

A Survey on Mixup Augmentations and Beyond

X **, H Zhu, S Li, Z Wang, Z Liu, C Yu, H Qin… - arxiv preprint arxiv …, 2024 - arxiv.org
As Deep Neural Networks have achieved thrilling breakthroughs in the past decade, data
augmentations have garnered increasing attention as regularization techniques when …

IntraMix: Intra-Class Mixup Generation for Accurate Labels and Neighbors

S Zheng, H Wang, X Liu - Advances in Neural Information …, 2025 - proceedings.neurips.cc
Abstract Graph Neural Networks (GNNs) have shown great performance in various tasks,
with the core idea of learning from data labels and aggregating messages within the …

Gbmix: Enhancing fairness by group-balanced mixup

S Hong, Y Yoon, H Joo, J Lee - IEEE Access, 2024 - ieeexplore.ieee.org
Mixup is a powerful data augmentation strategy that has been shown to improve the
generalization and adversarial robustness of machine learning classifiers, particularly in …

Mix from Failure: Confusion-Pairing Mixup for Long-Tailed Recognition

Y Yoon, S Hong, H Joo, Y Qin, H Jeong… - arxiv preprint arxiv …, 2024 - arxiv.org
Long-tailed image recognition is a computer vision problem considering a real-world class
distribution rather than an artificial uniform. Existing methods typically detour the problem by …