A survey of mix-based data augmentation: Taxonomy, methods, applications, and explainability

C Cao, F Zhou, Y Dai, J Wang, K Zhang - ACM Computing Surveys, 2024 - dl.acm.org
Data augmentation (DA) is indispensable in modern machine learning and deep neural
networks. The basic idea of DA is to construct new training data to improve the model's …

Video-text as game players: Hierarchical Banzhaf interaction for cross-modal representation learning

P Jin, J Huang, P Xiong, S Tian, C Liu… - Proceedings of the …, 2023 - openaccess.thecvf.com
Contrastive learning-based video-language representation learning approaches, e.g., CLIP,
have achieved outstanding performance, which pursue semantic interaction upon pre …

OMG: Towards effective graph classification against label noise

N Yin, L Shen, M Wang, X Luo, Z Luo… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Graph classification is a fundamental problem with diverse applications in bioinformatics
and chemistry. Due to the intricate procedures of manual annotations in graphical domains …

Graph mixup with soft alignments

H Ling, Z Jiang, M Liu, S Ji… - … Conference on Machine …, 2023 - proceedings.mlr.press
We study graph data augmentation by mixup, which has been used successfully on images.
A key operation of mixup is to compute a convex combination of a pair of inputs. This …

Text-video retrieval with disentangled conceptualization and set-to-set alignment

P Jin, H Li, Z Cheng, J Huang, Z Wang, L Yuan… - arXiv preprint arXiv …, 2023 - arxiv.org
Text-video retrieval is a challenging cross-modal task, which aims to align visual entities with
natural language descriptions. Current methods either fail to leverage the local details or are …

HaLP: Hallucinating latent positives for skeleton-based self-supervised learning of actions

A Shah, A Roy, K Shah, S Mishra… - Proceedings of the …, 2023 - openaccess.thecvf.com
Supervised learning of skeleton sequence encoders for action recognition has received
significant attention in recent times. However, learning such encoders without labels …

Hierarchical skeleton meta-prototype contrastive learning with hard skeleton mining for unsupervised person re-identification

H Rao, C Leung, C Miao - International Journal of Computer Vision, 2024 - Springer
With rapid advancements in depth sensors and deep learning, skeleton-based person
re-identification (re-ID) models have recently achieved remarkable progress with many …

Embedding space interpolation beyond mini-batch, beyond pairs and beyond examples

S Venkataramanan, E Kijak… - Advances in neural …, 2024 - proceedings.neurips.cc
Mixup refers to interpolation-based data augmentation, originally motivated as a way to go
beyond empirical risk minimization (ERM). Its extensions mostly focus on the definition of …

R-mixup: Riemannian mixup for biological networks

X Kan, Z Li, H Cui, Y Yu, R Xu, S Yu, Z Zhang… - Proceedings of the 29th …, 2023 - dl.acm.org
Biological networks are commonly used in biomedical and healthcare domains to effectively
model the structure of complex biological systems with interactions linking biological entities …

Ensemble quadratic assignment network for graph matching

H Tan, C Wang, S Wu, XY Zhang, F Yin… - International Journal of …, 2024 - Springer
Graph matching is a commonly used technique in computer vision and pattern recognition.
Recent data-driven approaches have improved the graph matching accuracy remarkably …