Dataset distillation by matching training trajectories

G Cazenavette, T Wang, A Torralba… - Proceedings of the …, 2022 - openaccess.thecvf.com
Dataset distillation is the task of synthesizing a small dataset such that a model trained on
the synthetic set will match the test accuracy of the model trained on the full dataset. In this …

Dream: Efficient dataset distillation by representative matching

Y Liu, J Gu, K Wang, Z Zhu… - Proceedings of the …, 2023 - openaccess.thecvf.com
Dataset distillation aims to synthesize small datasets with little information loss from original
large-scale ones for reducing storage and training costs. Recent state-of-the-art methods …

Dataset distillation

T Wang, JY Zhu, A Torralba, AA Efros - arxiv preprint arxiv:1811.10959, 2018 - arxiv.org
Model distillation aims to distill the knowledge of a complex model into a simpler one. In this
paper, we consider an alternative formulation called dataset distillation: we keep the model …

Influence function based data poisoning attacks to top-n recommender systems

M Fang, NZ Gong, J Liu - Proceedings of The Web Conference 2020, 2020 - dl.acm.org
Recommender system is an essential component of web services to engage users. Popular
recommender systems model user preferences and item properties using a large amount of …

Self-paced learning with diversity

L Jiang, D Meng, SI Yu, Z Lan… - Advances in neural …, 2014 - proceedings.neurips.cc
Self-paced learning (SPL) is a recently proposed learning regime inspired by the learning
process of humans and animals that gradually incorporates easy to more complex samples …

Flexible dataset distillation: Learn labels instead of images

O Bohdal, Y Yang, T Hospedales - arxiv preprint arxiv:2006.08572, 2020 - arxiv.org
We study the problem of dataset distillation-creating a small set of synthetic examples
capable of training a good model. In particular, we study the problem of label distillation …

Compressed gastric image generation based on soft-label dataset distillation for medical data sharing

G Li, R Togo, T Ogawa, M Haseyama - Computer Methods and Programs in …, 2022 - Elsevier
Background and objective: Sharing of medical data is required to enable the cross-agency
flow of healthcare information and construct high-accuracy computer-aided diagnosis …

A theoretical understanding of self-paced learning

D Meng, Q Zhao, L Jiang - Information Sciences, 2017 - Elsevier
Self-paced learning (SPL) is a recently proposed methodology designed by mimicking
through the learning principle of humans/animals. A variety of SPL realization schemes have …

Self paced deep learning for weakly supervised object detection

E Sangineto, M Nabi, D Culibrk… - IEEE transactions on …, 2018 - ieeexplore.ieee.org
In a weakly-supervised scenario object detectors need to be trained using image-level
annotation alone. Since bounding-box-level ground truth is not available, most of the …

Background data resampling for outlier-aware classification

Y Li, N Vasconcelos - … of the IEEE/CVF Conference on …, 2020 - openaccess.thecvf.com
The problem of learning an image classifier that allows detection of out-of-distribution (OOD)
examples, with the help of auxiliary background datasets, is studied. While training with …