Synthetic Data--what, why and how?

J Jordon, L Szpruch, F Houssiau, M Bottarelli… - arxiv preprint arxiv …, 2022 - arxiv.org
This explainer document aims to provide an overview of the current state of the rapidly
expanding work on synthetic data technologies, with a particular focus on privacy. The …

Evaluating differentially private machine learning in practice

B Jayaraman, D Evans - 28th USENIX Security Symposium (USENIX …, 2019 - usenix.org
Differential privacy is a strong notion for privacy that can be used to prove formal
guarantees, in terms of a privacy budget, ε, about how much information is leaked by a …

Recent advances of differential privacy in centralized deep learning: A systematic survey

L Demelius, R Kern, A Trügler - ACM Computing Surveys, 2025 - dl.acm.org
Differential privacy has become a widely popular method for data protection in machine
learning, especially since it allows formulating strict mathematical privacy guarantees. This …

Privacy preserving synthetic data release using deep learning

NC Abay, Y Zhou, M Kantarcioglu… - Machine Learning and …, 2019 - Springer
For many critical applications ranging from health care to social sciences, releasing
personal data while protecting individual privacy is paramount. Over the years, data …

Sok: Privacy-preserving data synthesis

Y Hu, F Wu, Q Li, Y Long, GM Garrido… - … IEEE Symposium on …, 2024 - ieeexplore.ieee.org
As the prevalence of data analysis grows, safeguarding data privacy has become a
paramount concern. Consequently, there has been an upsurge in the development of …

Differentially private latent diffusion models

MF Liu, S Lyu, M Vinaroz, M Park - arxiv preprint arxiv:2305.15759, 2023 - arxiv.org
Diffusion models (DMs) are one of the most widely used generative models for producing
high quality images. However, a flurry of recent papers points out that DMs are least private …

Robust and privacy-preserving collaborative training: a comprehensive survey

F Yang, X Zhang, S Guo, D Chen, Y Gan… - Artificial Intelligence …, 2024 - Springer
Increasing numbers of artificial intelligence systems are employing collaborative machine
learning techniques, such as federated learning, to build a shared powerful deep model …

P3gm: Private high-dimensional data release via privacy preserving phased generative model

S Takagi, T Takahashi, Y Cao… - 2021 IEEE 37th …, 2021 - ieeexplore.ieee.org
How can we release a massive volume of sensitive data while mitigating privacy risks?
Privacy-preserving data synthesis enables the data holder to outsource analytical tasks to …

Pre-trained perceptual features improve differentially private image generation

F Harder, MJ Asadabadi, DJ Sutherland… - arxiv preprint arxiv …, 2022 - arxiv.org
Training even moderately-sized generative models with differentially-private stochastic
gradient descent (DP-SGD) is difficult: the required level of noise for reasonable levels of …

Exact inference with approximate computation for differentially private data via perturbations

R Gong - arxiv preprint arxiv:1909.12237, 2019 - arxiv.org
This paper discusses how two classes of approximate computation algorithms can be
adapted, in a modular fashion, to achieve exact statistical inference from differentially private …