Synthetic Data: A Look Back and A Look Forward.

JP Reiter - Trans. Data Priv., 2023 - tdp.cat
When initially proposed, synthetic data for disclosure control was generally dismissed as
unlikely to be implemented in practice. Thirty years later, synthetic data are becoming a …

Comparing the utility and disclosure risk of synthetic data with samples of microdata

C Little, M Elliot, R Allmendinger - International Conference on Privacy in …, 2022 - Springer
Most statistical agencies release randomly selected samples of Census microdata, usually
with sample fractions under 10% and with other forms of statistical disclosure control (SDC) …

Synthetic census microdata generation: A comparative study of synthesis methods examining the trade-off between disclosure risk and utility

C Little, R Allmendinger, M Elliot - Journal of Official Statistics, 2024 - journals.sagepub.com
There is growing interest in synthetic data generation as a means of allowing access to
useful data whilst preserving confidentiality. In particular, synthetic microdata generation …

Assessing, visualizing and improving the utility of synthetic data

GM Raab, B Nowok, C Dibben - arXiv preprint arXiv:2109.12717, 2021 - arxiv.org
The synthpop package for R (https://www.synthpop.org.uk) provides tools to allow data
custodians to create synthetic versions of confidential microdata that can be distributed with …

Demonstrating an approach for evaluating synthetic geospatial and temporal epidemiologic data utility: results from analyzing >1.8 million SARS-CoV-2 tests in the …

JA Thomas, RE Foraker, N Zamstein… - Journal of the …, 2022 - academic.oup.com
Objective: This study sought to evaluate whether synthetic data derived from a national
coronavirus disease 2019 (COVID-19) dataset could be used for geospatial and temporal …

Privacy risk from synthetic data: Practical proposals

GM Raab - International Conference on Privacy in Statistical …, 2024 - Springer
This paper proposes and compares measures of identity and attribute disclosure risk for
synthetic data. Data custodians can use the methods proposed here to inform the decision …

Towards a Taxonomy for the Use of Synthetic Data in Advanced Analytics

P Kowalczyk, G Welsch, F Thiesse - arXiv preprint arXiv:2212.02622, 2022 - arxiv.org
The proliferation of deep learning techniques led to a wide range of advanced analytics
applications in important business areas such as predictive maintenance or product …

The Real Deal Behind the Artificial Appeal: Inferential Utility of Tabular Synthetic Data

A Decruyenaere, H Dehaene, P Rabaey… - The 40th Conference on … - openreview.net
Recent advances in generative models facilitate the creation of synthetic data to be made
available for research in privacy-sensitive contexts. However, the analysis of synthetic data …