Synthetic Data--what, why and how?

J Jordon, L Szpruch, F Houssiau, M Bottarelli… - arxiv preprint arxiv …, 2022 - arxiv.org
This explainer document aims to provide an overview of the current state of the rapidly
expanding work on synthetic data technologies, with a particular focus on privacy. The …

Pioneering new paths: the role of generative modelling in neurological disease research

M Seiler, K Ritter - Pflügers Archiv-European Journal of Physiology, 2024 - Springer
Recently, deep generative modelling has become an increasingly powerful tool with seminal
work in a myriad of disciplines. This powerful modelling approach is supposed to not only …

Noise-aware statistical inference with differentially private synthetic data

O Räisä, J Jälkö, S Kaski… - … Conference on Artificial …, 2023 - proceedings.mlr.press
While generation of synthetic data under differential privacy (DP) has received a lot of
attention in the data privacy community, analysis of synthetic data has received much less …

Mitigating statistical bias within differentially private synthetic data

S Ghalebikesabi, H Wilde, J Jewson… - Uncertainty in …, 2022 - proceedings.mlr.press
Increasing interest in privacy-preserving machine learning has led to new and evolved
approaches for generating private synthetic data from undisclosed real data. However …

Collaborative Learning From Distributed Data With Differentially Private Synthetic Twin Data

L Prediger, J Jälkö, A Honkela, S Kaski - arxiv preprint arxiv:2308.04755, 2023 - arxiv.org
Consider a setting where multiple parties holding sensitive data aim to collaboratively learn
population level statistics, but pooling the sensitive data sets is not possible. We propose a …

Collaborative learning from distributed data with differentially private synthetic data

L Prediger, J Jälkö, A Honkela, S Kaski - BMC Medical Informatics and …, 2024 - Springer
Background Consider a setting where multiple parties holding sensitive data aim to
collaboratively learn population level statistics, but pooling the sensitive data sets is not …

Bayesian inference for inflation volatility modeling in Ghana

CH Korkpoe, F Ahiakpor… - African Journal of …, 2024 - emerald.com
Purpose The purpose of this paper is to emphasize the risks involved in modeling inflation
volatility in the context of macroeconomic policy. For countries like Ghana that are always …

Debiasing Synthetic Data Generated by Deep Generative Models

A Decruyenaere, H Dehaene, P Rabaey… - arxiv preprint arxiv …, 2024 - arxiv.org
While synthetic data hold great promise for privacy protection, their statistical analysis poses
significant challenges that necessitate innovative solutions. The use of deep generative …

Advancing Retail Data Science: Comprehensive Evaluation of Synthetic Data

Y **a, CH Wang, J Mabry, G Cheng - arxiv preprint arxiv:2406.13130, 2024 - arxiv.org
The evaluation of synthetic data generation is crucial, especially in the retail sector where
data accuracy is paramount. This paper introduces a comprehensive framework for …

Privacy-Protected Spatial Autoregressive Model

D Huang, Z Kong, S Wu, H Wang - arxiv preprint arxiv:2403.16773, 2024 - arxiv.org
Spatial autoregressive (SAR) models are important tools for studying network effects.
However, with an increasing emphasis on data privacy, data providers often implement …