Anonymization: The imperfect science of using data while preserving privacy

A Gadotti, L Rocher, F Houssiau, AM Creţu… - Science …, 2024 - science.org
Information about us, our actions, and our preferences is created at scale through surveys or
scientific studies or as a result of our interaction with digital devices such as smartphones …

A sco** review of privacy and utility metrics in medical synthetic data

B Kaabachi, J Despraz, T Meurers, K Otte… - npj Digital …, 2025 - nature.com
The use of synthetic data is a promising solution to facilitate the sharing and reuse of health-
related data beyond its initial collection while addressing privacy concerns. However, there …

Achilles' heels: vulnerable record identification in synthetic data publishing

M Meeus, F Guepin, AM Creţu… - European Symposium on …, 2023 - Springer
Synthetic data is seen as the most promising solution to share individual-level data while
preserving privacy. Shadow modeling-based Membership Inference Attacks (MIAs) have …

Auditing and generating synthetic data with controllable trust trade-offs

B Belgodere, P Dognin, A Ivankay… - IEEE Journal on …, 2024 - ieeexplore.ieee.org
Real-world data often exhibits bias, imbalance, and privacy risks. Synthetic datasets have
emerged to address these issues by enabling a paradigm that relies on generative AI …

On the Inadequacy of Similarity-based Privacy Metrics: Reconstruction Attacks against" Truly Anonymous Synthetic Data''

G Ganev, E De Cristofaro - arxiv preprint arxiv:2312.05114, 2023 - arxiv.org
Training generative models to produce synthetic data is meant to provide a privacy-friendly
approach to data release. However, we get robust guarantees only when models are trained …

FLAIM: AIM-based synthetic data generation in the federated setting

S Maddock, G Cormode, C Maple - Proceedings of the 30th ACM …, 2024 - dl.acm.org
Preserving individual privacy while enabling collaborative data sharing is crucial for
organizations. Synthetic data generation is one solution, producing artificial data that mirrors …

A unified view of differentially private deep generative modeling

D Chen, R Kerkouche, M Fritz - arxiv preprint arxiv:2309.15696, 2023 - arxiv.org
The availability of rich and vast data sources has greatly advanced machine learning
applications in various domains. However, data with privacy concerns comes with stringent …

A zero auxiliary knowledge membership inference attack on aggregate location data

V Guan, F Guépin, AM Cretu… - arxiv preprint arxiv …, 2024 - arxiv.org
Location data is frequently collected from populations and shared in aggregate form to guide
policy and decision making. However, the prevalence of aggregated data also raises the …

NetDPSyn: Synthesizing Network Traces under Differential Privacy

D Sun, JQ Chen, C Gong, T Wang, Z Li - … of the 2024 ACM on Internet …, 2024 - dl.acm.org
As the utilization of network traces for the network measurement research becomes
increasingly prevalent, concerns regarding privacy leakage from network traces have …

Snake challenge: Sanitization algorithms under attack

T Allard, L Béziaud, S Gambs - Proceedings of the 32nd ACM …, 2023 - dl.acm.org
While there were already some privacy challenges organized in the domain of data
sanitization, they have mainly focused on the defense side of the problem. To favor the …