Anonymization: The imperfect science of using data while preserving privacy
Information about us, our actions, and our preferences is created at scale through surveys or
scientific studies or as a result of our interaction with digital devices such as smartphones …
scientific studies or as a result of our interaction with digital devices such as smartphones …
A sco** review of privacy and utility metrics in medical synthetic data
The use of synthetic data is a promising solution to facilitate the sharing and reuse of health-
related data beyond its initial collection while addressing privacy concerns. However, there …
related data beyond its initial collection while addressing privacy concerns. However, there …
Achilles' heels: vulnerable record identification in synthetic data publishing
Synthetic data is seen as the most promising solution to share individual-level data while
preserving privacy. Shadow modeling-based Membership Inference Attacks (MIAs) have …
preserving privacy. Shadow modeling-based Membership Inference Attacks (MIAs) have …
Auditing and generating synthetic data with controllable trust trade-offs
Real-world data often exhibits bias, imbalance, and privacy risks. Synthetic datasets have
emerged to address these issues by enabling a paradigm that relies on generative AI …
emerged to address these issues by enabling a paradigm that relies on generative AI …
On the Inadequacy of Similarity-based Privacy Metrics: Reconstruction Attacks against" Truly Anonymous Synthetic Data''
Training generative models to produce synthetic data is meant to provide a privacy-friendly
approach to data release. However, we get robust guarantees only when models are trained …
approach to data release. However, we get robust guarantees only when models are trained …
FLAIM: AIM-based synthetic data generation in the federated setting
Preserving individual privacy while enabling collaborative data sharing is crucial for
organizations. Synthetic data generation is one solution, producing artificial data that mirrors …
organizations. Synthetic data generation is one solution, producing artificial data that mirrors …
A unified view of differentially private deep generative modeling
The availability of rich and vast data sources has greatly advanced machine learning
applications in various domains. However, data with privacy concerns comes with stringent …
applications in various domains. However, data with privacy concerns comes with stringent …
A zero auxiliary knowledge membership inference attack on aggregate location data
Location data is frequently collected from populations and shared in aggregate form to guide
policy and decision making. However, the prevalence of aggregated data also raises the …
policy and decision making. However, the prevalence of aggregated data also raises the …
NetDPSyn: Synthesizing Network Traces under Differential Privacy
As the utilization of network traces for the network measurement research becomes
increasingly prevalent, concerns regarding privacy leakage from network traces have …
increasingly prevalent, concerns regarding privacy leakage from network traces have …
Snake challenge: Sanitization algorithms under attack
While there were already some privacy challenges organized in the domain of data
sanitization, they have mainly focused on the defense side of the problem. To favor the …
sanitization, they have mainly focused on the defense side of the problem. To favor the …