Anonymization techniques for privacy preserving data publishing: A comprehensive survey
A Majeed, S Lee - IEEE access, 2020 - ieeexplore.ieee.org
Anonymization is a practical solution for preserving user's privacy in data publishing. Data
owners such as hospitals, banks, social network (SN) service providers, and insurance …
owners such as hospitals, banks, social network (SN) service providers, and insurance …
A software engineering perspective on engineering machine learning systems: State of the art and challenges
G Giray - Journal of Systems and Software, 2021 - Elsevier
Context: Advancements in machine learning (ML) lead to a shift from the traditional view of
software development, where algorithms are hard-coded by humans, to ML systems …
software development, where algorithms are hard-coded by humans, to ML systems …
A multifaceted benchmarking of synthetic electronic health record generation models
Synthetic health data have the potential to mitigate privacy concerns in supporting
biomedical research and healthcare applications. Modern approaches for data generation …
biomedical research and healthcare applications. Modern approaches for data generation …
Using gans for sharing networked time series data: Challenges, initial promise, and open questions
Limited data access is a longstanding barrier to data-driven research and development in
the networked systems community. In this work, we explore if and how generative …
the networked systems community. In this work, we explore if and how generative …
Measuring large-scale social networks with high resolution
This paper describes the deployment of a large-scale study designed to measure human
interactions across a variety of communication channels, with high temporal resolution and …
interactions across a variety of communication channels, with high temporal resolution and …
Gradient-leakage resilient federated learning
Federated learning (FL) is an emerging distributed learning paradigm with default client
privacy because clients can keep sensitive data on their devices and only share local …
privacy because clients can keep sensitive data on their devices and only share local …
Anonymization of location data does not work: A large-scale measurement study
We examine a very large-scale data set of more than 30 billion call records made by 25
million cell phone users across all 50 states of the US and attempt to determine to what …
million cell phone users across all 50 states of the US and attempt to determine to what …
Utility-privacy tradeoffs in databases: An information-theoretic approach
L Sankar, SR Rajagopalan… - IEEE Transactions on …, 2013 - ieeexplore.ieee.org
Ensuring the usefulness of electronic data sources while providing necessary privacy
guarantees is an important unsolved problem. This problem drives the need for an analytical …
guarantees is an important unsolved problem. This problem drives the need for an analytical …
Slicing: A new approach for privacy preserving data publishing
Several anonymization techniques, such as generalization and bucketization, have been
designed for privacy preserving microdata publishing. Recent work has shown that …
designed for privacy preserving microdata publishing. Recent work has shown that …
Privacy-and utility-preserving textual analysis via calibrated multivariate perturbations
Accurately learning from user data while providing quantifiable privacy guarantees provides
an opportunity to build better ML models while maintaining user trust. This paper presents a …
an opportunity to build better ML models while maintaining user trust. This paper presents a …