A survey on differential privacy for unstructured data content

Y Zhao, J Chen - ACM Computing Surveys (CSUR), 2022 - dl.acm.org
Huge amounts of unstructured data including image, video, audio, and text are ubiquitously
generated and shared, and it is a challenge to protect sensitive personal information in …

[Retracted] The Rise of Cloud Computing: Data Protection, Privacy, and Open Research Challenges—A Systematic Literature Review (SLR)

J Hassan, D Shehzad, U Habib… - Computational …, 2022 - Wiley Online Library
Cloud computing is a long‐standing dream of computing as a utility, where users can store
their data remotely in the cloud to enjoy on‐demand services and high‐quality applications …

Anonymisation models for text data: State of the art, challenges and future directions

P Lison, I Pilán, D Sánchez, M Batet… - Proceedings of the 59th …, 2021 - aclanthology.org
This position paper investigates the problem of automated text anonymisation, which is a
prerequisite for secure sharing of documents containing sensitive information about …

Privacy-preserving cloud computing on sensitive data: A survey of methods, products and challenges

J Domingo-Ferrer, O Farras, J Ribes-González… - Computer …, 2019 - Elsevier
The increasing volume of personal and sensitive data being harvested by data controllers
makes it increasingly necessary to use the cloud not just to store the data, but also to …

The text anonymization benchmark (tab): A dedicated corpus and evaluation framework for text anonymization

I Pilán, P Lison, L Øvrelid, A Papadopoulou… - Computational …, 2022 - direct.mit.edu
We present a novel benchmark and associated evaluation metrics for assessing the
performance of text anonymization methods. Text anonymization, defined as the task of …

Digital forgetting in large language models: A survey of unlearning methods

A Blanco-Justicia, N Jebreel… - Artificial Intelligence …, 2025 - Springer
Large language models (LLMs) have become the state of the art in natural language
processing. The massive adoption of generative LLMs and the capabilities they have shown …

Privacy-and utility-preserving textual analysis via calibrated multivariate perturbations

O Feyisetan, B Balle, T Drake, T Diethe - … on web search and data mining, 2020 - dl.acm.org
Accurately learning from user data while providing quantifiable privacy guarantees provides
an opportunity to build better ML models while maintaining user trust. This paper presents a …

[HTML][HTML] Towards a universal privacy model for electronic health record systems: an ontology and machine learning approach

R Nowrozy, K Ahmed, H Wang, T Mcintosh - Informatics, 2023 - mdpi.com
This paper proposed a novel privacy model for Electronic Health Records (EHR) systems
utilizing a conceptual privacy ontology and Machine Learning (ML) methodologies. It …

[PDF][PDF] Generalised differential privacy for text document processing

N Fernandes, M Dras, A McIver - … , POST 2019, Held as Part of the …, 2019 - library.oapen.org
We address the problem of how to “obfuscate” texts by removing stylistic clues which can
identify authorship, whilst preserving (as much as possible) the content of the text. In this …

Unlocking digital archives: cross-disciplinary perspectives on AI and born-digital data

L Jaillant, A Caputo - AI & society, 2022 - Springer
Co-authored by a Computer Scientist and a Digital Humanist, this article examines the
challenges faced by cultural heritage institutions in the digital age, which have led to the …