A survey of controllable text generation using transformer-based pre-trained language models

H Zhang, H Song, S Li, M Zhou, D Song - ACM Computing Surveys, 2023 - dl.acm.org
Controllable Text Generation (CTG) is an emerging area in the field of natural language
generation (NLG). It is regarded as crucial for the development of advanced text generation …

[HTML][HTML] A survey on large language model (llm) security and privacy: The good, the bad, and the ugly

Y Yao, J Duan, K Xu, Y Cai, Z Sun, Y Zhang - High-Confidence Computing, 2024 - Elsevier
Abstract Large Language Models (LLMs), such as ChatGPT and Bard, have revolutionized
natural language understanding and generation. They possess deep language …

" I'm sorry to hear that": Finding New Biases in Language Models with a Holistic Descriptor Dataset

EM Smith, M Hall, M Kambadur, E Presani… - arxiv preprint arxiv …, 2022 - arxiv.org
As language models grow in popularity, it becomes increasingly important to clearly
measure all possible markers of demographic identity in order to avoid perpetuating existing …

Safetybench: Evaluating the safety of large language models with multiple choice questions

Z Zhang, L Lei, L Wu, R Sun, Y Huang, C Long… - arxiv preprint arxiv …, 2023 - arxiv.org
With the rapid development of Large Language Models (LLMs), increasing attention has
been paid to their safety concerns. Consequently, evaluating the safety of LLMs has become …

Fairness in large language models: A taxonomic survey

Z Chu, Z Wang, W Zhang - ACM SIGKDD explorations newsletter, 2024 - dl.acm.org
Large Language Models (LLMs) have demonstrated remarkable success across various
domains. However, despite their promising performance in numerous real-world …

Sustainable modular debiasing of language models

A Lauscher, T Lueken, G Glavaš - arxiv preprint arxiv:2109.03646, 2021 - arxiv.org
Unfair stereotypical biases (eg, gender, racial, or religious biases) encoded in modern
pretrained language models (PLMs) have negative ethical implications for widespread …

Towards trustworthy LLMs: a review on debiasing and dehallucinating in large language models

Z Lin, S Guan, W Zhang, H Zhang, Y Li… - Artificial Intelligence …, 2024 - Springer
Recently, large language models (LLMs) have attracted considerable attention due to their
remarkable capabilities. However, LLMs' generation of biased or hallucinatory content …

“I'm fully who I am”: Towards Centering Transgender and Non-Binary Voices to Measure Biases in Open Language Generation

A Ovalle, P Goyal, J Dhamala, Z Jaggers… - Proceedings of the …, 2023 - dl.acm.org
Warning: This paper contains examples of gender non-affirmative language which could be
offensive, upsetting, and/or triggering. Transgender and non-binary (TGNB) individuals …