- Academic Search

H Zhang, H Song, S Li, M Zhou, D Song - ACM Computing Surveys, 2023 - dl.acm.org

Controllable Text Generation (CTG) is an emerging area in the field of natural language
generation (NLG). It is regarded as crucial for the development of advanced text generation …

Save Cite Cited by 344 Related articles All 3 versions Free GPT-4

[Free GPT-4]

[HTML] sciencedirect.com

[HTML][HTML] A survey on large language model (llm) security and privacy: The good, the bad, and the ugly

Y Yao, J Duan, K Xu, Y Cai, Z Sun, Y Zhang - High-Confidence Computing, 2024 - Elsevier

Abstract Large Language Models (LLMs), such as ChatGPT and Bard, have revolutionized
natural language understanding and generation. They possess deep language …

Save Cite Cited by 509 Related articles All 11 versions Free GPT-4

[Free GPT-4]

[PDF] arxiv.org

Marked personas: Using natural language prompts to measure stereotypes in language models

M Cheng, E Durmus, D Jurafsky - ar** techniques …

Save Cite Cited by 192 Related articles All 5 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] arxiv.org

" I'm sorry to hear that": Finding New Biases in Language Models with a Holistic Descriptor Dataset

EM Smith, M Hall, M Kambadur, E Presani… - arxiv preprint arxiv …, 2022 - arxiv.org

As language models grow in popularity, it becomes increasingly important to clearly
measure all possible markers of demographic identity in order to avoid perpetuating existing …

Save Cite Cited by 147 Related articles All 3 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] arxiv.org

Safetybench: Evaluating the safety of large language models with multiple choice questions

Z Zhang, L Lei, L Wu, R Sun, Y Huang, C Long… - arxiv preprint arxiv …, 2023 - arxiv.org

With the rapid development of Large Language Models (LLMs), increasing attention has
been paid to their safety concerns. Consequently, evaluating the safety of LLMs has become …

Save Cite Cited by 129 Related articles All 2 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] arxiv.org

Fairness in large language models: A taxonomic survey

Z Chu, Z Wang, W Zhang - ACM SIGKDD explorations newsletter, 2024 - dl.acm.org

Large Language Models (LLMs) have demonstrated remarkable success across various
domains. However, despite their promising performance in numerous real-world …

Save Cite Cited by 27 Related articles All 2 versions Free GPT-4

[Free GPT-4]

[PDF] arxiv.org

Sustainable modular debiasing of language models

A Lauscher, T Lueken, G Glavaš - arxiv preprint arxiv:2109.03646, 2021 - arxiv.org

Unfair stereotypical biases (eg, gender, racial, or religious biases) encoded in modern
pretrained language models (PLMs) have negative ethical implications for widespread …

Save Cite Cited by 144 Related articles All 5 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] springer.com

Towards trustworthy LLMs: a review on debiasing and dehallucinating in large language models

Z Lin, S Guan, W Zhang, H Zhang, Y Li… - Artificial Intelligence …, 2024 - Springer

Recently, large language models (LLMs) have attracted considerable attention due to their
remarkable capabilities. However, LLMs' generation of biased or hallucinatory content …

Save Cite Cited by 19 Related articles All 4 versions Free GPT-4

[Free GPT-4]

[PDF] acm.org

“I'm fully who I am”: Towards Centering Transgender and Non-Binary Voices to Measure Biases in Open Language Generation

A Ovalle, P Goyal, J Dhamala, Z Jaggers… - Proceedings of the …, 2023 - dl.acm.org

Warning: This paper contains examples of gender non-affirmative language which could be
offensive, upsetting, and/or triggering. Transgender and non-binary (TGNB) individuals …

Save Cite Cited by 51 Related articles All 4 versions Free GPT-4

Create alert

Cite

Advanced search

Saved to My library

RedditBias: A real-world resource for bias evaluation and debiasing of conversational language...

A survey of controllable text generation using transformer-based pre-trained language models

[HTML][HTML] A survey on large language model (llm) security and privacy: The good, the bad, and the ugly

Marked personas: Using natural language prompts to measure stereotypes in language models

" I'm sorry to hear that": Finding New Biases in Language Models with a Holistic Descriptor Dataset

Safetybench: Evaluating the safety of large language models with multiple choice questions

Fairness in large language models: A taxonomic survey

Sustainable modular debiasing of language models

Towards trustworthy LLMs: a review on debiasing and dehallucinating in large language models

“I'm fully who I am”: Towards Centering Transgender and Non-Binary Voices to Measure Biases in Open Language Generation