Evaluating large language models: A comprehensive survey
Large language models (LLMs) have demonstrated remarkable capabilities across a broad
spectrum of tasks. They have attracted significant attention and been deployed in numerous …
spectrum of tasks. They have attracted significant attention and been deployed in numerous …
Language (technology) is power: A critical survey of" bias" in nlp
We survey 146 papers analyzing" bias" in NLP systems, finding that their motivations are
often vague, inconsistent, and lacking in normative reasoning, despite the fact that …
often vague, inconsistent, and lacking in normative reasoning, despite the fact that …
CrowS-pairs: A challenge dataset for measuring social biases in masked language models
Pretrained language models, especially masked language models (MLMs) have seen
success across many NLP tasks. However, there is ample evidence that they use the cultural …
success across many NLP tasks. However, there is ample evidence that they use the cultural …
Large language model alignment: A survey
Recent years have witnessed remarkable progress made in large language models (LLMs).
Such advancements, while garnering significant attention, have concurrently elicited various …
Such advancements, while garnering significant attention, have concurrently elicited various …
ChatGPT is on the horizon: could a large language model be suitable for intelligent traffic safety research and applications?
ChatGPT embarks on a new era of artificial intelligence and will revolutionize the way we
approach intelligent traffic safety systems. This paper begins with a brief introduction about …
approach intelligent traffic safety systems. This paper begins with a brief introduction about …
Quantifying social biases in NLP: A generalization and empirical comparison of extrinsic fairness metrics
Measuring bias is key for better understanding and addressing unfairness in NLP/ML
models. This is often done via fairness metrics, which quantify the differences in a model's …
models. This is often done via fairness metrics, which quantify the differences in a model's …
Queens are powerful too: Mitigating gender bias in dialogue generation
Models often easily learn biases present in the training data, and their predictions directly
reflect this bias. We analyze gender bias in dialogue data, and examine how this bias is …
reflect this bias. We analyze gender bias in dialogue data, and examine how this bias is …
Theories of “gender” in nlp bias research
The rise of concern around Natural Language Processing (NLP) technologies containing
and perpetuating social biases has led to a rich and rapidly growing area of research …
and perpetuating social biases has led to a rich and rapidly growing area of research …
On learning fairness and accuracy on multiple subgroups
We propose an analysis in fair learning that preserves the utility of the data while reducing
prediction disparities under the criteria of group sufficiency. We focus on the scenario where …
prediction disparities under the criteria of group sufficiency. We focus on the scenario where …
Multi-dimensional gender bias classification
Machine learning models are trained to find patterns in data. NLP models can inadvertently
learn socially undesirable patterns when training on gender biased text. In this work, we …
learn socially undesirable patterns when training on gender biased text. In this work, we …