Holistic evaluation of language models

P Liang, R Bommasani, T Lee, D Tsipras… - arxiv preprint arxiv …, 2022 - arxiv.org
Language models (LMs) are becoming the foundation for almost all major language
technologies, but their capabilities, limitations, and risks are not well understood. We present …

Natural language processing for dialects of a language: A survey

A Joshi, R Dabre, D Kanojia, Z Li, H Zhan… - ACM Computing …, 2024 - dl.acm.org
State-of-the-art natural language processing (NLP) models are trained on massive training
corpora, and report a superlative performance on evaluation datasets. This survey delves …

Prompting gpt-3 to be reliable

C Si, Z Gan, Z Yang, S Wang, J Wang… - arxiv preprint arxiv …, 2022 - arxiv.org
Large language models (LLMs) show impressive abilities via few-shot prompting.
Commercialized APIs such as OpenAI GPT-3 further increase their use in real-world …

AI generates covertly racist decisions about people based on their dialect

V Hofmann, PR Kalluri, D Jurafsky, S King - Nature, 2024 - nature.com
Hundreds of millions of people now interact with language models, with uses ranging from
help with writing, to informing hiring decisions. However, these language models are known …

Parameter-efficient fine-tuning design spaces

J Chen, A Zhang, X Shi, M Li, A Smola… - arxiv preprint arxiv …, 2023 - arxiv.org
Parameter-efficient fine-tuning aims to achieve performance comparable to fine-tuning,
using fewer trainable parameters. Several strategies (eg, Adapters, prefix tuning, BitFit, and …

Multi-VALUE: A framework for cross-dialectal English NLP

C Ziems, W Held, J Yang, J Dhamala, R Gupta… - arxiv preprint arxiv …, 2022 - arxiv.org
Dialect differences caused by regional, social, and economic factors cause performance
discrepancies for many groups of language technology users. Inclusive and equitable …

Evaluation of African American language bias in natural language generation

N Deas, J Grieser, S Kleiner, D Patton, E Turcan… - arxiv preprint arxiv …, 2023 - arxiv.org
We evaluate how well LLMs understand African American Language (AAL) in comparison to
their performance on White Mainstream English (WME), the encouraged" standard" form of …

Bias and fairness in large language models: A survey

IO Gallegos, RA Rossi, J Barrow, MM Tanjim… - Computational …, 2024 - direct.mit.edu
Rapid advancements of large language models (LLMs) have enabled the processing,
understanding, and generation of human-like text, with increasing integration into systems …

Mc2: Towards transparent and culturally-aware nlp for minority languages in china

C Zhang, M Tao, Q Huang, J Lin, Z Chen… - Proceedings of the …, 2024 - aclanthology.org
Current large language models demonstrate deficiencies in understanding low-resource
languages, particularly the minority languages in China. This limitation stems from the …

Dialect prejudice predicts AI decisions about people's character, employability, and criminality

V Hofmann, PR Kalluri, D Jurafsky, S King - arxiv preprint arxiv …, 2024 - arxiv.org
Hundreds of millions of people now interact with language models, with uses ranging from
serving as a writing aid to informing hiring decisions. Yet these language models are known …