Typology of risks of generative text-to-image models

C Bird, E Ungless, A Kasirzadeh - Proceedings of the 2023 AAAI/ACM …, 2023 - dl.acm.org
This paper investigates the direct risks and harms associated with modern text-to-image
generative models, such as DALL-E and Midjourney, through a comprehensive literature …

Algorithmic bias in education

RS Baker, A Hawn - International journal of artificial intelligence in …, 2022 - Springer
In this paper, we review algorithmic bias in education, discussing the causes of that bias and
reviewing the empirical literature on the specific ways that algorithmic bias is known to have …

Holistic evaluation of language models

P Liang, R Bommasani, T Lee, D Tsipras… - arxiv preprint arxiv …, 2022 - arxiv.org
Language models (LMs) are becoming the foundation for almost all major language
technologies, but their capabilities, limitations, and risks are not well understood. We present …

Doremi: Optimizing data mixtures speeds up language model pretraining

SM **e, H Pham, X Dong, N Du, H Liu… - Advances in …, 2023 - proceedings.neurips.cc
The mixture proportions of pretraining data domains (eg, Wikipedia, books, web text) greatly
affect language model (LM) performance. In this paper, we propose Domain Reweighting …

Data selection for language models via importance resampling

SM **e, S Santurkar, T Ma… - Advances in Neural …, 2023 - proceedings.neurips.cc
Selecting a suitable pretraining dataset is crucial for both general-domain (eg, GPT-3) and
domain-specific (eg, Codex) language models (LMs). We formalize this problem as selecting …

AI generates covertly racist decisions about people based on their dialect

V Hofmann, PR Kalluri, D Jurafsky, S King - Nature, 2024 - nature.com
Hundreds of millions of people now interact with language models, with uses ranging from
help with writing, to informing hiring decisions. However, these language models are known …

On the opportunities and risks of foundation models

R Bommasani, DA Hudson, E Adeli, R Altman… - arxiv preprint arxiv …, 2021 - arxiv.org
AI is undergoing a paradigm shift with the rise of models (eg, BERT, DALL-E, GPT-3) that are
trained on broad data at scale and are adaptable to a wide range of downstream tasks. We …

The evolution of distributed systems for graph neural networks and their origin in graph processing and deep learning: A survey

J Vatter, R Mayer, HA Jacobsen - ACM Computing Surveys, 2023 - dl.acm.org
Graph neural networks (GNNs) are an emerging research field. This specialized deep
neural network architecture is capable of processing graph structured data and bridges the …

Language (technology) is power: A critical survey of" bias" in nlp

SL Blodgett, S Barocas, H Daumé III… - arxiv preprint arxiv …, 2020 - arxiv.org
We survey 146 papers analyzing" bias" in NLP systems, finding that their motivations are
often vague, inconsistent, and lacking in normative reasoning, despite the fact that …

Racial disparities in automated speech recognition

A Koenecke, A Nam, E Lake, J Nudell… - Proceedings of the …, 2020 - pnas.org
Automated speech recognition (ASR) systems, which use sophisticated machine-learning
algorithms to convert spoken language to text, have become increasingly widespread …