Using large language models in psychology

D Demszky, D Yang, DS Yeager, CJ Bryan… - Nature Reviews …, 2023 - nature.com
Large language models (LLMs), such as OpenAI's GPT-4, Google's Bard or Meta's LLaMa,
have created unprecedented opportunities for analysing and generating language data on a …

A review on fairness in machine learning

D Pessach, E Shmueli - ACM Computing Surveys (CSUR), 2022 - dl.acm.org
An increasing number of decisions regarding the daily lives of human beings are being
controlled by artificial intelligence and machine learning (ML) algorithms in spheres ranging …

Gpt-4 technical report

J Achiam, S Adler, S Agarwal, L Ahmad… - arxiv preprint arxiv …, 2023 - arxiv.org
We report the development of GPT-4, a large-scale, multimodal model which can accept
image and text inputs and produce text outputs. While less capable than humans in many …

Eyes wide shut? exploring the visual shortcomings of multimodal llms

S Tong, Z Liu, Y Zhai, Y Ma… - Proceedings of the …, 2024 - openaccess.thecvf.com
Is vision good enough for language? Recent advancements in multimodal models primarily
stem from the powerful reasoning abilities of large language models (LLMs). However the …

Biases in large language models: origins, inventory, and discussion

R Navigli, S Conia, B Ross - ACM Journal of Data and Information …, 2023 - dl.acm.org
In this article, we introduce and discuss the pervasive issue of bias in the large language
models that are currently at the core of mainstream approaches to Natural Language …

Beyond the imitation game: Quantifying and extrapolating the capabilities of language models

A Srivastava, A Rastogi, A Rao, AAM Shoeb… - arxiv preprint arxiv …, 2022 - arxiv.org
Language models demonstrate both quantitative improvement and new qualitative
capabilities with increasing scale. Despite their potentially transformative impact, these new …

Large language models can accurately predict searcher preferences

P Thomas, S Spielman, N Craswell… - Proceedings of the 47th …, 2024 - dl.acm.org
Much of the evaluation and tuning of a search system relies on relevance labels---
annotations that say whether a document is useful for a given search and searcher. Ideally …

Bias of AI-generated content: an examination of news produced by large language models

X Fang, S Che, M Mao, H Zhang, M Zhao, X Zhao - Scientific Reports, 2024 - nature.com
Large language models (LLMs) have the potential to transform our lives and work through
the content they generate, known as AI-Generated Content (AIGC). To harness this …

On second thought, let's not think step by step! bias and toxicity in zero-shot reasoning

O Shaikh, H Zhang, W Held, M Bernstein… - arxiv preprint arxiv …, 2022 - arxiv.org
Generating a Chain of Thought (CoT) has been shown to consistently improve large
language model (LLM) performance on a wide range of NLP tasks. However, prior work has …

On the opportunities and risks of foundation models

R Bommasani, DA Hudson, E Adeli, R Altman… - arxiv preprint arxiv …, 2021 - arxiv.org
AI is undergoing a paradigm shift with the rise of models (eg, BERT, DALL-E, GPT-3) that are
trained on broad data at scale and are adaptable to a wide range of downstream tasks. We …