Federated benchmarking of medical artificial intelligence with MedPerf

A Karargyris, R Umeton, MJ Sheller… - Nature machine …, 2023 - nature.com
Medical artificial intelligence (AI) has tremendous potential to advance healthcare by
supporting and contributing to the evidence-based practice of medicine, personalizing …

Power hungry processing: Watts driving the cost of AI deployment?

S Luccioni, Y Jernite, E Strubell - The 2024 ACM Conference on …, 2024 - dl.acm.org
Recent years have seen a surge in the popularity of commercial AI products based on
generative, multi-purpose AI systems promising a unified approach to building machine …

Holistic evaluation of language models

R Bommasani, P Liang, T Lee - … of the New York Academy of …, 2023 - Wiley Online Library
Abstract Language models (LMs) like GPT‐3, PaLM, and ChatGPT are the foundation for
almost all major language technologies, but their capabilities, limitations, and risks are not …

Hype, Sustainability, and the Price of the Bigger-is-Better Paradigm in AI

G Varoquaux, AS Luccioni, M Whittaker - arxiv preprint arxiv:2409.14160, 2024 - arxiv.org
With the growing attention and investment in recent AI approaches such as large language
models, the narrative that the larger the AI system the more valuable, powerful and …

Fly-swat or cannon? cost-effective language model choice via meta-modeling

M Šakota, M Peyrard, R West - … of the 17th ACM International Conference …, 2024 - dl.acm.org
Generative language models (LMs) have become omnipresent across data science. For a
wide variety of tasks, inputs can be phrased as natural language prompts for an LM, from …

Language model crossover: Variation through few-shot prompting

E Meyerson, MJ Nelson, H Bradley, A Gaier… - ACM Transactions on …, 2024 - dl.acm.org
This article pursues the insight that language models naturally enable an intelligent variation
operator similar in spirit to evolutionary crossover. In particular, language models of …

Reclaiming the digital commons: A public data trust for training data

A Chan, H Bradley, N Rajkumar - Proceedings of the 2023 AAAI/ACM …, 2023 - dl.acm.org
Democratization of AI means not only that people can freely use AI, but also that people can
collectively decide how AI is to be used. In particular, collective decision-making power is …

Chef: A comprehensive evaluation framework for standardized assessment of multimodal large language models

Z Shi, Z Wang, H Fan, Z Yin, L Sheng, Y Qiao… - arxiv preprint arxiv …, 2023 - arxiv.org
Multimodal Large Language Models (MLLMs) have shown impressive abilities in interacting
with visual content with myriad potential downstream tasks. However, even though a list of …

Clinical efficacy of pre-trained large language models through the lens of aphasia

Y Cong, AN LaCroix, J Lee - Scientific Reports, 2024 - nature.com
The rapid development of large language models (LLMs) motivates us to explore how such
state-of-the-art natural language processing systems can inform aphasia research. What …

InterroLang: Exploring NLP models and datasets through dialogue-based explanations

N Feldhus, Q Wang, T Anikina, S Chopra… - arxiv preprint arxiv …, 2023 - arxiv.org
While recently developed NLP explainability methods let us open the black box in various
ways (Madsen et al., 2022), a missing ingredient in this endeavor is an interactive tool …