Causal machine learning for predicting treatment outcomes

S Feuerriegel, D Frauen, V Melnychuk, J Schweisthal… - Nature Medicine, 2024 - nature.com
Causal machine learning (ML) offers flexible, data-driven methods for predicting treatment
outcomes including efficacy and toxicity, thereby supporting the assessment and safety of …

Foundations & trends in multimodal machine learning: Principles, challenges, and open questions

PP Liang, A Zadeh, LP Morency - ACM Computing Surveys, 2024 - dl.acm.org
Multimodal machine learning is a vibrant multi-disciplinary research field that aims to design
computer agents with intelligent capabilities such as understanding, reasoning, and learning …

Towards automated circuit discovery for mechanistic interpretability

A Conmy, A Mavor-Parker, A Lynch… - Advances in …, 2023 - proceedings.neurips.cc
Through considerable effort and intuition, several recent works have reverse-engineered
nontrivial behaviors oftransformer models. This paper systematizes the mechanistic …

Trustworthy llms: a survey and guideline for evaluating large language models' alignment

Y Liu, Y Yao, JF Ton, X Zhang, R Guo, H Cheng… - arxiv preprint arxiv …, 2023 - arxiv.org
Ensuring alignment, which refers to making models behave in accordance with human
intentions [1, 2], has become a critical task before deploying large language models (LLMs) …

Causal inference for time series

J Runge, A Gerhardus, G Varando, V Eyring… - Nature Reviews Earth & …, 2023 - nature.com
Many research questions in Earth and environmental sciences are inherently causal,
requiring robust analyses to establish whether and how changes in one variable cause …

Emulation of randomized clinical trials with nonrandomized database analyses: results of 32 clinical trials

SV Wang, S Schneeweiss, JM Franklin, RJ Desai… - Jama, 2023 - jamanetwork.com
Importance Nonrandomized studies using insurance claims databases can be analyzed to
produce real-world evidence on the effectiveness of medical products. Given the lack of …

How does GPT-2 compute greater-than?: Interpreting mathematical abilities in a pre-trained language model

M Hanna, O Liu, A Variengien - Advances in Neural …, 2023 - proceedings.neurips.cc
Pre-trained language models can be surprisingly adept at tasks they were not explicitly
trained on, but how they implement these capabilities is poorly understood. In this paper, we …

Causal reasoning and large language models: Opening a new frontier for causality

E Kiciman, R Ness, A Sharma, C Tan - Transactions on Machine …, 2023 - openreview.net
The causal capabilities of large language models (LLMs) are a matter of significant debate,
with critical implications for the use of LLMs in societally impactful domains such as …

Using cognitive psychology to understand GPT-3

M Binz, E Schulz - Proceedings of the National Academy of Sciences, 2023 - pnas.org
We study GPT-3, a recent large language model, using tools from cognitive psychology.
More specifically, we assess GPT-3's decision-making, information search, deliberation, and …

Beyond the imitation game: Quantifying and extrapolating the capabilities of language models

A Srivastava, A Rastogi, A Rao, AAM Shoeb… - arxiv preprint arxiv …, 2022 - arxiv.org
Language models demonstrate both quantitative improvement and new qualitative
capabilities with increasing scale. Despite their potentially transformative impact, these new …