Machine behaviour
Abstract Machines powered by artificial intelligence increasingly mediate our social, cultural,
economic and political interactions. Understanding the behaviour of artificial intelligence …
economic and political interactions. Understanding the behaviour of artificial intelligence …
Visual interpretability for deep learning: a survey
This paper reviews recent studies in understanding neural-network representations and
learning neural networks with interpretable/disentangled middle-layer representations …
learning neural networks with interpretable/disentangled middle-layer representations …
Sparks of artificial general intelligence: Early experiments with gpt-4
S Bubeck, V Chandrasekaran, R Eldan… - ar** and refining large language
models (LLMs) that exhibit remarkable capabilities across a variety of domains and tasks …
models (LLMs) that exhibit remarkable capabilities across a variety of domains and tasks …
Does the whole exceed its parts? the effect of ai explanations on complementary team performance
Many researchers motivate explainable AI with studies showing that human-AI team
performance on decision-making tasks improves when the AI explains its recommendations …
performance on decision-making tasks improves when the AI explains its recommendations …
Supporting human-ai collaboration in auditing llms with llms
Large language models (LLMs) are increasingly becoming all-powerful and pervasive via
deployment in sociotechnical systems. Yet these language models, be it for classification or …
deployment in sociotechnical systems. Yet these language models, be it for classification or …
Improving fairness in machine learning systems: What do industry practitioners need?
The potential for machine learning (ML) systems to amplify social inequities and unfairness
is receiving increasing popular and academic attention. A surge of recent work has focused …
is receiving increasing popular and academic attention. A surge of recent work has focused …
Beyond accuracy: The role of mental models in human-AI team performance
Decisions made by human-AI teams (eg., AI-advised humans) are increasingly common in
high-stakes domains such as healthcare, criminal justice, and finance. Achieving high team …
high-stakes domains such as healthcare, criminal justice, and finance. Achieving high team …
On interpretability of artificial neural networks: A survey
Deep learning as performed by artificial deep neural networks (DNNs) has achieved great
successes recently in many important areas that deal with text, images, videos, graphs, and …
successes recently in many important areas that deal with text, images, videos, graphs, and …
Understanding the effect of accuracy on trust in machine learning models
We address a relatively under-explored aspect of human-computer interaction: people's
abilities to understand the relationship between a machine learning model's stated …
abilities to understand the relationship between a machine learning model's stated …
Interpretable convolutional neural networks
This paper proposes a method to modify a traditional convolutional neural network (CNN)
into an interpretable CNN, in order to clarify knowledge representations in high conv-layers …
into an interpretable CNN, in order to clarify knowledge representations in high conv-layers …