Machine behaviour

I Rahwan, M Cebrian, N Obradovich, J Bongard… - Nature, 2019 - nature.com
Abstract Machines powered by artificial intelligence increasingly mediate our social, cultural,
economic and political interactions. Understanding the behaviour of artificial intelligence …

Visual interpretability for deep learning: a survey

Q Zhang, SC Zhu - Frontiers of Information Technology & Electronic …, 2018 - Springer
This paper reviews recent studies in understanding neural-network representations and
learning neural networks with interpretable/disentangled middle-layer representations …

Does the whole exceed its parts? the effect of ai explanations on complementary team performance

G Bansal, T Wu, J Zhou, R Fok, B Nushi… - Proceedings of the …, 2021 - dl.acm.org
Many researchers motivate explainable AI with studies showing that human-AI team
performance on decision-making tasks improves when the AI explains its recommendations …

Supporting human-ai collaboration in auditing llms with llms

C Rastogi, M Tulio Ribeiro, N King, H Nori… - Proceedings of the 2023 …, 2023 - dl.acm.org
Large language models (LLMs) are increasingly becoming all-powerful and pervasive via
deployment in sociotechnical systems. Yet these language models, be it for classification or …

Improving fairness in machine learning systems: What do industry practitioners need?

K Holstein, J Wortman Vaughan, H Daumé III… - Proceedings of the …, 2019 - dl.acm.org
The potential for machine learning (ML) systems to amplify social inequities and unfairness
is receiving increasing popular and academic attention. A surge of recent work has focused …

Beyond accuracy: The role of mental models in human-AI team performance

G Bansal, B Nushi, E Kamar, WS Lasecki… - Proceedings of the AAAI …, 2019 - aaai.org
Decisions made by human-AI teams (eg., AI-advised humans) are increasingly common in
high-stakes domains such as healthcare, criminal justice, and finance. Achieving high team …

On interpretability of artificial neural networks: A survey

FL Fan, J **ong, M Li, G Wang - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
Deep learning as performed by artificial deep neural networks (DNNs) has achieved great
successes recently in many important areas that deal with text, images, videos, graphs, and …

Understanding the effect of accuracy on trust in machine learning models

M Yin, J Wortman Vaughan, H Wallach - … of the 2019 chi conference on …, 2019 - dl.acm.org
We address a relatively under-explored aspect of human-computer interaction: people's
abilities to understand the relationship between a machine learning model's stated …

Interpretable convolutional neural networks

Q Zhang, YN Wu, SC Zhu - Proceedings of the IEEE …, 2018 - openaccess.thecvf.com
This paper proposes a method to modify a traditional convolutional neural network (CNN)
into an interpretable CNN, in order to clarify knowledge representations in high conv-layers …