Explaining explanations: An overview of interpretability of machine learning

LH Gilpin, D Bau, BZ Yuan, A Bajwa… - 2018 IEEE 5th …, 2018 - ieeexplore.ieee.org
There has recently been a surge of work in explanatory artificial intelligence (XAI). This
research area tackles the important problem that complex machines and algorithms often …

The role of artificial intelligence in achieving the Sustainable Development Goals

R Vinuesa, H Azizpour, I Leite, M Balaam… - Nature …, 2020 - nature.com
The emergence of artificial intelligence (AI) and its progressively wider impact on many
sectors requires an assessment of its effect on the achievement of the Sustainable …

Towards a standard for identifying and managing bias in artificial intelligence

R Schwartz, A Vassilev, K Greene… - 2022 - dwt.com
As individuals and communities interact in and with an environment that is increasingly
virtual, they are often vulnerable to the commodification of their digital footprint. Concepts …

Concrete problems in AI safety

D Amodei, C Olah, J Steinhardt, P Christiano… - arXiv preprint arXiv …, 2016 - arxiv.org
Rapid progress in machine learning and artificial intelligence (AI) has brought increasing
attention to the potential impacts of AI technologies on society. In this paper we discuss one …

Thinking responsibly about responsible AI and 'the dark side' of AI

P Mikalef, K Conboy, JE Lundström… - European Journal of …, 2022 - Taylor & Francis
Artificial Intelligence (AI) has been argued to offer a myriad of improvements in how we work
and live. The notion of AI comprises a wide-ranging set of technologies that allow individuals …

Foundational challenges in assuring alignment and safety of large language models

U Anwar, A Saparov, J Rando, D Paleka… - arXiv preprint arXiv …, 2024 - arxiv.org
This work identifies 18 foundational challenges in assuring the alignment and safety of large
language models (LLMs). These challenges are organized into three different categories …

The precipice: Existential risk and the future of humanity

T Ord - 2020 - books.google.com
This urgent and eye-opening book makes the case that protecting humanity's future is the
central challenge of our time. If all goes well, human history is just beginning. Our species …

Unsolved problems in ML safety

D Hendrycks, N Carlini, J Schulman… - arXiv preprint arXiv …, 2021 - arxiv.org
Machine learning (ML) systems are rapidly increasing in size, are acquiring new
capabilities, and are increasingly deployed in high-stakes settings. As with other powerful …

A review of research into automation in tourism: Launching the Annals of Tourism Research Curated Collection on Artificial Intelligence and Robotics in Tourism

I Tussyadiah - Annals of Tourism Research, 2020 - Elsevier
Driven by the advancements in artificial intelligence (AI) and its related technologies, the
application of intelligent automation in travel and tourism is expected to increase in the …

Artificial intelligence, values, and alignment

I Gabriel - Minds and Machines, 2020 - Springer
This paper looks at philosophical questions that arise in the context of AI alignment. It
defends three propositions. First, normative and technical aspects of the AI alignment …