When to make exceptions: Exploring language models as accounts of human moral judgment

Z **, S Levine, F Gonzalez Adauto… - Advances in neural …, 2022 - proceedings.neurips.cc
AI systems are becoming increasingly intertwined with human life. In order to effectively
collaborate with humans and ensure safety, AI systems need to be able to understand …

Open-Ethical AI: Advancements in Open-Source Human-Centric Neural Language Models

S Sicari, JF Cevallos M, A Rizzardi… - ACM Computing …, 2024 - dl.acm.org
This survey summarises the most recent methods for building and assessing helpful, honest,
and harmless neural language models, considering small, medium, and large-size models …

The logic of universalization guides moral judgment

S Levine, M Kleiman-Weiner, L Schulz… - Proceedings of the …, 2020 - National Acad Sciences
To explain why an action is wrong, we sometimes say,“What if everybody did that?” In other
words, even if a single person's behavior is harmless, that behavior may be wrong if it would …

Resource-rational contractualism: A triple theory of moral cognition

S Levine, N Chater, JB Tenenbaum… - Behavioral and Brain …, 2023 - cambridge.org
It is widely agreed upon that morality guides people with conflicting interests towards
agreements of mutual benefit. We therefore might expect numerous proposals for organizing …

When is it acceptable to break the rules? Knowledge representation of moral judgements based on empirical data

E Awad, S Levine, A Loreggia, N Mattei… - Autonomous Agents and …, 2024 - Springer
Constraining the actions of AI systems is one promising way to ensure that these systems
behave in a way that is morally acceptable to humans. But constraints alone come with …

[HTML][HTML] Instantaneous systems of communicative conventions through virtual bargaining

J Misyak, N Chater - Cognition, 2022 - Elsevier
People can instantaneously create novel conventions that link individual communicative
signals to meanings, both in experiments and everyday communication. Yet a basic …

What Makes it Ok to Set a Fire? Iterative Self-distillation of Contexts and Rationales for Disambiguating Defeasible Social and Moral Situations

K Rao, L Jiang, V Pyatkin, Y Gu, N Tandon… - arxiv preprint arxiv …, 2023 - arxiv.org
Moral or ethical judgments rely heavily on the specific contexts in which they occur.
Understanding varying shades of defeasible contextualizations (ie, additional information …

How Gene–Culture Coevolution can—but Probably did not—Track Mind-Independent Moral Truth

N Cofnas - The Philosophical Quarterly, 2023 - academic.oup.com
I argue that our general disposition to make moral judgments and our core moral intuitions
are likely the product of social selection—a kind of gene–culture coevolution driven by the …

A survey on modelling morality for text analysis

I Reinig, M Becker, I Rehbein, SP Ponzetto - 2024 - madoc.bib.uni-mannheim.de
In this survey, we provide a systematic review of recent work on modelling morality in text, an
area of research that has garnered increasing attention in recent years. Our survey is …

Moral flexibility in applying queuing norms can be explained by contractualist principles and game-theoretic considerations

JP White, R Bhui, F Cushman… - Proceedings of the …, 2023 - escholarship.org
People sometimes display moral flexibility by deciding that a commonly accepted moral
norm ought not to apply in particular circumstances. But how? We explore this question in …