- Academic Search

T Zhi-Xuan, M Carroll, M Franklin, H Ashton - Philosophical Studies, 2024 - Springer

The dominant practice of AI alignment assumes (1) that preferences are an adequate
representation of human values,(2) that human rationality can be understood in terms of …

Spara Citera Citerat av 9 Relaterade artiklar Alla 6 versionerna

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

The Moral Case for Using Language Model Agents for Recommendation

S Lazar, L Thorburn, T **, L Belli - arxiv preprint arxiv:2410.12123, 2024 - arxiv.org

Our information and communication environment has fallen short of the ideals that
networked global communication might have served. Identifying all the causes of its …

Spara Citera Citerat av 2 Relaterade artiklar Alla 2 versionerna Se som HTML-version

[Free GPT-4]
[DeepSeek]

[PDF] techrxiv.org

Evaluating the cybersecurity robustness of commercial llms against adversarial prompts: A promptbench analysis

T Goto, K Ono, A Morita - Authorea Preprints, 2024 - techrxiv.org

This study presents a comprehensive evaluation of the cybersecurity robustness of five
leading Large Language Models (LLMs)-ChatGPT-4, Google Gemini, Anthropic Claude …

Spara Citera Citerat av 4 Relaterade artiklar Alla 4 versionerna Se som HTML-version

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

GPT-ology, Computational Models, Silicon Sampling: How should we think about LLMs in Cognitive Science?

DC Ong - arxiv preprint arxiv:2406.09464, 2024 - arxiv.org

Large Language Models have taken the cognitive science world by storm. It is perhaps
timely now to take stock of the various research paradigms that have been used to make …

Spara Citera Citerat av 2 Relaterade artiklar Alla 3 versionerna Se som HTML-version

[Free GPT-4]
[DeepSeek]

[PDF] objectives.institute

[PDF][PDF] Value as semantics: Representations of human moral and hedonic value in large language models

A Leshinskaya, C San Franscisco… - … 2023 Workshop: AI …, 2023 - ai.objectives.institute

Aligning AI with human objectives can be facilitated by enabling it to learn and veridically
represent our values. In modern AI agents, value is a scalar magnitude reflecting the …

Spara Citera Citerat av 2 Relaterade artiklar Alla 2 versionerna Se som HTML-version

[Free GPT-4]
[DeepSeek]

[PDF] byu.edu

A Comparative Analysis of Human and Machine Translation Quality

C Marshall - 2024 - scholarsarchive.byu.edu

A common question raised by both translators and Machine Translation developers is Will
Machine Translation (MT) ever attain the level of Human Translation (HT) quality …

Spara Citera Relaterade artiklar Se som HTML-version

[Free GPT-4]
[DeepSeek]

[PDF] aaai.org

Contractual AI: Toward More Aligned, Transparent, and Robust Dialogue Agents

CJ Bates, R Bose, RG Keeney… - Proceedings of the AAAI …, 2023 - ojs.aaai.org

We present a new framework for AI alignment called Contractual AI, and apply it to the
setting of dialogue agents chatting with humans. This framework incorporates and builds on …

Spara Citera Relaterade artiklar Alla 2 versionerna Se som HTML-version

Skapa alarm

Citera

Avancerad sökning

Har sparats i Mitt bibliotek

Neuro-Symbolic Models of Human Moral Judgment: LLMs as Automatic Feature Extractors

Beyond preferences in ai alignment

The Moral Case for Using Language Model Agents for Recommendation

Evaluating the cybersecurity robustness of commercial llms against adversarial prompts: A promptbench analysis

GPT-ology, Computational Models, Silicon Sampling: How should we think about LLMs in Cognitive Science?

[PDF][PDF] Value as semantics: Representations of human moral and hedonic value in large language models

A Comparative Analysis of Human and Machine Translation Quality

Contractual AI: Toward More Aligned, Transparent, and Robust Dialogue Agents