WaitGPT: Monitoring and steering conversational LLM agent in data analysis with on-the-fly code visualization

L Xie, C Zheng, H Xia, H Qu, C Zhu-Tian - Proceedings of the 37th …, 2024 - dl.acm.org
Large language models (LLMs) support data analysis through conversational user
interfaces, as exemplified in OpenAI's ChatGPT (formerly known as Advanced Data Analysis …

AI safety in generative AI large language models: A survey

J Chua, Y Li, S Yang, C Wang, L Yao - arXiv preprint arXiv:2407.18369, 2024 - arxiv.org
Large Language Models (LLMs) such as ChatGPT that exhibit generative AI capabilities are
facing accelerated adoption and innovation. The increased presence of Generative AI (GAI) …

Ethical AI Governance: Methods for Evaluating Trustworthy AI

L McCormack, M Bendechache - arXiv preprint arXiv:2409.07473, 2024 - arxiv.org
Trustworthy Artificial Intelligence (TAI) integrates ethics that align with human values, looking
at their influence on AI behaviour and decision-making. Primarily dependent on self …

ValueCompass: A framework of fundamental values for human-AI alignment

H Shen, T Knearem, R Ghosh, YJ Yang, T Mitra… - arXiv preprint arXiv …, 2024 - arxiv.org
As AI systems become more advanced, ensuring their alignment with a diverse range of
individuals and societal values becomes increasingly critical. But how can we capture …

Empirical Impacts of Independent and Collaborative Training on Task Performance and Improvement in Human-AI Teams

C Flathmann, BG Schelble… - Proceedings of the …, 2024 - journals.sagepub.com
With improving AI technology, human-AI teams are becoming increasingly common in
research. Within these teams, humans and AI can work collaboratively to complete shared …

SafetyAnalyst: Interpretable, transparent, and steerable LLM safety moderation

JJ Li, V Pyatkin, M Kleiman-Weiner, L Jiang… - arXiv preprint arXiv …, 2024 - arxiv.org
The ideal LLM content moderation system would be both structurally interpretable (so its
decisions can be explained to users) and steerable (to reflect a community's values or align …

Framework for human–XAI symbiosis: extended self from the dual-process theory perspective

Y Litvinova, P Mikalef, X Luo - Journal of Business Analytics, 2024 - Taylor & Francis
The use of artificial intelligence (AI)-based decision support systems (DSSs) is expected to
enable superior human–XAI performance. To enhance decision-making performance …

Why human-AI relationships need socioaffective alignment

HR Kirk, I Gabriel, C Summerfield, B Vidgen… - arXiv preprint arXiv …, 2025 - arxiv.org
Humans strive to design safe AI systems that align with our goals and remain under our
control. However, as AI capabilities advance, we face a new challenge: the emergence of …

C3AI: Crafting and Evaluating Constitutions for Constitutional AI

Y Kyrychenko, K Zhou, E Bogucka… - arXiv preprint arXiv …, 2025 - arxiv.org
Constitutional AI (CAI) guides LLM behavior using constitutions, but identifying which
principles are most effective for model alignment remains an open challenge. We introduce …

To Rely or Not to Rely? Evaluating Interventions for Appropriate Reliance on Large Language Models

JY Bo, S Wan, A Anderson - arXiv preprint arXiv:2412.15584, 2024 - arxiv.org
As Large Language Models become integral to decision-making, optimism about their
power is tempered with concern over their errors. Users may over-rely on LLM advice that is …