How do large language models navigate conflicts between honesty and helpfulness?

R Liu, TR Sumers, I Dasgupta, TL Griffiths - arxiv preprint arxiv …, 2024 - arxiv.org
In day-to-day communication, people often approximate the truth-for example, rounding the
time or omitting details-in order to be maximally helpful to the listener. How do large …

Psychomatics—A multidisciplinary framework for understanding artificial minds

G Riva, F Mantovani, BK Wiederhold… - … , Behavior, and Social …, 2024 - liebertpub.com
Although large language models (LLMs) and other artificial intelligence systems
demonstrate cognitive skills similar to humans, such as concept learning and language …

Discursive Socratic Questioning: Evaluating the Faithfulness of Language Models' Understanding of Discourse Relations

Y Miao, H Liu, W Lei, N Chen… - Proceedings of the 62nd …, 2024 - aclanthology.org
While large language models have significantly enhanced the effectiveness of discourse
relation classifications, it remains unclear whether their comprehension is faithful and …

Is On-Device AI Broken and Exploitable? Assessing the Trust and Ethics in Small Language Models

K Nakka, J Dani, N Saxena - arxiv preprint arxiv:2406.05364, 2024 - arxiv.org
In this paper, we present a very first study to investigate trust and ethical implications of on-
device artificial intelligence (AI), focusing on''small''language models (SLMs) amenable for …

Visual Contexts Clarify Ambiguous Expressions: A Benchmark Dataset

H Nam, J Ahn - arxiv preprint arxiv:2411.14137, 2024 - arxiv.org
The ability to perform complex reasoning across multimodal inputs is essential for models to
effectively interact with humans in real-world scenarios. Advancements in vision-language …

Manner implicatures in large language models

Y Cong - Scientific Reports, 2024 - nature.com
In human speakers' daily conversations, what we do not say matters. We not only compute
the literal semantics but also go beyond and draw inferences from what we could have said …

Multiprageval: Multilingual pragmatic evaluation of large language models

D Park, J Lee, S Park, H Jeong, Y Koo… - arxiv preprint arxiv …, 2024 - arxiv.org
As the capabilities of Large Language Models (LLMs) expand, it becomes increasingly
important to evaluate them beyond basic knowledge assessment, focusing on higher-level …

How Useful is Context, Actually? Comparing LLMs and Humans on Discourse Marker Prediction

E Sadlier-Brown, M Lou, M Silfverberg… - Proceedings of the …, 2024 - aclanthology.org
This paper investigates the adverbial discourse particle actually. We compare LLM and
human performance on cloze tests involving actually on examples sourced from the …

Large Language Models forecast Patient Health Trajectories enabling Digital Twins

N Makarov, M Bordukova, R Rodriguez-Esteban… - medRxiv, 2024 - medrxiv.org
Background Generative artificial intelligence (AI) facilitates the development of digital twins,
which enable virtual representations of real patients to explore, predict and simulate patient …

[HTML][HTML] Evaluating large language models' ability using a psychiatric screening tool based on metaphor and sarcasm scenarios

H Yakura - Journal of Intelligence, 2024 - mdpi.com
Metaphors and sarcasm are precious fruits of our highly evolved social communication
skills. However, children with the condition then known as Asperger syndrome are known to …