Google Наука

Turnitin 降AI改写早检测系统早降重系统 Turnitin-UK版万方检测-期刊版维普编辑部版 Grammarly检测 Paperpass检测 checkpass检测 PaperYY检测

How do large language models navigate conflicts between honesty and helpfulness?

R Liu, TR Sumers, I Dasgupta, TL Griffiths - arxiv preprint arxiv …, 2024 - arxiv.org

In day-to-day communication, people often approximate the truth-for example, rounding the
time or omitting details-in order to be maximally helpful to the listener. How do large …

Запазване Позоваване С позовавания в 15 Сродни статии Всички 8 версии Във вид на HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Psychomatics—A multidisciplinary framework for understanding artificial minds

G Riva, F Mantovani, BK Wiederhold… - … , Behavior, and Social …, 2024 - liebertpub.com

Although large language models (LLMs) and other artificial intelligence systems
demonstrate cognitive skills similar to humans, such as concept learning and language …

Запазване Позоваване С позовавания в 1 Сродни статии Всички 7 версии

[Free GPT-4]
[DeepSeek]

[PDF] aclanthology.org

Discursive Socratic Questioning: Evaluating the Faithfulness of Language Models' Understanding of Discourse Relations

Y Miao, H Liu, W Lei, N Chen… - Proceedings of the 62nd …, 2024 - aclanthology.org

While large language models have significantly enhanced the effectiveness of discourse
relation classifications, it remains unclear whether their comprehension is faithful and …

Запазване Позоваване Сродни статии Всички 3 версии Във вид на HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Is On-Device AI Broken and Exploitable? Assessing the Trust and Ethics in Small Language Models

K Nakka, J Dani, N Saxena - arxiv preprint arxiv:2406.05364, 2024 - arxiv.org

In this paper, we present a very first study to investigate trust and ethical implications of on-
device artificial intelligence (AI), focusing on''small''language models (SLMs) amenable for …

Запазване Позоваване С позовавания в 1 Сродни статии Всички 3 версии Във вид на HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Visual Contexts Clarify Ambiguous Expressions: A Benchmark Dataset

H Nam, J Ahn - arxiv preprint arxiv:2411.14137, 2024 - arxiv.org

The ability to perform complex reasoning across multimodal inputs is essential for models to
effectively interact with humans in real-world scenarios. Advancements in vision-language …

Запазване Позоваване С позовавания в 1 Сродни статии Всички 2 версии Във вид на HTML

[Free GPT-4]
[DeepSeek]

[PDF] nature.com

Manner implicatures in large language models

Y Cong - Scientific Reports, 2024 - nature.com

In human speakers' daily conversations, what we do not say matters. We not only compute
the literal semantics but also go beyond and draw inferences from what we could have said …

Запазване Позоваване Сродни статии Всички 5 версии

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Multiprageval: Multilingual pragmatic evaluation of large language models

D Park, J Lee, S Park, H Jeong, Y Koo… - arxiv preprint arxiv …, 2024 - arxiv.org

As the capabilities of Large Language Models (LLMs) expand, it becomes increasingly
important to evaluate them beyond basic knowledge assessment, focusing on higher-level …

Запазване Позоваване С позовавания в 2 Сродни статии Всички 4 версии Във вид на HTML

[Free GPT-4]
[DeepSeek]

[PDF] aclanthology.org

How Useful is Context, Actually? Comparing LLMs and Humans on Discourse Marker Prediction

E Sadlier-Brown, M Lou, M Silfverberg… - Proceedings of the …, 2024 - aclanthology.org

This paper investigates the adverbial discourse particle actually. We compare LLM and
human performance on cloze tests involving actually on examples sourced from the …

Запазване Позоваване С позовавания в 2 Сродни статии Всички 2 версии Във вид на HTML

[Free GPT-4]
[DeepSeek]

[PDF] medrxiv.org

Large Language Models forecast Patient Health Trajectories enabling Digital Twins

N Makarov, M Bordukova, R Rodriguez-Esteban… - medRxiv, 2024 - medrxiv.org

Background Generative artificial intelligence (AI) facilitates the development of digital twins,
which enable virtual representations of real patients to explore, predict and simulate patient …

Запазване Позоваване С позовавания в 1 Сродни статии Всички 4 версии Във вид на HTML

[Free GPT-4]
[DeepSeek]

[HTML] mdpi.com

[HTML][HTML] Evaluating large language models' ability using a psychiatric screening tool based on metaphor and sarcasm scenarios

H Yakura - Journal of Intelligence, 2024 - mdpi.com

Metaphors and sarcasm are precious fruits of our highly evolved social communication
skills. However, children with the condition then known as Asperger syndrome are known to …

Запазване Позоваване С позовавания в 1 Сродни статии Всички 9 версии Кеширана версия

Позоваване

Разширено търсене

Запазено в „Моята библиотека“

How do large language models navigate conflicts between honesty and helpfulness?

Psychomatics—A multidisciplinary framework for understanding artificial minds

Discursive Socratic Questioning: Evaluating the Faithfulness of Language Models' Understanding of Discourse Relations

Is On-Device AI Broken and Exploitable? Assessing the Trust and Ethics in Small Language Models

Visual Contexts Clarify Ambiguous Expressions: A Benchmark Dataset

Manner implicatures in large language models

Multiprageval: Multilingual pragmatic evaluation of large language models

How Useful is Context, Actually? Comparing LLMs and Humans on Discourse Marker Prediction

Large Language Models forecast Patient Health Trajectories enabling Digital Twins

[HTML][HTML] Evaluating large language models' ability using a psychiatric screening tool based on metaphor and sarcasm scenarios