Dissociating language and thought in large language models

K Mahowald, AA Ivanova, IA Blank, N Kanwisher… - Trends in Cognitive …, 2024 - cell.com
Large language models (LLMs) have come closest among all models to date to mastering
human language, yet opinions about their linguistic and cognitive capabilities remain split …

Opportunities and challenges for ChatGPT and large language models in biomedicine and health

S Tian, Q **, L Yeganova, PT Lai, Q Zhu… - Briefings in …, 2024 - academic.oup.com
ChatGPT has drawn considerable attention from both the general public and domain experts
with its remarkable text generation capabilities. This has subsequently led to the emergence …

Factscore: Fine-grained atomic evaluation of factual precision in long form text generation

S Min, K Krishna, X Lyu, M Lewis, W Yih… - arxiv preprint arxiv …, 2023 - arxiv.org
Evaluating the factuality of long-form text generated by large language models (LMs) is non-
trivial because (1) generations often contain a mixture of supported and unsupported pieces …

Enabling large language models to generate text with citations

T Gao, H Yen, J Yu, D Chen - arxiv preprint arxiv:2305.14627, 2023 - arxiv.org
Large language models (LLMs) have emerged as a widely-used tool for information
seeking, but their generated outputs are prone to hallucination. In this work, our aim is to …

Benchmarking large language models in retrieval-augmented generation

J Chen, H Lin, X Han, L Sun - Proceedings of the AAAI Conference on …, 2024 - ojs.aaai.org
Retrieval-Augmented Generation (RAG) is a promising approach for mitigating the
hallucination of large language models (LLMs). However, existing research lacks rigorous …

[PDF][PDF] Ai transparency in the age of llms: A human-centered research roadmap

QV Liao, JW Vaughan - arxiv preprint arxiv:2306.01941, 2023 - assets.pubpub.org
The rise of powerful large language models (LLMs) brings about tremendous opportunities
for innovation but also looming risks for individuals and society at large. We have reached a …

Crud-rag: A comprehensive chinese benchmark for retrieval-augmented generation of large language models

Y Lyu, Z Li, S Niu, F **ong, B Tang, W Wang… - ACM Transactions on …, 2024 - dl.acm.org
Retrieval-Augmented Generation (RAG) is a technique that enhances the capabilities of
large language models (LLMs) by incorporating external knowledge sources. This method …

Automatic evaluation of attribution by large language models

X Yue, B Wang, Z Chen, K Zhang, Y Su… - arxiv preprint arxiv …, 2023 - arxiv.org
A recent focus of large language model (LLM) development, as exemplified by generative
search engines, is to incorporate external references to generate and support its claims …

Lm vs lm: Detecting factual errors via cross examination

R Cohen, M Hamri, M Geva, A Globerson - arxiv preprint arxiv …, 2023 - arxiv.org
A prominent weakness of modern language models (LMs) is their tendency to generate
factually incorrect text, which hinders their usability. A natural question is whether such …

Right to be forgotten in the era of large language models: Implications, challenges, and solutions

D Zhang, P Finckenberg-Broman, T Hoang, S Pan… - AI and Ethics, 2024 - Springer
Abstract The Right to be Forgotten (RTBF) was first established as the result of the ruling of
Google Spain SL, Google Inc. v AEPD, Mario Costeja González, and was later included as …