Factuality challenges in the era of large language models and opportunities for fact-checking

I Augenstein, T Baldwin, M Cha… - Nature Machine …, 2024 - nature.com
The emergence of tools based on large language models (LLMs), such as OpenAI's
ChatGPT and Google's Gemini, has garnered immense public attention owing to their …

TrustLLM: Trustworthiness in large language models

Y Huang, L Sun, H Wang, S Wu, Q Zhang, Y Li… - arxiv preprint arxiv …, 2024 - arxiv.org
Large language models (LLMs), exemplified by ChatGPT, have gained considerable
attention for their excellent natural language processing capabilities. Nonetheless, these …

Position: TrustLLM: Trustworthiness in large language models

Y Huang, L Sun, H Wang, S Wu… - International …, 2024 - proceedings.mlr.press
Large language models (LLMs) have gained considerable attention for their excellent
natural language processing capabilities. Nonetheless, these LLMs present many …

Factuality challenges in the era of large language models

I Augenstein, T Baldwin, M Cha, T Chakraborty… - arxiv preprint arxiv …, 2023 - arxiv.org
The emergence of tools based on Large Language Models (LLMs), such as OpenAI's
ChatGPT, Microsoft's Bing Chat, and Google's Bard, has garnered immense public attention …

Mitigating the alignment tax of RLHF

Y Lin, H Lin, W Xiong, S Diao, J Liu… - Proceedings of the …, 2024 - aclanthology.org
LLMs acquire a wide range of abilities during pre-training, but aligning LLMs under
Reinforcement Learning with Human Feedback (RLHF) can lead to forgetting pretrained …

Resolving knowledge conflicts in large language models

Y Wang, S Feng, H Wang, W Shi… - arxiv preprint arxiv …, 2023 - arxiv.org
Large language models (LLMs) often encounter knowledge conflicts, scenarios where
discrepancy arises between the internal parametric knowledge of LLMs and non-parametric …

Creator: Tool creation for disentangling abstract and concrete reasoning of large language models

C Qian, C Han, YR Fung, Y Qin, Z Liu, H Ji - arxiv preprint arxiv …, 2023 - arxiv.org
Large Language Models (LLMs) have demonstrated significant progress in utilizing external
APIs as tools for various tasks. However, their tool-using ability is limited by the availability of …

Defining a new NLP playground

S Li, C Han, P Yu, C Edwards, M Li, X Wang… - arxiv preprint arxiv …, 2023 - arxiv.org
The recent explosion of performance of large language models (LLMs) has changed the
field of Natural Language Processing (NLP) more abruptly and seismically than any other …

Towards lifespan cognitive systems

Y Wang, C Han, T Wu, X He, W Zhou, N Sadeq… - arxiv preprint arxiv …, 2024 - arxiv.org
Building a human-like system that continuously interacts with complex environments--
whether simulated digital worlds or human society--presents several key challenges. Central …

Establishing Knowledge Preference in Language Models

S Zhou, S Li, Y Meng, Y Jiao, H Ji, J Han - arxiv preprint arxiv:2407.13048, 2024 - arxiv.org
Language models are known to encode a great amount of factual knowledge through
pretraining. However, such knowledge might be insufficient to cater to user requests …