A survey of safety and trustworthiness of large language models through the lens of verification and validation

X Huang, W Ruan, W Huang, G **, Y Dong… - Artificial Intelligence …, 2024 - Springer
Large language models (LLMs) have exploded a new heatwave of AI for their ability to
engage end-users in human-level conversations with detailed and articulate answers across …

Verify-and-edit: A knowledge-enhanced chain-of-thought framework

R Zhao, X Li, S Joty, C Qin, L Bing - arxiv preprint arxiv:2305.03268, 2023 - arxiv.org
As large language models (LLMs) have become the norm in NLP, demonstrating good
performance in generation and reasoning tasks, one of its most fatal disadvantages is the …

An overview of the capabilities of ChatGPT for medical writing and its implications for academic integrity

H Liu, M Azam, S Bin Naeem… - Health Information & …, 2023 - Wiley Online Library
The artificial intelligence (AI) tool ChatGPT, which is based on a large language model
(LLM), is gaining popularity in academic institutions, notably in the medical field. This article …

[PDF][PDF] Is gpt-3 a psychopath? evaluating large language models from a psychological perspective

X Li, Y Li, L Liu, L Bing, S Joty - arxiv preprint arxiv:2212.10529, 2022 - researchgate.net
Are large language models (LLMs) like GPT-3 psychologically safe? In this work, we design
unbiased prompts to evaluate LLMs systematically from a psychological perspective. Firstly …

[HTML][HTML] The potential of ChatGPT as a self-diagnostic tool in common orthopedic diseases: exploratory study

T Kuroiwa, A Sarcon, T Ibara, E Yamada… - Journal of medical …, 2023 - jmir.org
Background Artificial intelligence (AI) has gained tremendous popularity recently, especially
the use of natural language processing (NLP). ChatGPT is a state-of-the-art chatbot capable …

Chatgpt's one-year anniversary: are open-source large language models catching up?

H Chen, F Jiao, X Li, C Qin, M Ravaut, R Zhao… - arxiv preprint arxiv …, 2023 - arxiv.org
Upon its release in late 2022, ChatGPT has brought a seismic shift in the entire landscape of
AI, both in research and commerce. Through instruction-tuning a large language model …

Consistency analysis of chatgpt

ME Jang, T Lukasiewicz - arxiv preprint arxiv:2303.06273, 2023 - arxiv.org
ChatGPT has gained a huge popularity since its introduction. Its positive aspects have been
reported through many media platforms, and some analyses even showed that ChatGPT …

Evaluating psychological safety of large language models

X Li, Y Li, L Qiu, S Joty, L Bing - Proceedings of the 2024 …, 2024 - aclanthology.org
In this work, we designed unbiased prompts to systematically evaluate the psychological
safety of large language models (LLMs). First, we tested five different LLMs by using two …

Retrieving multimodal information for augmented generation: A survey

R Zhao, H Chen, W Wang, F Jiao, XL Do, C Qin… - arxiv preprint arxiv …, 2023 - arxiv.org
As Large Language Models (LLMs) become popular, there emerged an important trend of
using multimodality to augment the LLMs' generation ability, which enables LLMs to better …

Chain-of-knowledge: Grounding large language models via dynamic knowledge adapting over heterogeneous sources

X Li, R Zhao, YK Chia, B Ding, S Joty, S Poria… - arxiv preprint arxiv …, 2023 - arxiv.org
We present chain-of-knowledge (CoK), a novel framework that augments large language
models (LLMs) by dynamically incorporating grounding information from heterogeneous …