Trustllm: Trustworthiness in large language models

Y Huang, L Sun, H Wang, S Wu, Q Zhang, Y Li… - arxiv preprint arxiv …, 2024 - arxiv.org
Large language models (LLMs), exemplified by ChatGPT, have gained considerable
attention for their excellent natural language processing capabilities. Nonetheless, these …

[HTML][HTML] Position: TrustLLM: Trustworthiness in large language models

Y Huang, L Sun, H Wang, S Wu… - International …, 2024 - proceedings.mlr.press
Large language models (LLMs) have gained considerable attention for their excellent
natural language processing capabilities. Nonetheless, these LLMs present many …

Datasets for large language models: A comprehensive survey

Y Liu, J Cao, C Liu, K Ding, L ** - arxiv preprint arxiv:2402.18041, 2024 - arxiv.org
This paper embarks on an exploration into the Large Language Model (LLM) datasets,
which play a crucial role in the remarkable advancements of LLMs. The datasets serve as …

Towards a psychological generalist ai: A survey of current applications of large language models and future prospects

T He, G Fu, Y Yu, F Wang, J Li, Q Zhao, C Song… - arxiv preprint arxiv …, 2023 - arxiv.org
The complexity of psychological principles underscore a significant societal challenge, given
the vast social implications of psychological problems. Bridging the gap between …

TencentLLMEval: a hierarchical evaluation of Real-World capabilities for human-aligned LLMs

S **e, W Yao, Y Dai, S Wang, D Zhou, L **… - arxiv preprint arxiv …, 2023 - arxiv.org
Large language models (LLMs) have shown impressive capabilities across various natural
language tasks. However, evaluating their alignment with human preferences remains a …

Generative AI-Based Text Generation Methods Using Pre-Trained GPT-2 Model

R Pandey, H Waghela, S Rakshit, A Rangari… - arxiv preprint arxiv …, 2024 - arxiv.org
This work delved into the realm of automatic text generation, exploring a variety of
techniques ranging from traditional deterministic approaches to more modern stochastic …

The ethical evaluation of large language models and its optimization

Y Lyu, Y Du - AI and Ethics, 2025 - Springer
The utilization of large language models (LLMs) has experienced tremendous growth in the
past few years, bringing numerous benefits and conveniences. Yet, this expansion has also …

CLEVA: Toward Comprehensive and Contamination-Free Language Model Evaluation

Y Li, TL Wong, CT Hung, J Zhao, D Zheng… - arxiv preprint arxiv …, 2024 - arxiv.org
Recent advances in large language models (LLMs) have shown significant promise, yet
their evaluation raises concerns, particularly regarding data contamination due to the lack of …

AC-EVAL: Evaluating Ancient Chinese Language Understanding in Large Language Models

Y Wei, Y Xu, X Wei, S Yang, Y Zhu, Y Li, D Liu… - arxiv preprint arxiv …, 2024 - arxiv.org
Given the importance of ancient Chinese in capturing the essence of rich historical and
cultural heritage, the rapid advancements in Large Language Models (LLMs) necessitate …

ZhuJiu-Knowledge: A Fairer Platform for Evaluating Multiple Knowledge Types in Large Language Models

P Du, S Liang, B Zhang, P Cao, Y Chen… - Proceedings of the …, 2024 - aclanthology.org
The swift advancement in large language models (LLMs) has heightened the importance of
model evaluations. LLMs have acquired a substantial amount of knowledge, and evaluating …