Google Scholar

TrustLLM: Trustworthiness in large language models

Y Huang, L Sun, H Wang, S Wu, Q Zhang, Y Li… - arxiv preprint arxiv …, 2024 - arxiv.org

Large language models (LLMs), exemplified by ChatGPT, have gained considerable
attention for their excellent natural language processing capabilities. Nonetheless, these …

Cited by 251

Position: TrustLLM: Trustworthiness in large language models

Y Huang, L Sun, H Wang, S Wu… - International …, 2024 - proceedings.mlr.press

Large language models (LLMs) have gained considerable attention for their excellent
natural language processing capabilities. Nonetheless, these LLMs present many …

Cited by 46

Datasets for large language models: A comprehensive survey

Y Liu, J Cao, C Liu, K Ding, L Jin - arxiv preprint arxiv:2402.18041, 2024 - arxiv.org

This paper embarks on an exploration into the Large Language Model (LLM) datasets,
which play a crucial role in the remarkable advancements of LLMs. The datasets serve as …

Cited by 62

Towards a psychological generalist ai: A survey of current applications of large language models and future prospects

T He, G Fu, Y Yu, F Wang, J Li, Q Zhao, C Song… - arxiv preprint arxiv …, 2023 - arxiv.org

The complexity of psychological principles underscore a significant societal challenge, given
the vast social implications of psychological problems. Bridging the gap between …

Cited by 16

TencentLLMEval: a hierarchical evaluation of Real-World capabilities for human-aligned LLMs

S Xie, W Yao, Y Dai, S Wang, D Zhou, L Jin… - arxiv preprint arxiv …, 2023 - arxiv.org

Large language models (LLMs) have shown impressive capabilities across various natural
language tasks. However, evaluating their alignment with human preferences remains a …

Cited by 4

Generative AI-Based Text Generation Methods Using Pre-Trained GPT-2 Model

R Pandey, H Waghela, S Rakshit, A Rangari… - arxiv preprint arxiv …, 2024 - arxiv.org

This work delved into the realm of automatic text generation, exploring a variety of
techniques ranging from traditional deterministic approaches to more modern stochastic …

Cited by 7

The ethical evaluation of large language models and its optimization

Y Lyu, Y Du - AI and Ethics, 2025 - Springer

The utilization of large language models (LLMs) has experienced tremendous growth in the
past few years, bringing numerous benefits and conveniences. Yet, this expansion has also …

CLEVA: Toward Comprehensive and Contamination-Free Language Model Evaluation

Y Li, TL Wong, CT Hung, J Zhao, D Zheng… - arxiv preprint arxiv …, 2024 - arxiv.org

Recent advances in large language models (LLMs) have shown significant promise, yet
their evaluation raises concerns, particularly regarding data contamination due to the lack of …

AC-EVAL: Evaluating Ancient Chinese Language Understanding in Large Language Models

Y Wei, Y Xu, X Wei, S Yang, Y Zhu, Y Li, D Liu… - arxiv preprint arxiv …, 2024 - arxiv.org

Given the importance of ancient Chinese in capturing the essence of rich historical and
cultural heritage, the rapid advancements in Large Language Models (LLMs) necessitate …

ZhuJiu-Knowledge: A Fairer Platform for Evaluating Multiple Knowledge Types in Large Language Models

P Du, S Liang, B Zhang, P Cao, Y Chen… - Proceedings of the …, 2024 - aclanthology.org

The swift advancement in large language models (LLMs) has heightened the importance of
model evaluations. LLMs have acquired a substantial amount of knowledge, and evaluating …

Cited by 1
