Datasets for large language models: A comprehensive survey

Y Liu, J Cao, C Liu, K Ding, L Jin - arXiv preprint arXiv:2402.18041, 2024 - arxiv.org
This paper embarks on an exploration into Large Language Model (LLM) datasets, which
play a crucial role in the remarkable advancements of LLMs. The datasets serve as …

A survey on large language models with multilingualism: Recent advances and new frontiers

K Huang, F Mo, H Li, Y Li, Y Zhang, W Yi, Y Mao… - arXiv preprint arXiv …, 2024 - arxiv.org
Rapidly developing Large Language Models (LLMs) demonstrate remarkable multilingual
capabilities in natural language processing, attracting global attention in both …

SimPO: Simple preference optimization with a reference-free reward

Y Meng, M Xia, D Chen - Advances in Neural Information Processing Systems, 2025 - proceedings.neurips.cc
Direct Preference Optimization (DPO) is a widely used offline preference
optimization algorithm that reparameterizes reward functions in reinforcement learning from …
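
SimPO replaces DPO's reference-model reward with a length-normalized log-probability under the policy itself. A minimal Python sketch of the per-pair loss, assuming the average per-token log-probabilities are already computed; the beta and gamma defaults below are illustrative placeholders, not the paper's tuned settings:

    import math

    def simpo_loss(avg_logp_chosen, avg_logp_rejected, beta=2.0, gamma=1.0):
        """Sketch of the SimPO objective for one preference pair (not the authors' code).

        avg_logp_chosen / avg_logp_rejected: length-normalized log-probabilities
        of the chosen and rejected responses under the policy being trained.
        beta scales the implicit reward; gamma is the target reward margin.
        """
        margin = beta * (avg_logp_chosen - avg_logp_rejected) - gamma
        # Negative log-sigmoid of the margin (Bradley-Terry preference loss).
        return math.log(1.0 + math.exp(-margin))

    # Example: the chosen response is more probable per token than the rejected one.
    print(simpo_loss(-1.2, -1.5))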

LlamaFactory: Unified efficient fine-tuning of 100+ language models

Y Zheng, R Zhang, J Zhang, Y Ye, Z Luo… - arXiv preprint arXiv …, 2024 - arxiv.org
Efficient fine-tuning is vital for adapting large language models (LLMs) to downstream tasks.
However, it requires non-trivial efforts to implement these methods on different models. We …

TrustLLM: Trustworthiness in large language models

L Sun, Y Huang, H Wang, S Wu, Q Zhang… - arXiv preprint arXiv …, 2024 - mosis.eecs.utk.edu
Large language models (LLMs), exemplified by ChatGPT, have gained considerable
attention for their excellent natural language processing capabilities. Nonetheless, these …

HarmBench: A standardized evaluation framework for automated red teaming and robust refusal

M Mazeika, L Phan, X Yin, A Zou, Z Wang, N Mu… - arXiv preprint arXiv …, 2024 - arxiv.org
Automated red teaming holds substantial promise for uncovering and mitigating the risks
associated with the malicious use of large language models (LLMs), yet the field lacks a …

Personal LLM agents: Insights and survey about the capability, efficiency and security

Y Li, H Wen, W Wang, X Li, Y Yuan, G Liu, J Liu… - arXiv preprint arXiv …, 2024 - arxiv.org
Since the advent of personal computing devices, intelligent personal assistants (IPAs) have
been one of the key technologies that researchers and engineers have focused on, aiming …

Camels in a changing climate: Enhancing LM adaptation with Tulu 2

H Ivison, Y Wang, V Pyatkin, N Lambert… - arXiv preprint arXiv …, 2023 - arxiv.org
Since the release of TÜLU [Wang et al., 2023b], open resources for instruction tuning have
developed quickly, from better base models to new finetuning techniques. We test and …

MMLU-Pro: A more robust and challenging multi-task language understanding benchmark

Y Wang, X Ma, G Zhang, Y Ni, A Chandra… - The Thirty-eighth …, 2024 - openreview.net
In the age of large-scale language models, benchmarks like the Massive Multitask
Language Understanding (MMLU) have been pivotal in pushing the boundaries of what AI …

Position: TrustLLM: Trustworthiness in large language models

Y Huang, L Sun, H Wang, S Wu… - International Conference on Machine Learning, 2024 - proceedings.mlr.press
Large language models (LLMs) have gained considerable attention for their excellent
natural language processing capabilities. Nonetheless, these LLMs present many …