Model merging in LLMs, MLLMs, and beyond: Methods, theories, applications and opportunities

E Yang, L Shen, G Guo, X Wang, X Cao… - arXiv preprint arXiv …, 2024 - arxiv.org
Model merging is an efficient empowerment technique in the machine learning community
that requires neither the collection of raw training data nor expensive …

Merge, ensemble, and cooperate! A survey on collaborative strategies in the era of large language models

J Lu, Z Pang, M Xiao, Y Zhu, R Xia, J Zhang - arXiv preprint arXiv …, 2024 - arxiv.org
The remarkable success of Large Language Models (LLMs) has ushered natural language
processing (NLP) research into a new era. Despite their diverse capabilities, LLMs trained …

Reinforcement Learning Enhanced LLMs: A Survey

S Wang, S Zhang, J Zhang, R Hu, X Li, T Zhang… - arXiv preprint arXiv …, 2024 - arxiv.org
This paper surveys research in the rapidly growing field of enhancing large language
models (LLMs) with reinforcement learning (RL), a technique that enables LLMs to improve …

A Comprehensive Survey of Direct Preference Optimization: Datasets, Theories, Variants, and Applications

W Xiao, Z Wang, L Gan, S Zhao, W He, LA Tuan… - arXiv preprint arXiv …, 2024 - arxiv.org
With the rapid advancement of large language models (LLMs), aligning policy models with
human preferences has become increasingly critical. Direct Preference Optimization (DPO) …
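
As background for this entry and the DPO variants below, the commonly cited DPO objective (Rafailov et al., 2023) is sketched here; it is not quoted from the entry itself. With \pi_\theta the policy being trained, \pi_{\mathrm{ref}} a frozen reference policy, \beta the implicit KL-regularization strength, and (x, y_w, y_l) a prompt paired with a preferred and a dispreferred response:

\mathcal{L}_{\mathrm{DPO}}(\pi_\theta; \pi_{\mathrm{ref}}) = -\,\mathbb{E}_{(x, y_w, y_l) \sim \mathcal{D}}\left[ \log \sigma\!\left( \beta \log \frac{\pi_\theta(y_w \mid x)}{\pi_{\mathrm{ref}}(y_w \mid x)} - \beta \log \frac{\pi_\theta(y_l \mid x)}{\pi_{\mathrm{ref}}(y_l \mid x)} \right) \right]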

Eliminating biased length reliance of direct preference optimization via down-sampled KL divergence

J Lu, J Li, S An, M Zhao, Y He, D Yin, X Sun - arXiv preprint arXiv …, 2024 - arxiv.org
Direct Preference Optimization (DPO) has emerged as a prominent algorithm for the direct
and robust alignment of Large Language Models (LLMs) with human preferences, offering a …

From lists to emojis: How format bias affects model alignment

X Zhang, W Xiong, L Chen, T Zhou, H Huang… - arXiv preprint arXiv …, 2024 - arxiv.org
In this paper, we study format biases in reinforcement learning from human feedback
(RLHF). We observe that many widely-used preference models, including human …

Aqulia-Med LLM: Pioneering full-process open-source medical language models

L Zhao, W Zeng, X Shi, H Zhou, D Hao, Y Lin - arXiv preprint arXiv …, 2024 - arxiv.org
Recently, both closed-source LLMs and open-source communities have made significant
strides, outperforming humans in various general domains. However, their performance in …

Unlocking Decoding-time Controllability: Gradient-Free Multi-Objective Alignment with Contrastive Prompts

T Fu, Y Hou, J McAuley, R Yan - arXiv preprint arXiv:2408.05094, 2024 - arxiv.org
The task of multi-objective alignment aims to balance and control the different
alignment objectives (e.g., helpfulness, harmlessness, and honesty) of large language models …

Superficial safety alignment hypothesis

J Li, JE Kim - arXiv preprint arXiv:2410.10862, 2024 - arxiv.org
As large language models (LLMs) are increasingly integrated into
various applications, ensuring they generate safe and aligned responses is a pressing …

BAPO: Base-Anchored Preference Optimization for Overcoming Forgetting in Large Language Models Personalization

G Lee, M Jeong, Y Kim, H Jung, J Oh, S Kim… - arXiv preprint arXiv …, 2024 - arxiv.org
While learning to align Large Language Models (LLMs) with human preferences has shown
remarkable success, aligning these models to meet diverse user preferences presents …