Pre-train, prompt, and predict: A systematic survey of prompting methods in natural language processing

P Liu, W Yuan, J Fu, Z Jiang, H Hayashi… - ACM Computing …, 2023 - dl.acm.org
This article surveys and organizes research works in a new paradigm in natural language
processing, which we dub “prompt-based learning.” Unlike traditional supervised learning …
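
To make the paradigm concrete, here is a minimal sketch of a prompting function and verbalizer in the survey's sense; the template wording and label-token mapping below are illustrative assumptions, not taken from the paper.

```python
# Minimal sketch of prompt-based learning: wrap the task input in a cloze
# template and map the LM's predicted slot token back to a task label.
# TEMPLATE and VERBALIZER are illustrative choices, not from the survey.

TEMPLATE = "Review: {text} Overall, it was a [MASK] movie."
VERBALIZER = {"positive": "great", "negative": "terrible"}

def build_prompt(text: str) -> str:
    """Map a raw input into a cloze prompt for a pretrained masked LM."""
    return TEMPLATE.format(text=text)

def label_from_token(predicted_token: str) -> str:
    """Invert the verbalizer: predicted slot token -> task label."""
    for label, token in VERBALIZER.items():
        if token == predicted_token:
            return label
    return "unknown"

print(build_prompt("I laughed the whole time."))
print(label_from_token("great"))  # -> positive
```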

A survey of confidence estimation and calibration in large language models

J Geng, F Cai, Y Wang, H Koeppl, P Nakov… - arXiv preprint arXiv …, 2023 - arxiv.org
Large language models (LLMs) have demonstrated remarkable capabilities across a wide
range of tasks in various domains. Despite their impressive performance, they can be …
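
As one concrete instance of what this literature studies, a widely used likelihood-based confidence heuristic is the length-normalized sequence probability; the sketch below assumes you already have per-token log-probabilities from the model.

```python
import math

def sequence_confidence(token_logprobs: list[float]) -> float:
    """Geometric mean of token probabilities, exp(mean log-prob): a common
    likelihood-based confidence score for a generated sequence. This is one
    heuristic among many that the calibration literature examines."""
    return math.exp(sum(token_logprobs) / len(token_logprobs))

# Hypothetical log-probs for a four-token answer:
print(round(sequence_confidence([-0.1, -0.05, -0.3, -0.2]), 3))  # ~0.85
```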

SimPO: Simple preference optimization with a reference-free reward

Y Meng, M **a, D Chen - Advances in Neural Information …, 2025 - proceedings.neurips.cc
Direct Preference Optimization (DPO) is a widely used offline preference
optimization algorithm that reparameterizes reward functions in reinforcement learning from …
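
SimPO's reference-free reward is the length-normalized log-likelihood of a response, pushed through a Bradley-Terry objective with a target margin. A minimal sketch, with beta and gamma set to placeholder values:

```python
import math

def simpo_loss(logp_chosen: float, len_chosen: int,
               logp_rejected: float, len_rejected: int,
               beta: float = 2.0, gamma: float = 0.5) -> float:
    """SimPO: reward = beta * (sequence log-prob / length), no reference
    model; loss = -log sigmoid(r_chosen - r_rejected - gamma).
    beta and gamma here are placeholder hyperparameters."""
    r_chosen = beta * logp_chosen / len_chosen
    r_rejected = beta * logp_rejected / len_rejected
    margin = r_chosen - r_rejected - gamma
    return -math.log(1.0 / (1.0 + math.exp(-margin)))

# A chosen response with higher per-token likelihood than the rejected one:
print(round(simpo_loss(-12.0, 10, -30.0, 12), 3))
```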

Automatic chain of thought prompting in large language models

Z Zhang, A Zhang, M Li, A Smola - arXiv preprint arXiv:2210.03493, 2022 - chatgpthero.io
Large language models (LLMs) can perform complex reasoning by generating intermediate
reasoning steps. Providing these steps for prompting demonstrations is called chain-of …
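
In outline, Auto-CoT clusters the question pool, samples a representative per cluster, and has the model write its own reasoning chains via the zero-shot trigger "Let's think step by step." A simplified sketch; `embed` and `generate` are assumed callables, and the actual method selects by distance to the centroid and filters chains heuristically:

```python
import numpy as np
from sklearn.cluster import KMeans

def build_auto_cot_demos(questions, embed, generate, k=4):
    """Cluster questions, take one per cluster, and let the LLM generate
    the reasoning chain itself. Simplified: real Auto-CoT picks the
    question nearest each centroid and filters chains by heuristics."""
    vecs = np.array([embed(q) for q in questions])
    labels = KMeans(n_clusters=k, n_init=10).fit_predict(vecs)
    demos = []
    for c in range(k):
        i = next(j for j, lab in enumerate(labels) if lab == c)
        chain = generate(f"Q: {questions[i]}\nA: Let's think step by step.")
        demos.append(f"Q: {questions[i]}\nA: Let's think step by step. {chain}")
    return demos
```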

Making language models better reasoners with step-aware verifier

Y Li, Z Lin, S Zhang, Q Fu, B Chen… - Proceedings of the …, 2023 - aclanthology.org
Few-shot learning is a challenging task that requires language models to generalize from
limited examples. Large language models like GPT-3 and PaLM have made impressive …
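
The aggregation step can be pictured as verifier-weighted voting over sampled reasoning paths: each path's final answer accumulates its verifier score, and the highest total wins. The scores below stand in for the paper's learned step-aware verifier.

```python
from collections import defaultdict

def verifier_weighted_vote(scored_candidates):
    """Sum verifier scores per distinct final answer and return the best.
    Each item is (final_answer, verifier_score); in the paper the scores
    come from a learned (step-aware) verifier, not from this sketch."""
    totals = defaultdict(float)
    for answer, score in scored_candidates:
        totals[answer] += score
    return max(totals, key=totals.get)

print(verifier_weighted_vote([("18", 0.9), ("18", 0.7), ("20", 0.8)]))  # 18
```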

Trusting your evidence: Hallucinate less with context-aware decoding

W Shi, X Han, M Lewis, Y Tsvetkov… - Proceedings of the …, 2024 - aclanthology.org
Language models (LMs) often struggle to pay enough attention to the input context,
and generate texts that are unfaithful or contain hallucinations. To mitigate this issue, we …
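
The core of context-aware decoding is a contrastive adjustment of next-token logits: amplify what the context contributes by downweighting the model's context-free prior. A minimal sketch in logit space (alpha around 0.5 is a typical setting in the paper):

```python
import numpy as np

def context_aware_logits(logits_with_ctx, logits_without_ctx, alpha=0.5):
    """Context-aware decoding: decode from
    softmax((1 + alpha) * logits(y | context, x) - alpha * logits(y | x)),
    which boosts tokens that the context makes more likely."""
    return (1 + alpha) * np.asarray(logits_with_ctx) \
        - alpha * np.asarray(logits_without_ctx)

adj = context_aware_logits([2.0, 0.5, -1.0], [1.5, 1.0, -1.0])
probs = np.exp(adj) / np.exp(adj).sum()  # argmax/sample from this instead
```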

Rethinking the role of demonstrations: What makes in-context learning work?

S Min, X Lyu, A Holtzman, M Artetxe, M Lewis… - arXiv preprint arXiv …, 2022 - arxiv.org
Large language models (LMs) are able to in-context learn--perform a new task via inference
alone by conditioning on a few input-label pairs (demonstrations) and making predictions for …
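
The paper's probe is easy to reproduce at the prompt level: keep the demonstration format and label space fixed, but pair inputs with random labels. A sketch of that setup:

```python
import random

def build_icl_prompt(demos, test_input, randomize_labels=False, seed=0):
    """Build a k-shot prompt from (text, label) demonstrations. With
    randomize_labels=True, inputs get labels drawn at random from the
    label space, the manipulation the paper found costs surprisingly
    little accuracy."""
    rng = random.Random(seed)
    label_space = sorted({label for _, label in demos})
    blocks = []
    for text, label in demos:
        shown = rng.choice(label_space) if randomize_labels else label
        blocks.append(f"Input: {text}\nLabel: {shown}")
    blocks.append(f"Input: {test_input}\nLabel:")
    return "\n\n".join(blocks)
```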

Ask me anything: A simple strategy for prompting language models

S Arora, A Narayan, MF Chen, L Orr, N Guha… - arXiv preprint arXiv …, 2022 - arxiv.org
Large language models (LLMs) transfer well to new tasks out-of-the-box simply given a
natural language prompt that demonstrates how to perform the task and no additional …
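
AMA's recipe, in outline: rewrite one question into several effective prompt formats, collect the noisy answers, and aggregate them. The paper aggregates with weak supervision; the majority vote below is a deliberate simplification, and `generate` is an assumed LLM callable.

```python
from collections import Counter

def ask_me_anything(question, prompt_templates, generate):
    """Run the same question through multiple prompt variants and combine
    the answers. Plain majority vote stands in for the paper's weak
    supervision aggregator."""
    answers = [generate(t.format(question=question)) for t in prompt_templates]
    return Counter(answers).most_common(1)[0][0]
```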

Learning to retrieve prompts for in-context learning

O Rubin, J Herzig, J Berant - arXiv preprint arXiv:2112.08633, 2021 - arxiv.org
In-context learning is a recent paradigm in natural language understanding, where a large
pre-trained language model (LM) observes a test instance and a few training examples as …
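
At inference time the idea reduces to scoring training examples against the test instance and packing the top-k into the context. Cosine similarity over precomputed embeddings stands in here for the paper's trained retriever (EPR):

```python
import numpy as np

def retrieve_demonstrations(test_vec, pool_vecs, pool_examples, k=4):
    """Return the k training examples most similar to the test instance.
    Cosine similarity is a stand-in; the paper instead trains the
    retriever with LM-based supervision."""
    pool = np.asarray(pool_vecs, dtype=float)
    q = np.asarray(test_vec, dtype=float)
    sims = pool @ q / (np.linalg.norm(pool, axis=1) * np.linalg.norm(q))
    top = np.argsort(-sims)[:k]
    return [pool_examples[i] for i in top]
```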

Finetuned language models are zero-shot learners

J Wei, M Bosma, VY Zhao, K Guu, AW Yu… - arXiv preprint arXiv …, 2021 - arxiv.org
This paper explores a simple method for improving the zero-shot learning abilities of
language models. We show that instruction tuning--finetuning language models on a …
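
The data side of instruction tuning can be pictured as verbalizing each labeled example under several natural-language instruction templates, then mixing many such tasks for finetuning. The templates below are illustrative, not FLAN's actual ones.

```python
# Illustrative instruction templates for one task (sentiment); FLAN-style
# instruction tuning renders every dataset under several such templates
# and finetunes on the mixture so unseen instructions work zero-shot.

TEMPLATES = [
    "Is the sentiment of this review positive or negative?\n{text}",
    "Review: {text}\nDid the reviewer like the movie? positive or negative:",
]

def to_instruction_examples(text: str, label: str):
    """Render one labeled example under every template variant."""
    return [{"input": t.format(text=text), "target": label} for t in TEMPLATES]

for ex in to_instruction_examples("Great pacing, loved every minute.", "positive"):
    print(ex["input"], "->", ex["target"])
```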