Retrieval-augmented generation for natural language processing: A survey

S Wu, Y Xiong, Y Cui, H Wu, C Chen, Y Yuan… - arXiv preprint arXiv …, 2024 - arxiv.org
Large language models (LLMs) have demonstrated great success across many fields,
benefiting from the vast number of parameters in which they store knowledge. However, LLMs still …
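
The technique named in the title can be illustrated with a minimal retrieval-augmented generation loop: retrieve the passages most similar to a query, then splice them into the prompt handed to a generator. The toy corpus, the bag-of-words scoring, and the prompt template below are illustrative assumptions, not the survey's own pipeline.

```python
# Minimal RAG loop: score corpus passages against the query with bag-of-words
# cosine similarity, keep the top-k, and assemble an augmented prompt that an
# LLM would then complete. Corpus and template are placeholders.
from collections import Counter
import math

CORPUS = [
    "Retrieval-augmented generation grounds LLM answers in external documents.",
    "Vision transformers split an image into patches before self-attention.",
    "Graph-based agents explore nodes of a document graph for long contexts.",
]

def bow(text: str) -> Counter:
    """Lower-cased bag-of-words term counts."""
    return Counter(text.lower().split())

def cosine(a: Counter, b: Counter) -> float:
    shared = set(a) & set(b)
    dot = sum(a[t] * b[t] for t in shared)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0

def retrieve(query: str, k: int = 2) -> list[str]:
    q = bow(query)
    return sorted(CORPUS, key=lambda doc: cosine(q, bow(doc)), reverse=True)[:k]

def build_prompt(query: str) -> str:
    """Concatenate retrieved passages with the question; an LLM completes the rest."""
    context = "\n".join(f"- {p}" for p in retrieve(query))
    return f"Context:\n{context}\n\nQuestion: {query}\nAnswer:"

if __name__ == "__main__":
    print(build_prompt("How does retrieval-augmented generation help LLMs?"))
```

In practice the bag-of-words scorer would be replaced by a dense embedding model and a vector index, but the retrieve-then-prompt structure stays the same.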

Fine-tuning image transformers using learnable memory

M Sandler, A Zhmoginov… - Proceedings of the …, 2022 - openaccess.thecvf.com
In this paper, we propose augmenting Vision Transformer models with learnable memory
tokens. Our approach allows the model to adapt to new tasks using only a few additional parameters, while …
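
A minimal sketch of the idea described in the snippet: prepend a small set of learnable memory tokens to the patch embeddings of a frozen transformer encoder, so that only the memory tokens and the classifier head are trained for the new task. The toy encoder, token counts, and dimensions below are assumptions for illustration, not the paper's released model.

```python
# Learnable memory tokens on a frozen encoder: the extra tokens are the only
# new parameters (besides the head) updated when adapting to a downstream task.
import torch
import torch.nn as nn

class MemoryTokenViT(nn.Module):
    def __init__(self, dim=128, num_memory=4, num_classes=10):
        super().__init__()
        self.backbone = nn.TransformerEncoder(
            nn.TransformerEncoderLayer(d_model=dim, nhead=4, batch_first=True),
            num_layers=2,
        )
        # The pretrained backbone stays frozen; only memory tokens + head adapt.
        for p in self.backbone.parameters():
            p.requires_grad = False
        self.memory = nn.Parameter(torch.zeros(1, num_memory, dim))
        self.head = nn.Linear(dim, num_classes)

    def forward(self, patch_embeddings):          # (batch, num_patches, dim)
        b = patch_embeddings.size(0)
        mem = self.memory.expand(b, -1, -1)       # broadcast memory tokens per sample
        tokens = torch.cat([mem, patch_embeddings], dim=1)
        encoded = self.backbone(tokens)
        return self.head(encoded[:, 0])           # classify from the first memory token

model = MemoryTokenViT()
logits = model(torch.randn(2, 16, 128))           # two fake images, 16 patch embeddings each
print(logits.shape)                               # torch.Size([2, 10])
```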

GraphReader: Building graph-based agent to enhance long-context abilities of large language models

S Li, Y He, H Guo, X Bu, G Bai, J Liu, J Liu, X Qu… - arXiv preprint arXiv …, 2024 - arxiv.org
Long-context capabilities are essential for large language models (LLMs) to tackle complex,
long-input tasks. Despite numerous efforts to optimize LLMs for long contexts …
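
A highly simplified stand-in for the graph-based reading the title refers to: split a long document into chunks, link chunks that share keywords, and explore the graph outward from the chunks matching a question. The actual method relies on an LLM agent to plan the exploration; the keyword-overlap graph and breadth-first walk below are illustrative substitutes.

```python
# Build a graph over document chunks (edges = shared keywords), then walk it
# breadth-first from the chunks that mention a question keyword. A crude proxy
# for agent-driven graph exploration over long inputs.
from collections import defaultdict, deque

def build_chunk_graph(document: str, chunk_size: int = 50):
    words = document.split()
    chunks = [" ".join(words[i:i + chunk_size]) for i in range(0, len(words), chunk_size)]
    keyword_to_chunks = defaultdict(set)
    for idx, chunk in enumerate(chunks):
        for word in set(chunk.lower().split()):
            if len(word) > 5:                     # crude keyword filter
                keyword_to_chunks[word].add(idx)
    edges = defaultdict(set)
    for members in keyword_to_chunks.values():
        for a in members:
            edges[a] |= members - {a}             # connect chunks sharing a keyword
    return chunks, edges

def explore(chunks, edges, question: str, budget: int = 3):
    """Breadth-first walk starting from chunks that mention a question keyword."""
    q_words = {w.lower() for w in question.split() if len(w) > 5}
    start = [i for i, c in enumerate(chunks) if q_words & set(c.lower().split())]
    seen, queue, collected = set(start), deque(start), []
    while queue and len(collected) < budget:
        node = queue.popleft()
        collected.append(chunks[node])
        for nxt in edges[node]:
            if nxt not in seen:
                seen.add(nxt)
                queue.append(nxt)
    return collected

doc = ("Long documents overflow the context window of language models. " * 20
       + "The treasury report attributes the shortfall to currency fluctuations. " * 5)
chunks, edges = build_chunk_graph(doc)
print(explore(chunks, edges, "What explains the shortfall in the treasury report?"))
```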