Google Tudós

Large dual encoders are generalizable retrievers

Turnitin 降AI改写早检测系统早降重系统 Turnitin-UK版万方检测-期刊版维普编辑部版 Grammarly检测 Paperpass检测 checkpass检测 PaperYY检测

Siren's song in the AI ocean: a survey on hallucination in large language models

Y Zhang, Y Li, L Cui, D Cai, L Liu, T Fu… - arxiv preprint arxiv …, 2023 - arxiv.org

While large language models (LLMs) have demonstrated remarkable capabilities across a
range of downstream tasks, a significant concern revolves around their propensity to exhibit …

Mentés Hivatkozás Idézetek száma: 981 Kapcsolódó cikkek Mind a(z) 2 változat Tárolt változat

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

C-pack: Packed resources for general chinese embeddings

S **ao, Z Liu, P Zhang, N Muennighoff, D Lian… - Proceedings of the 47th …, 2024 - dl.acm.org

We introduce C-Pack, a package of resources that significantly advances the field of general
text embeddings for Chinese. C-Pack includes three critical resources. 1) C-MTP is a …

Mentés Hivatkozás Idézetek száma: 451 Kapcsolódó cikkek Mind a(z) 5 változat

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Factscore: Fine-grained atomic evaluation of factual precision in long form text generation

S Min, K Krishna, X Lyu, M Lewis, W Yih… - arxiv preprint arxiv …, 2023 - arxiv.org

Evaluating the factuality of long-form text generated by large language models (LMs) is non-
trivial because (1) generations often contain a mixture of supported and unsupported pieces …

Mentés Hivatkozás Idézetek száma: 486 Kapcsolódó cikkek Mind a(z) 9 változat HTML-változat

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Improving text embeddings with large language models

L Wang, N Yang, X Huang, L Yang… - arxiv preprint arxiv …, 2023 - arxiv.org

In this paper, we introduce a novel and simple method for obtaining high-quality text
embeddings using only synthetic data and less than 1k training steps. Unlike existing …

Mentés Hivatkozás Idézetek száma: 280 Kapcsolódó cikkek Mind a(z) 5 változat HTML-változat

Text embeddings by weakly-supervised contrastive pre-training

L Wang, N Yang, X Huang, B Jiao, L Yang… - arxiv preprint arxiv …, 2022 - arxiv.org

This paper presents E5, a family of state-of-the-art text embeddings that transfer well to a
wide range of tasks. The model is trained in a contrastive manner with weak supervision …

Mentés Hivatkozás Idézetek száma: 438 Kapcsolódó cikkek Mind a(z) 2 változat Tárolt változat

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Large language models for information retrieval: A survey

Y Zhu, H Yuan, S Wang, J Liu, W Liu, C Deng… - arxiv preprint arxiv …, 2023 - arxiv.org

As a primary means of information acquisition, information retrieval (IR) systems, such as
search engines, have integrated themselves into our daily lives. These systems also serve …

Mentés Hivatkozás Idézetek száma: 314 Kapcsolódó cikkek Mind a(z) 3 változat HTML-változat

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

MTEB: Massive text embedding benchmark

N Muennighoff, N Tazi, L Magne, N Reimers - arxiv preprint arxiv …, 2022 - arxiv.org

Text embeddings are commonly evaluated on a small set of datasets from a single task not
covering their possible applications to other tasks. It is unclear whether state-of-the-art …

Mentés Hivatkozás Idézetek száma: 651 Kapcsolódó cikkek Mind a(z) 4 változat HTML-változat

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Enabling large language models to generate text with citations

T Gao, H Yen, J Yu, D Chen - arxiv preprint arxiv:2305.14627, 2023 - arxiv.org

Large language models (LLMs) have emerged as a widely-used tool for information
seeking, but their generated outputs are prone to hallucination. In this work, our aim is to …

Mentés Hivatkozás Idézetek száma: 255 Kapcsolódó cikkek Mind a(z) 8 változat HTML-változat

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

One embedder, any task: Instruction-finetuned text embeddings

H Su, W Shi, J Kasai, Y Wang, Y Hu… - arxiv preprint arxiv …, 2022 - arxiv.org

We introduce INSTRUCTOR, a new method for computing text embeddings given task
instructions: every text input is embedded together with instructions explaining the use case …

Mentés Hivatkozás Idézetek száma: 259 Kapcsolódó cikkek Mind a(z) 4 változat HTML-változat

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Multilingual e5 text embeddings: A technical report

L Wang, N Yang, X Huang, L Yang… - arxiv preprint arxiv …, 2024 - arxiv.org

This technical report presents the training methodology and evaluation results of the open-
source multilingual E5 text embedding models, released in mid-2023. Three embedding …

Mentés Hivatkozás Idézetek száma: 162 Kapcsolódó cikkek Mind a(z) 2 változat HTML-változat

Értesítés létrehozása

Hivatkozás

Speciális keresés

Mentve a Saját könyvtárba

Large dual encoders are generalizable retrievers

Siren's song in the AI ocean: a survey on hallucination in large language models

C-pack: Packed resources for general chinese embeddings

Factscore: Fine-grained atomic evaluation of factual precision in long form text generation

Improving text embeddings with large language models

Text embeddings by weakly-supervised contrastive pre-training

Large language models for information retrieval: A survey

MTEB: Massive text embedding benchmark

Enabling large language models to generate text with citations

One embedder, any task: Instruction-finetuned text embeddings

Multilingual e5 text embeddings: A technical report