A comprehensive overview of large language models

H Naveed, AU Khan, S Qiu, M Saqib, S Anwar… - arXiv preprint arXiv …, 2023 - arxiv.org
Large Language Models (LLMs) have recently demonstrated remarkable capabilities in
natural language processing tasks and beyond. This success of LLMs has led to a large …

C-pack: Packed resources for general chinese embeddings

S Xiao, Z Liu, P Zhang, N Muennighoff, D Lian… - Proceedings of the 47th …, 2024 - dl.acm.org
We introduce C-Pack, a package of resources that significantly advances the field of general
text embeddings for Chinese. C-Pack includes three critical resources. 1) C-MTP is a …

Crosslingual generalization through multitask finetuning

N Muennighoff, T Wang, L Sutawika, A Roberts… - arXiv preprint arXiv …, 2022 - arxiv.org
Multitask prompted finetuning (MTF) has been shown to help large language models
generalize to new tasks in a zero-shot setting, but so far explorations of MTF have focused …

ERNIE 3.0: Large-scale knowledge enhanced pre-training for language understanding and generation

Y Sun, S Wang, S Feng, S Ding, C Pang… - arXiv preprint arXiv …, 2021 - arxiv.org
Pre-trained models have achieved state-of-the-art results in various Natural Language
Processing (NLP) tasks. Recent works such as T5 and GPT-3 have shown that scaling up …

ByT5: Towards a token-free future with pre-trained byte-to-byte models

L Xue, A Barua, N Constant, R Al-Rfou… - Transactions of the …, 2022 - direct.mit.edu
Most widely used pre-trained language models operate on sequences of tokens
corresponding to word or subword units. By comparison, token-free models that operate …

mT5: A massively multilingual pre-trained text-to-text transformer

L Xue - arXiv preprint arXiv:2010.11934, 2020 - fq.pkwyx.com
The recent "Text-to-Text Transfer Transformer" (T5) leveraged a unified text-to-text format and
scale to attain state-of-the-art results on a wide variety of English-language NLP tasks. In this …

Datasets for large language models: A comprehensive survey

Y Liu, J Cao, C Liu, K Ding, L Jin - arXiv preprint arXiv:2402.18041, 2024 - arxiv.org
This paper embarks on an exploration into the Large Language Model (LLM) datasets,
which play a crucial role in the remarkable advancements of LLMs. The datasets serve as …

Spanish pre-trained bert model and evaluation data

J Cañete, G Chaperon, R Fuentes, JH Ho… - arXiv preprint arXiv …, 2023 - arxiv.org
The Spanish language is one of the top 5 spoken languages in the world. Nevertheless,
finding resources to train or evaluate Spanish language models is not an easy task. In this …

XTREME: A massively multilingual multi-task benchmark for evaluating cross-lingual generalisation

J Hu, S Ruder, A Siddhant, G Neubig… - International …, 2020 - proceedings.mlr.press
Much recent progress in applications of machine learning models to NLP has been driven
by benchmarks that evaluate models across a wide variety of tasks. However, these broad …

KLUE: Korean Language Understanding Evaluation

S Park - arXiv preprint arXiv:2105.09680, 2021 - academia.edu
We introduce the Korean Language Understanding Evaluation (KLUE) benchmark. KLUE is a
collection of 8 Korean natural language understanding (NLU) tasks, including Topic …