A comprehensive overview of large language models

H Naveed, AU Khan, S Qiu, M Saqib, S Anwar… - arXiv preprint arXiv …, 2023 - arxiv.org
Large Language Models (LLMs) have recently demonstrated remarkable capabilities in
natural language processing tasks and beyond. This success of LLMs has led to a large …

A survey of knowledge enhanced pre-trained language models

L Hu, Z Liu, Z Zhao, L Hou, L Nie… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Pre-trained Language Models (PLMs), which are trained on large text corpora via self-supervised learning, have yielded promising performance on various tasks in …

NusaCrowd: Open source initiative for Indonesian NLP resources

S Cahyawijaya, H Lovenia, AF Aji, GI Winata… - arXiv preprint arXiv …, 2022 - arxiv.org
We present NusaCrowd, a collaborative initiative to collect and unify existing resources for
Indonesian languages, including opening access to previously non-public resources …

Power hungry processing: Watts driving the cost of AI deployment?

S Luccioni, Y Jernite, E Strubell - … of the 2024 ACM Conference on …, 2024 - dl.acm.org
Recent years have seen a surge in the popularity of commercial AI products based on
generative, multi-purpose AI systems promising a unified approach to building machine …

Evaluating large language models: A comprehensive survey

Z Guo, R Jin, C Liu, Y Huang, D Shi, L Yu, Y Liu… - arXiv preprint arXiv …, 2023 - arxiv.org
Large language models (LLMs) have demonstrated remarkable capabilities across a broad
spectrum of tasks. They have attracted significant attention and been deployed in numerous …

ByT5: Towards a token-free future with pre-trained byte-to-byte models

L Xue, A Barua, N Constant, R Al-Rfou… - Transactions of the …, 2022 - direct.mit.edu
Most widely used pre-trained language models operate on sequences of tokens
corresponding to word or subword units. By comparison, token-free models that operate …

mT5: A massively multilingual pre-trained text-to-text transformer

L Xue, N Constant, A Roberts, M Kale… - arXiv preprint arXiv …, 2020 - arxiv.org
The recent "Text-to-Text Transfer Transformer" (T5) leveraged a unified text-to-text format and
scale to attain state-of-the-art results on a wide variety of English-language NLP tasks. In this …

KLUE: Korean language understanding evaluation

S Park, J Moon, S Kim, WI Cho, J Han, J Park… - arXiv preprint arXiv …, 2021 - arxiv.org
We introduce the Korean Language Understanding Evaluation (KLUE) benchmark. KLUE is a
collection of 8 Korean natural language understanding (NLU) tasks, including Topic …

XTREME: A massively multilingual multi-task benchmark for evaluating cross-lingual generalisation

J Hu, S Ruder, A Siddhant, G Neubig… - International …, 2020 - proceedings.mlr.press
Much recent progress in applications of machine learning models to NLP has been driven
by benchmarks that evaluate models across a wide variety of tasks. However, these broad …

IndicNLPSuite: Monolingual corpora, evaluation benchmarks and pre-trained multilingual language models for Indian languages

D Kakwani, A Kunchukuttan, S Golla… - Findings of the …, 2020 - aclanthology.org
In this paper, we introduce NLP resources for 11 major Indian languages from two major
language families. These resources include: (a) large-scale sentence-level monolingual …