Multilingual sentiment analysis for under-resourced languages: a systematic review of the landscape

KR Mabokela, T Celik, M Raborife - IEEE Access, 2022 - ieeexplore.ieee.org
Sentiment analysis automatically evaluates people's opinions of products or services. It is an
emerging research area with promising advancements in high-resource languages such as …

DEMix layers: Disentangling domains for modular language modeling

S Gururangan, M Lewis, A Holtzman, NA Smith… - arXiv preprint arXiv …, 2021 - arxiv.org
We introduce a new domain expert mixture (DEMix) layer that enables conditioning a
language model (LM) on the domain of the input text. A DEMix layer is a collection of expert …

Between words and characters: A brief history of open-vocabulary modeling and tokenization in NLP

SJ Mielke, Z Alyafeai, E Salesky, C Raffel… - arXiv preprint arXiv …, 2021 - arxiv.org
What are the units of text that we want to model? From bytes to multi-word expressions, text
can be analyzed and generated at many granularities. Until recently, most natural language …

BLOOM+1: Adding language support to BLOOM for zero-shot prompting

ZX Yong, H Schoelkopf, N Muennighoff, AF Aji… - arXiv preprint arXiv …, 2022 - arxiv.org
The BLOOM model is a large publicly available multilingual language model, but its
pretraining was limited to 46 languages. To extend the benefits of BLOOM to other …

AmericasNLI: Evaluating zero-shot natural language understanding of pretrained multilingual models in truly low-resource languages

A Ebrahimi, M Mager, A Oncevay, V Chaudhary… - arXiv preprint arXiv …, 2021 - arxiv.org
Pretrained multilingual models are able to perform cross-lingual transfer in a zero-shot
setting, even for languages unseen during pretraining. However, prior work evaluating …

Expanding pretrained models to thousands more languages via lexicon-based adaptation

X Wang, S Ruder, G Neubig - arXiv preprint arXiv:2203.09435, 2022 - arxiv.org
The performance of multilingual pretrained models is highly dependent on the availability of
monolingual or parallel text present in a target language. Thus, the majority of the world's …

IndoBERTweet: A pretrained language model for Indonesian Twitter with effective domain-specific vocabulary initialization

F Koto, JH Lau, T Baldwin - arXiv preprint arXiv:2109.04607, 2021 - arxiv.org
We present IndoBERTweet, the first large-scale pretrained model for Indonesian Twitter that
is trained by extending a monolingually-trained Indonesian BERT model with additive …

How to adapt your pretrained multilingual model to 1600 languages

A Ebrahimi, K Kann - arXiv preprint arXiv:2106.02124, 2021 - arxiv.org
Pretrained multilingual models (PMMs) enable zero-shot learning via cross-lingual transfer,
performing best for languages seen during pretraining. While methods exist to improve …

When being unseen from mBERT is just the beginning: Handling new languages with multilingual language models

B Muller, A Anastasopoulos, B Sagot… - arXiv preprint arXiv …, 2020 - arxiv.org
Transfer learning based on pretraining language models on a large amount of raw data has
become a new norm to reach state-of-the-art performance in NLP. Still, it remains unclear …

Breaking physical and linguistic borders: Multilingual federated prompt tuning for low-resource languages

W Zhao, Y Chen, R Lee, X Qiu, Y Gao… - The Twelfth …, 2024 - openreview.net
Pretrained large language models (LLMs) have emerged as a cornerstone in modern
natural language processing, with their utility expanding to various applications and …