Do llamas work in English? On the latent language of multilingual transformers

C Wendler, V Veselovsky, G Monea… - Proceedings of the 62nd …, 2024 - aclanthology.org
We ask whether multilingual language models trained on unbalanced, English-dominated
corpora use English as an internal pivot language, a question of key importance for …

On the multilingual ability of decoder-based pre-trained language models: Finding and controlling language-specific neurons

T Kojima, I Okimura, Y Iwasawa, H Yanaka… - arXiv preprint arXiv …, 2024 - arxiv.org
Current decoder-based pre-trained language models (PLMs) successfully demonstrate
multilingual capabilities. However, it is unclear how these models handle multilingualism …

Extracting sentence embeddings from pretrained transformer models

L Stankevičius, M Lukoševičius - Applied Sciences, 2024 - mdpi.com
Pre-trained transformer models shine in many natural language processing tasks and
therefore are expected to bear the representation of the input sentence or text meaning …

How do languages influence each other? Studying cross-lingual data sharing during LM fine-tuning

R Choenni, D Garrette, E Shutova - arXiv preprint arXiv:2305.13286, 2023 - arxiv.org
Multilingual large language models (MLLMs) are jointly trained on data from many different
languages such that representation of individual languages can benefit from other …

Data-driven cross-lingual syntax: An agreement study with massively multilingual models

AG Varda, M Marelli - Computational Linguistics, 2023 - direct.mit.edu
Massively multilingual models such as mBERT and XLM-R are increasingly valued in
Natural Language Processing research and applications, due to their ability to tackle the …

Analyzing the mono- and cross-lingual pretraining dynamics of multilingual language models

T Blevins, H Gonen, L Zettlemoyer - arXiv preprint arXiv:2205.11758, 2022 - arxiv.org
The emergent cross-lingual transfer seen in multilingual pretrained models has sparked
significant interest in studying their behavior. However, because these analyses have …

Discovering language-neutral sub-networks in multilingual language models

N Foroutan, M Banaei, R Lebret, A Bosselut… - arXiv preprint arXiv …, 2022 - arxiv.org
Multilingual pre-trained language models transfer remarkably well on cross-lingual
downstream tasks. However, the extent to which they learn language-neutral …

Discovering low-rank subspaces for language-agnostic multilingual representations

Z **e, H Zhao, T Yu, S Li - arxiv preprint arxiv:2401.05792, 2024 - arxiv.org
Large pretrained multilingual language models (ML-LMs) have shown remarkable
capabilities of zero-shot cross-lingual transfer, without direct cross-lingual supervision. While …

Differential privacy, linguistic fairness, and training data influence: Impossibility and possibility theorems for multilingual language models

P Rust, A Søgaard - International Conference on Machine …, 2023 - proceedings.mlr.press
Language models such as mBERT, XLM-R, and BLOOM aim to achieve
multilingual generalization or compression to facilitate transfer to a large number of …

BioBERTurk: exploring Turkish biomedical language model development strategies in low-resource setting

H Türkmen, O Dikenelli, C Eraslan, MC Callı… - Journal of Healthcare …, 2023 - Springer
Pretrained language models augmented with in-domain corpora show impressive results in
biomedicine and clinical Natural Language Processing (NLP) tasks in English. However …