Language-specific neurons: The key to multilingual capabilities in large language models

T Tang, W Luo, H Huang, D Zhang, X Wang… - arxiv preprint arxiv …, 2024 - arxiv.org
Large language models (LLMs) demonstrate remarkable multilingual capabilities without
being pre-trained on specially curated multilingual parallel corpora. It remains a challenging …

The geometry of multilingual language model representations

TA Chang, Z Tu, BK Bergen - arxiv preprint arxiv:2205.10964, 2022 - arxiv.org
We assess how multilingual language models maintain a shared multilingual representation
space while still encoding language-sensitive information in each language. Using XLM-R …

Translation performance from the user's perspective of large language models and neural machine translation systems

J Son, B Kim - Information, 2023 - mdpi.com
The rapid global expansion of ChatGPT, which plays a crucial role in interactive knowledge
sharing and translation, underscores the importance of comparative performance …

SeaEval for multilingual foundation models: From cross-lingual alignment to cultural reasoning

B Wang, Z Liu, X Huang, F Jiao, Y Ding, AT Aw… - arxiv preprint arxiv …, 2023 - arxiv.org
We present SeaEval, a benchmark for multilingual foundation models. In addition to
characterizing how these models understand and reason with natural language, we also …

Crossing the conversational chasm: A primer on natural language processing for multilingual task-oriented dialogue systems

E Razumovskaia, G Glavas, O Majewska… - Journal of Artificial …, 2022 - jair.org
In task-oriented dialogue (ToD), a user holds a conversation with an artificial agent with the
aim of completing a concrete task. Although this technology represents one of the central …

Understanding Cross-Lingual Alignment--A Survey

K Hämmerl, J Libovický, A Fraser - arxiv preprint arxiv:2404.06228, 2024 - arxiv.org
Cross-lingual alignment, the meaningful similarity of representations across languages in
multilingual language models, has been an active field of research in recent years. We …

Combining static word embeddings and contextual representations for bilingual lexicon induction

J Zhang, B Ji, N **ao, X Duan, M Zhang, Y Shi… - arxiv preprint arxiv …, 2021 - arxiv.org
Bilingual Lexicon Induction (BLI) aims to map words in one language to their translations in
another, and is typically through learning linear projections to align monolingual word …

Semi-supervised entity alignment via relation-based adaptive neighborhood matching

W Cai, W Ma, L Wei, Y Jiang - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Many recent studies of Entity Alignment (EA) use Graph Neural Networks (GNNs) to
aggregate the neighborhood features of entities and achieve better performance. However …

Sentiment analysis using pre-trained language model with no fine-tuning and less resource

Y Kit, MM Mokji - IEEE Access, 2022 - ieeexplore.ieee.org
Sentiment analysis has become popular when Natural Language Processing algorithms
were proven to be able to process complex sentences with good accuracy. Recently, pre …

Role of language relatedness in multilingual fine-tuning of language models: A case study in indo-aryan languages

TI Dhamecha, R Murthy V, S Bharadwaj… - arxiv preprint arxiv …, 2021 - arxiv.org
We explore the impact of leveraging the relatedness of languages that belong to the same
family in NLP models using multilingual fine-tuning. We hypothesize and validate that …