A survey on large language models with multilingualism: Recent advances and new frontiers

K Huang, F Mo, X Zhang, H Li, Y Li, Y Zhang… - arXiv preprint arXiv …, 2024 - arxiv.org
The rapid development of Large Language Models (LLMs) demonstrates remarkable
multilingual capabilities in natural language processing, attracting global attention in both …

Toward best practices for training multilingual dense retrieval models

X Zhang, K Ogueji, X Ma, J Lin - ACM Transactions on Information …, 2023 - dl.acm.org
Dense retrieval models using a transformer-based bi-encoder architecture have emerged as
an active area of research. In this article, we focus on the task of monolingual retrieval in a …

Overview of the TREC 2023 NeuCLIR track

D Lawrie, S MacAvaney, J Mayfield… - arXiv preprint arXiv …, 2024 - arxiv.org
The principal goal of the TREC Neural Cross-Language Information Retrieval (NeuCLIR)
track is to study the impact of neural approaches to cross-language information retrieval. The …

Steering large language models for cross-lingual information retrieval

P Guo, Y Ren, Y Hu, Y Cao, Y Li, H Huang - Proceedings of the 47th …, 2024 - dl.acm.org
In today's digital age, accessing information across language barriers poses a significant
challenge, with conventional search systems often struggling to interpret and retrieve …

JaColBERTv2.5: Optimising Multi-Vector Retrievers to Create State-of-the-Art Japanese Retrievers with Constrained Resources

B Clavié - arXiv preprint arXiv:2407.20750, 2024 - arxiv.org
Neural Information Retrieval has advanced rapidly in high-resource languages, but progress
in lower-resource ones such as Japanese has been hindered by data scarcity, among other …

Query in your tongue: Reinforce large language models with retrievers for cross-lingual search generative experience

P Guo, Y Hu, Y Cao, Y Ren, Y Li, H Huang - Proceedings of the ACM …, 2024 - dl.acm.org
In the contemporary digital landscape, search engines play an invaluable role in information
access, yet they often face challenges in Cross-Lingual Information Retrieval (CLIR) …

Zero-shot cross-lingual reranking with large language models for low-resource languages

M Adeyemi, A Oladipo, R Pradeep, J Lin - arXiv preprint arXiv:2312.16159, 2023 - arxiv.org
Large language models (LLMs) have shown impressive zero-shot capabilities in various
document reranking tasks. Despite their successful implementations, there is still a gap in …

Facilitating cross-lingual information retrieval evaluations for African languages

M Adeyemi - 2024 - uwspace.uwaterloo.ca
Web resources are becoming more available in various languages, increasing the
importance of cross-lingual information retrieval (CLIR) in accessing information that is …

Good for Children, Good for All?

M Landoni, T Huibers, E Murgia, MS Pera - European Conference on …, 2024 - Springer
In this work, we reason how focusing on Information Retrieval (IR) for children and involving
them in participatory studies would benefit the IR community. The Child Computer …

IndicIRSuite: Multilingual dataset and neural information models for Indian languages

S Haq, A Sharma, P Bhattacharyya - arXiv preprint arXiv:2312.09508, 2023 - arxiv.org
In this paper, we introduce Neural Information Retrieval resources for 11 widely spoken
Indian Languages (Assamese, Bengali, Gujarati, Hindi, Kannada, Malayalam, Marathi …