Survey of cultural awareness in language models: Text and beyond

S Pawar, J Park, J Jin, A Arora, J Myung… - arXiv preprint arXiv …, 2024 - arxiv.org
Large-scale deployment of large language models (LLMs) in various applications, such as
chatbots and virtual assistants, requires LLMs to be culturally sensitive to the user to ensure …

The 2021 Tokyo Olympics Multilingual News Article Dataset

E Novak, E Calcina, D Mladenić… - arXiv preprint arXiv …, 2025 - arxiv.org
In this paper, we introduce a dataset of multilingual news articles covering the 2021 Tokyo
Olympics. A total of 10,940 news articles were gathered from 1,918 different publishers …