Dictionaries and lexicography in the AI era

R Lew - Humanities and Social Sciences Communications, 2024 - nature.com
This paper examines the implications of AI and machine translation on traditional
lexicography, using three canonical scenarios for dictionary use: text reception, text …

AfriInstruct: Instruction Tuning of African Languages for Diverse Tasks

K Uemura, M Chen, A Pejovic… - Findings of the …, 2024 - aclanthology.org
Large language models (LLMs) for African languages perform worse compared to their
performance in high-resource languages. To address this issue, we introduce AfriInstruct …

AFRIDOC-MT: Document-level MT Corpus for African Languages

JO Alabi, IA Azime, M Zhang, C España-Bonet… - arxiv preprint arxiv …, 2025 - arxiv.org
This paper introduces AFRIDOC-MT, a document-level multi-parallel translation dataset
covering English and five African languages: Amharic, Hausa, Swahili, Yor\ub\'a, and Zulu …

Yankari: A Monolingual Yoruba Dataset

M Akpobi - arxiv preprint arxiv:2412.03334, 2024 - arxiv.org
This paper presents Yankari, a large-scale monolingual dataset for the Yoruba language,
aimed at addressing the critical gap in Natural Language Processing (NLP) resources for …

CURRENT STATE, CHALLENGES AND OPPORTUNITIES FOR NATURAL LANGUAGE PROCESSING RESEARCH AND DEVELOPMENT IN AFRICA …

C Emmanuel, K Andrew - openreview.net
Natural language processing (NLP) has recently gained much attention for representing and
analyzing human language computations as evidenced by the release of sophisticated …