Quality at a glance: An audit of web-crawled multilingual datasets J Kreutzer, I Caswell, L Wang, A Wahab, D van Esch, N Ulzii-Orshikh, ... Transactions of the Association for Computational Linguistics 10, 50-72, 2022 | 150 | 2022 |
MasakhaNER: Named entity recognition for African languages DI Adelani, J Abbott, G Neubig, D D’souza, J Kreutzer, C Lignos, ... Transactions of the Association for Computational Linguistics 9, 1116-1131, 2021 | 94 | 2021 |
Masakhanews: News topic classification for african languages DI Adelani, M Masiak, IA Azime, J Alabi, AL Tonja, C Mwase, O Ogundepo, ... arXiv preprint arXiv:2304.09972, 2023 | 25 | 2023 |
Cvqa: Culturally-diverse multilingual visual question answering benchmark D Romero, C Lyu, HA Wibowo, T Lynn, I Hamed, AN Kishore, A Mandal, ... arXiv preprint arXiv:2406.05967, 2024 | 17 | 2024 |
An amharic news text classification dataset IA Azime, N Mohammed arXiv preprint arXiv:2103.05639, 2021 | 15 | 2021 |
Nkiruka Odu, Rooweither Mabuya, Shamsuddeen Hassan Muhammad, Salomey Osei, Sokhar Samb, Tadesse Kebede Guge, and Pontus Stenetorp. 2024. Irokobench: A new benchmark for african … DI Adelani, J Ojo, IA Azime, JY Zhuang, JO Alabi, X He, M Ochieng, ... Preprint, 0 | 12 | |
Natural language processing in ethiopian languages: Current state, challenges, and opportunities AL Tonja, TD Belay, IA Azime, AA Ayele, MA Mehamed, O Kolesnikova, ... arXiv preprint arXiv:2303.14406, 2023 | 7 | 2023 |
EthioLLM: Multilingual Large Language Models for Ethiopian Languages with Task Evaluation AL Tonja, IA Azime, TD Belay, MG Yigezu, MA Mehamed, AA Ayele, ... arXiv preprint arXiv:2403.13737, 2024 | 6 | 2024 |
Walia-LLM: Enhancing Amharic-LLaMA by Integrating Task-Specific and Generative Datasets IA Azime, MY Fuge, AL Tonja, TD Belay, AK Wassie, ES Jada, Y Chanie, ... arXiv preprint arXiv:2402.08015, 2024 | 5 | 2024 |
Irokobench: A new benchmark for african languages in the age of large language models DI Adelani, J Ojo, IA Azime, JY Zhuang, JO Alabi, X He, M Ochieng, ... arXiv preprint arXiv:2406.03368, 2024 | 3 | 2024 |
Masakhane-Afrisenti at SemEval-2023 Task 12: Sentiment Analysis using Afro-centric Language Models and Adapters for Low-resource African Languages IA Azime, SS Al-Azzawi, AL Tonja, I Shode, J Alabi, A Awokoya, ... arXiv preprint arXiv:2304.06459, 2023 | 2 | 2023 |
AFRIDOC-MT: Document-level MT Corpus for African Languages JO Alabi, IA Azime, M Zhang, C España-Bonet, R Bawden, D Zhu, ... arXiv preprint arXiv:2501.06374, 2025 | | 2025 |
Evaluating the Capabilities of Large Language Models for Multi-label Emotion Understanding TD Belay, IA Azime, AA Ayele, G Sidorov, D Klakow, P Slusallek, ... arXiv preprint arXiv:2412.17837, 2024 | | 2024 |
Uhura: A Benchmark for Evaluating Scientific Question Answering and Truthfulness in Low-Resource African Languages E Bayes, IA Azime, JO Alabi, J Kgomo, T Eloundou, E Proehl, K Chen, ... arXiv preprint arXiv:2412.00948, 2024 | | 2024 |
ProverbEval: Exploring LLM Evaluation Challenges for Low-resource Language Understanding IA Azime, AL Tonja, TD Belay, Y Chanie, BF Balcha, NH Abadi, ... arXiv preprint arXiv:2411.05049, 2024 | | 2024 |
CVQA: Culturally-diverse Multilingual Visual Question Answering Benchmark DOR Mogrovejo, C Lyu, HA Wibowo, S Góngora, A Mandal, ... The Thirty-eight Conference on Neural Information Processing Systems …, 0 | | |
5th Workshop on African Natural Language Processing (AfricaNLP 2024) H Buzaaba, BFP Dossou, DI Adelani, H Elsahar, C Lignos, AL Tonja, ... ICLR 2024 Workshops, 0 | | |