Bloom: A 176b-parameter open-access multilingual language model T Le Scao, A Fan, C Akiki, E Pavlick, S Ilić, D Hesslow, R Castagné, ... | 1745 | 2023 |
Quality at a glance: An audit of web-crawled multilingual datasets J Kreutzer, I Caswell, L Wang, A Wahab, D van Esch, N Ulzii-Orshikh, ... Transactions of the Association for Computational Linguistics 10, 50-72, 2022 | 271* | 2022 |
Participatory research for low-resourced machine translation: A case study in african languages W Nekoto, V Marivate, T Matsila, T Fasubaa, T Kolawole, T Fagbohungbe, ... arXiv preprint arXiv:2010.02353, 2020 | 191 | 2020 |
The gem benchmark: Natural language generation, its evaluation and metrics S Gehrmann, T Adewumi, K Aggarwal, PS Ammanamanchi, ... arXiv preprint arXiv:2102.01672, 2021 | 157 | 2021 |
MasakhaNER: Named entity recognition for African languages DI Adelani, J Abbott, G Neubig, D D’souza, J Kreutzer, C Lignos, ... Transactions of the Association for Computational Linguistics 9, 1116-1131, 2021 | 94 | 2021 |
Afrisenti: A twitter sentiment analysis benchmark for african languages SH Muhammad, I Abdulmumin, AA Ayele, N Ousidhoum, DI Adelani, ... arXiv preprint arXiv:2302.08956, 2023 | 72 | 2023 |
Reusable templates and guides for documenting datasets and models for natural language processing and generation: A case study of the HuggingFace and GEM data and model cards A McMillan-Major, S Osei, JD Rodriguez, PS Ammanamanchi, ... arXiv preprint arXiv:2108.07374, 2021 | 59 | 2021 |
AfroLM: A self-active learning-based multilingual pretrained language model for 23 African languages BFP Dossou, AL Tonja, O Yousuf, S Osei, A Oppong, I Shode, ... arXiv preprint arXiv:2211.03263, 2022 | 40 | 2022 |
Quality at a glance: An audit of web-crawled multilingual datasets I Caswell, J Kreutzer, L Wang, A Wahab, D van Esch, N Ulzii-Orshikh, ... arXiv e-prints, arXiv: 2103.12028, 2021 | 37 | 2021 |
The World as a Graph: Improving El Ni\~ no Forecasts with Graph Neural Networks SR Cachay, E Erickson, AFC Bucker, E Pokropek, W Potosnak, S Bire, ... arXiv preprint arXiv:2104.05089, 2021 | 30 | 2021 |
Graph Neural Networks for Improved El Ni\~ no Forecasting SR Cachay, E Erickson, AFC Bucker, E Pokropek, W Potosnak, S Osei, ... arXiv preprint arXiv:2012.01598, 2020 | 28 | 2020 |
Bibletts: a large, high-fidelity, multilingual, and uniquely african speech corpus J Meyer, DI Adelani, E Casanova, A Öktem, DWJ Weber, S Kabongo, ... arXiv preprint arXiv:2207.03546, 2022 | 21 | 2022 |
Gemv2: Multilingual nlg benchmarking in a single line of code S Gehrmann, A Bhattacharjee, A Mahendiran, A Wang, A Papangelis, ... arXiv preprint arXiv:2206.11249, 2022 | 16 | 2022 |
Afrispeech-200: Pan-african accented speech dataset for clinical and general domain asr T Olatunji, T Afonja, A Yadavalli, CC Emezue, S Singh, BFP Dossou, ... Transactions of the Association for Computational Linguistics 11, 1669-1685, 2023 | 15 | 2023 |
English-twi parallel corpus for machine translation P Azunre, S Osei, S Addo, LA Adu-Gyamfi, S Moore, B Adabankah, ... arXiv preprint arXiv:2103.15625, 2021 | 11 | 2021 |
Afriqa: Cross-lingual open-retrieval question answering for african languages O Ogundepo, TR Gwadabe, CE Rivera, JH Clark, S Ruder, DI Adelani, ... arXiv preprint arXiv:2305.06897, 2023 | 9 | 2023 |
Nlp for ghanaian languages P Azunre, S Osei, S Addo, LA Adu-Gyamfi, S Moore, B Adabankah, ... arXiv preprint arXiv:2103.15475, 2021 | 8 | 2021 |
Contextual text embeddings for twi P Azunre, S Osei, S Addo, LA Adu-Gyamfi, S Moore, B Adabankah, ... arXiv preprint arXiv:2103.15963, 2021 | 7 | 2021 |
Espoir Murhabazi, Elan Van Biljon, Daniel Whitenack, Christopher Onyefuluchi, Chris C W Nekoto, V Marivate, T Matsila, TE Fasubaa, T Kolawole, ... Emezue, Bonaventure FP Dossou, Blessing K. Sibanda, Blessing Itoro Bassey …, 2020 | 4 | 2020 |
AfriSenti: a Twitter sentiment analysis benchmark for African languages (2023) SH Muhammad, I Abdulmumin, AA Ayele, N Ousidhoum, DI Adelani, ... URL https://arxiv. org/abs/2302.08956, 0 | 4 | |