Multilingual sentiment analysis for under-resourced languages: a systematic review of the landscape
KR Mabokela, T Celik, M Raborife - IEEE Access, 2022 - ieeexplore.ieee.org
Sentiment analysis automatically evaluates people's opinions of products or services. It is an
emerging research area with promising advancements in high-resource languages such as …
emerging research area with promising advancements in high-resource languages such as …
Afrisenti: A twitter sentiment analysis benchmark for african languages
Africa is home to over 2,000 languages from more than six language families and has the
highest linguistic diversity among all continents. These include 75 languages with at least …
highest linguistic diversity among all continents. These include 75 languages with at least …
NusaX: Multilingual parallel sentiment dataset for 10 Indonesian local languages
Natural language processing (NLP) has a significant impact on society via technologies
such as machine translation and search engines. Despite its success, NLP technology is …
such as machine translation and search engines. Despite its success, NLP technology is …
Breaking physical and linguistic borders: Multilingual federated prompt tuning for low-resource languages
Pretrained large language models (LLMs) have emerged as a cornerstone in modern
natural language processing, with their utility expanding to various applications and …
natural language processing, with their utility expanding to various applications and …
Masakhaner 2.0: Africa-centric transfer learning for named entity recognition
African languages are spoken by over a billion people, but are underrepresented in NLP
research and development. The challenges impeding progress include the limited …
research and development. The challenges impeding progress include the limited …
AfroLID: A neural language identification tool for African languages
Language identification (LID) is a crucial precursor for NLP, especially for mining web data.
Problematically, most of the world's 7000+ languages today are not covered by LID …
Problematically, most of the world's 7000+ languages today are not covered by LID …
AfroLM: A self-active learning-based multilingual pretrained language model for 23 African languages
In recent years, multilingual pre-trained language models have gained prominence due to
their remarkable performance on numerous downstream Natural Language Processing …
their remarkable performance on numerous downstream Natural Language Processing …
AfriCLIRMatrix: Enabling cross-lingual information retrieval for african languages
Abstract Language diversity in NLP is critical in enabling the development of tools for a wide
range of users. However, there are limited resources for building such tools for many …
range of users. However, there are limited resources for building such tools for many …
Serengeti: Massively multilingual language models for africa
Multilingual pretrained language models (mPLMs) acquire valuable, generalizable linguistic
information during pretraining and have advanced the state of the art on task-specific …
information during pretraining and have advanced the state of the art on task-specific …
Jampatoisnli: A jamaican patois natural language inference dataset
JamPatoisNLI provides the first dataset for natural language inference in a creole language,
Jamaican Patois. Many of the most-spoken low-resource languages are creoles. These …
Jamaican Patois. Many of the most-spoken low-resource languages are creoles. These …