Road traffic conditions in Kenya: Exploring the policies and traffic cultures from unstructured user-generated data using NLP
Road traffic accidents (RTA) are a prevalent cause of fatality with African countries having
the highest fatality index (25–34 per quota). The World Health Organization estimates …
Massively multilingual corpus of sentiment datasets and multi-faceted sentiment classification benchmark
Despite impressive advancements in multilingual corpora collection and model training,
developing large-scale deployments of multilingual models still presents a significant …
Information extraction from Spanish radiology reports using multilingual BERT
This paper describes our team's participation in Task 1 of the Conference and Labs of the
Evaluation Forum (CLEF eHealth 2021). The Task 1 challenge targets Named Entity …
Persian Ezafeh Recognition using Transformer-Based Models
In Persian, the grammatical particle ezafe connects two words. Ezafe is one of the salient
factors in Persian phonology and morphology to understand the meaning of a sentence …
Assessment of massively multilingual sentiment classifiers
Models are increasing in size and complexity in the hunt for SOTA. But what if those 2%
increase in performance does not make a difference in a production use case? Maybe …
Looking for clues of language in multilingual BERT to improve cross-lingual generalization
Token embeddings in multilingual BERT (m-BERT) contain both language and semantic
information. We find that the representation of a language can be obtained by simply …
Using BERT for Swiss German Sentence Prediction
O Köchli, P Wenk, C Zweili, T Hanne - International Conference on …, 2023 - Springer
Abstract Natural Language Processing builds the foundation for a wide range of language-
based applications. While widely spread languages benefit from a wide selection of …
Evaluating Text Classification Models on Multilingual Documents
J Eigenmann, I Arous, M Khayati, P Cudré-Mauroux - 2021 - exascale.info
Abstract Machine learning models often require large annotated datasets for training in
order to obtain accurate results. However, the scarcity of the labeled data is a bottleneck for …