Road traffic conditions in Kenya: Exploring the policies and traffic cultures from unstructured user-generated data using NLP

J Muguro, W Njeri, K Matsushita, M Sasaki - IATSS research, 2022 - Elsevier
Road traffic accidents (RTA) are a prevalent cause of fatality with African countries having
the highest fatality index (25–34 per quota). The World Health Organization estimates …

Massively multilingual corpus of sentiment datasets and multi-faceted sentiment classification benchmark

L Augustyniak, S Woźniak, M Gruza… - Advances in …, 2023 - proceedings.neurips.cc
Despite impressive advancements in multilingual corpora collection and model training,
develo** large-scale deployments of multilingual models still presents a significant …

[PDF][PDF] Information extraction from Spanish radiology reports using multilingual BERT

O Solarte-Pabón, O Montenegro… - CLEF …, 2021 - academia.edu
This paper describes our team's participation in Task 1 of the Conference and Labs of the
Evaluation Forum (CLEF eHealth 2021). The Task 1 challenge targets Named Entity …

Persian Ezafeh Recognition using Transformer-Based Models

A Ansari, Z Ebrahimian, R Toosi… - 2023 9th International …, 2023 - ieeexplore.ieee.org
In Persian, the grammatical particle ezafe connects two words. Ezafe is one of the salient
factors in Persian phonology and morphology to understand the meaning of a sentence …

Assessment of massively multilingual sentiment classifiers

K Rajda, Ł Augustyniak, P Gramacki, M Gruza… - arxiv preprint arxiv …, 2022 - arxiv.org
Models are increasing in size and complexity in the hunt for SOTA. But what if those 2\%
increase in performance does not make a difference in a production use case? Maybe …

Looking for clues of language in multilingual BERT to improve cross-lingual generalization

CL Liu, TY Hsu, YS Chuang, CY Li, H Lee - arxiv preprint arxiv …, 2020 - arxiv.org
Token embeddings in multilingual BERT (m-BERT) contain both language and semantic
information. We find that the representation of a language can be obtained by simply …

Using BERT for Swiss German Sentence Prediction

O Köchli, P Wenk, C Zweili, T Hanne - International Conference on …, 2023 - Springer
Abstract Natural Language Processing builds the foundation for a wide range of language-
based applications. While widely spread languages benefit from a wide selection of …

[PDF][PDF] Evaluating Text Classification Models on Multilingual Documents

J Eigenmann, I Arous, M Khayati, P Cudré-Mauroux - 2021 - exascale.info
Abstract Machine learning models often require large annotated datasets for training in
order to obtain accurate results. However, the scarcity of the labeled data is a bottleneck for …