NusaCrowd: Open source initiative for Indonesian NLP resources

S Cahyawijaya, H Lovenia, AF Aji… - Findings of the …, 2023 - aclanthology.org
We present NusaCrowd, a collaborative initiative to collect and unify existing resources for
Indonesian languages, including opening access to previously non-public resources …

We know you are living in bali: Location prediction of twitter users using bert language model

LF Simanjuntak, R Mahendra, E Yulianti - Big Data and Cognitive …, 2022 - mdpi.com
Twitter user location data provide essential information that can be used for various
purposes. However, user location is not easy to identify because many profiles omit this …

Corpus creation and language identification for code-mixed Indonesian-Javanese-English Tweets

AF Hidayatullah, RA Apong, DTC Lai, A Qazi - PeerJ Computer Science, 2023 - peerj.com
With the massive use of social media today, mixing between languages in social media text
is prevalent. In linguistics, the phenomenon of mixing languages is known as code-mixing …

Predicting the category and the length of punishment in Indonesian courts based on previous court decision documents

EQ Nuranti, E Yulianti, HS Husin - Computers, 2022 - mdpi.com
Among the sources of legal considerations are judges' previous decisions regarding similar
cases that are archived in court decision documents. However, due to the increasing …

[PDF][PDF] Multi-label text classification of Indonesian customer reviews using bidirectional encoder representations from transformers language model

NK Nissa, E Yuliant - Int. J. Power Electron. Drive Syst, 2023 - academia.edu
Customer review is a critical resource to support the decision-making process in various
industries. To understand how customers perceived each aspect of the product, we can first …

[HTML][HTML] Freedom of expression, aspiration and gender: A cultuling in the student demonstration

S Nurbayani, E Malihah, MA Widiawaty, M Dede… - Social Sciences & …, 2025 - Elsevier
During demonstrations, people voice their aspirations to the government. On April 11, 2022,
during the COVID-19 pandemic, millennial and Gen-Z students led widespread protests in …

Sarcasm Detection in Indonesian-English Code-Mixed Text Using Multihead Attention-Based Convolutional and Bi-Directional GRU

MA Rosid, D Siahaan, A Saikhu - IEEE Access, 2024 - ieeexplore.ieee.org
Detecting sarcasm in text is a very challenging task. Sarcasm often depends on context,
tone, and cultural references, which can be difficult for machines to understand. In addition …

Sentiment analysis on indonesian-sundanese code-mixed data

H Najiha, A Romadhony - 2023 IEEE 8th International …, 2023 - ieeexplore.ieee.org
In this work, we conduct sentiment analysis on Indonesian-Sundanese code-mixed tweets.
Sundanese is one of Indonesia's regional languages with over 42.000. 000 speakers. We …

Emotion classification on code-mixed text messages via soft prompt tuning

J Zhang, D Yang, S Bao, L Cao… - Proceedings of the 13th …, 2023 - aclanthology.org
Emotion classification on code-mixed text messages is challenging due to the multilingual
languages and non-literal cues (ie, emoticons). To solve these problems, we propose an …