One country, 700+ languages: NLP challenges for underrepresented languages and dialects in Indonesia

AF Aji, GI Winata, F Koto, S Cahyawijaya… - arXiv preprint arXiv …, 2022 - arxiv.org
NLP research is impeded by a lack of resources and awareness of the challenges presented
by underrepresented languages and dialects. Focusing on the languages spoken in …

LipKey: A large-scale news dataset for absent keyphrase generation and abstractive summarization

F Koto, T Baldwin, JH Lau - Proceedings of the 29th International …, 2022 - aclanthology.org
Summaries, keyphrases, and titles are different ways of concisely capturing the content of a
document. While most previous work has released the datasets of keyphrases and …

Automatic Semantic Annotation of Indonesian Language Phrase Using N-Gram Language Model.

D Wardani, C Evangelista - International Journal on …, 2024 - search.ebscohost.com
Building semantic data populations in unstructured data or text is challenging. In this type of
data, several problems can be raised, some of which are difficult to analyze. Some groups of …

Keyword Extraction from Scientific Publications Using Local Features and Embedding Model

GSK Kurniawan, KM Lhaksmana - 2023 9th International …, 2023 - ieeexplore.ieee.org
In the field of natural language processing (NLP), keywords are crucial for enhancing
information retrieval (IR) and content summarization, as well as for optimizing search …

Feature-based POS tagging and sentence relevance for news multi-document summarization in Bahasa Indonesia

MZ Abdullah, C Fatichah - Bulletin of Electrical Engineering and Informatics, 2022 - beei.org
Sentence extraction in news document summarization determines representative sentences
primarily by employing the news feature known as news feature score (NeFS). NeFS can …

Penerapan Teknologi LangChain pada Question Answering System Fikih Empat Madzhab: Application of Langchain Technology to the Fiqh Question Answering …

S Rahayu, NS Harahap, S Agustian… - … : Indonesian Journal of …, 2024 - journal.irpi.or.id
Fiqh, as a broad discipline, sometimes gives rise to a variety of issues and differences of
opinion among its schools (madhhabs). The purpose of scholars' views on fiqh issues …

Keyword extraction from news corpus by deep learning in the context of internet of things

Y Xiao - International Journal of Grid and Utility Computing, 2023 - inderscienceonline.com
With the rapid development of modern technology and information technology, information
generation and dissemination is getting faster and faster. The amount of web text, such as …

Implemented Text Summarization Tool using Text Rank Algorithm

A More, V Dalal - … Journal of Innovations in Engineering and …, 2021 - search.proquest.com
Text Synopsis is the most common way of producing the dense perspective on the text by
choosing valuable and pertinent data from the first source records. It is a sub subject of Data …

PENERAPAN TEKNOLOGI LANGCHAIN PADA QUESTION ANSWERING SYSTEM FIKIH EMPAT MADZHAB

S RAHAYU - PENERAPAN TEKNOLOGI LANGCHAIN …, 2024 - repository.uin-suska.ac.id
Fiqh, as a broad discipline, sometimes gives rise to a variety of issues and differences of
opinion among its schools (madhhabs). The purpose of scholars' views on fiqh issues …

Document Searching of EPPS Test Result Using Indexing Method

K Krisna, S Mulyati - IOP Conference Series: Materials Science …, 2021 - iopscience.iop.org
This study focuses on documents searching on the EPPS test result, mainly administered in
the education sector for selecting prospective students in universities' admissions process …