LinCE: A centralized benchmark for linguistic code-switching evaluation
Recent trends in NLP research have raised an interest in linguistic code-switching (CS);
modern approaches have been proposed to solve a wide range of NLP tasks on multiple …
modern approaches have been proposed to solve a wide range of NLP tasks on multiple …
Fighting hate speech from bilingual hinglish speaker's perspective, a transformer-and translation-based approach.
Many people have begun to use social media platforms due to the increased use of the
Internet over the previous decade. It has a lot of benefits, but it also comes with a lot of risks …
Internet over the previous decade. It has a lot of benefits, but it also comes with a lot of risks …
Does mapo tofu contain coffee? probing llms for food-related cultural knowledge
Recent studies have highlighted the presence of cultural biases in Large Language Models
(LLMs), yet often lack a robust methodology to dissect these phenomena comprehensively …
(LLMs), yet often lack a robust methodology to dissect these phenomena comprehensively …
Multilingual code-switching for zero-shot cross-lingual intent prediction and slot filling
Predicting user intent and detecting the corresponding slots from text are two key problems
in Natural Language Understanding (NLU). In the context of zero-shot learning, this task is …
in Natural Language Understanding (NLU). In the context of zero-shot learning, this task is …
Does aggression lead to hate? Detecting and reasoning offensive traits in hinglish code-mixed texts
Aggression is a prominent trait of human beings that can affect social harmony in a negative
way. The hate mongers misuse the freedom of speech in social media platforms to flood with …
way. The hate mongers misuse the freedom of speech in social media platforms to flood with …
A code-mixed task-oriented dialog dataset for medical domain
In the healthcare domain, medical and patient interactions form a crucial part of the
diagnosis. Initially, the AI models developed for healthcare centered only on monolingual …
diagnosis. Initially, the AI models developed for healthcare centered only on monolingual …
Calcs 2021 shared task: Machine translation for code-switched data
To date, efforts in the code-switching literature have focused for the most part on language
identification, POS, NER, and syntactic parsing. In this paper, we address machine …
identification, POS, NER, and syntactic parsing. In this paper, we address machine …
Can you traducir this? machine translation for code-switched input
Code-Switching (CSW) is a common phenomenon that occurs in multilingual geographic or
social contexts, which raises challenging problems for natural language processing tools …
social contexts, which raises challenging problems for natural language processing tools …
Char2Subword: Extending the subword embedding space using robust character compositionality
Byte-pair encoding (BPE) is a ubiquitous algorithm in the subword tokenization process of
language models as it provides multiple benefits. However, this process is solely based on …
language models as it provides multiple benefits. However, this process is solely based on …
A Comprehensive Understanding of Code-Mixed Language Semantics Using Hierarchical Transformer
Being a popular mode of text-based communication in multilingual communities, code
mixing in online social media has become an important subject to study. Learning the …
mixing in online social media has become an important subject to study. Learning the …