Qa dataset explosion: A taxonomy of nlp resources for question answering and reading comprehension

A Rogers, M Gardner, I Augenstein - ACM Computing Surveys, 2023 - dl.acm.org
Alongside huge volumes of research on deep learning models in NLP in the recent years,
there has been much work on benchmark datasets needed to track modeling progress …

Code-mixing: A brief survey

S Thara, P Poornachandran - 2018 International conference on …, 2018 - ieeexplore.ieee.org
Indians and many other non-English speakers across the world, prefer not to use single
code in their messaging texts on social media platforms. They make use of transliteration …

A dataset of Hindi-English code-mixed social media text for hate speech detection

A Bohra, D Vijay, V Singh, SS Akhtar… - Proceedings of the …, 2018 - aclanthology.org
Hate speech detection in social media texts is an important Natural language Processing
task, which has several crucial applications like sentiment analysis, investigating …

Automatic language identification in texts: A survey

T Jauhiainen, M Lui, M Zampieri, T Baldwin… - Journal of Artificial …, 2019 - jair.org
Language identification (" LI") is the problem of determining the natural language that a
document or part thereof is written in. Automatic LI has been extensively researched for over …

LinCE: A centralized benchmark for linguistic code-switching evaluation

G Aguilar, S Kar, T Solorio - arxiv preprint arxiv:2005.04322, 2020 - arxiv.org
Recent trends in NLP research have raised an interest in linguistic code-switching (CS);
modern approaches have been proposed to solve a wide range of NLP tasks on multiple …

A survey of code-switched speech and language processing

S Sitaram, KR Chandu, SK Rallabandi… - arxiv preprint arxiv …, 2019 - arxiv.org
Code-switching, the alternation of languages within a conversation or utterance, is a
common communicative phenomenon that occurs in multilingual communities across the …

Transformer based language identification for malayalam-english code-mixed text

S Thara, P Poornachandran - IEEE Access, 2021 - ieeexplore.ieee.org
Social media users have the proclivity to write majority of the data for under resourced
languages in code-mixed format. Code-mixing is defined as mixing of two or more …

Enabling code-mixed translation: Parallel corpus creation and MT augmentation approach

M Dhar, V Kumar, M Shrivastava - Proceedings of the First …, 2018 - aclanthology.org
Code-mixing, use of two or more languages in a single sentence, is ubiquitous; generated
by multi-lingual speakers across the world. The phenomenon presents itself prominently in …

Towards end-to-end multilingual question answering

E Loginova, S Varanasi, G Neumann - Information Systems Frontiers, 2021 - Springer
Multilingual question answering (MLQA) is a critical part of an accessible natural language
interface. However, current solutions demonstrate performance far below that of …

Corpus creation and emotion prediction for Hindi-English code-mixed social media text

D Vijay, A Bohra, V Singh, SS Akhtar… - Proceedings of the …, 2018 - aclanthology.org
Abstract Emotion Prediction is a Natural Language Processing (NLP) task dealing with
detection and classification of emotions in various monolingual and bilingual texts. While …