- Academic Search

S Khanuja, D Bansal, S Mehtani, S Khosla… - arxiv preprint arxiv …, 2021 - arxiv.org

India is a multilingual society with 1369 rationalized languages and dialects being spoken
across the country (INDIA, 2011). Of these, the 22 scheduled languages have a staggering …

Opslaan Citeren Geciteerd door 313 Verwante artikelen Alle 5 versies HTML-versie

[Free GPT-4]

[PDF] ieee.org

A systematic review on language identification of code-mixed text: techniques, data availability, challenges, and framework development

AF Hidayatullah, A Qazi, DTC Lai, RA Apong - IEEE access, 2022 - ieeexplore.ieee.org

The mix of native language with other languages (code-mixing) in social media has posed a
severe challenge for language identification (LID) systems. It has encouraged research on …

Opslaan Citeren Geciteerd door 28 Verwante artikelen Alle 3 versies

[Free GPT-4]

[PDF] jair.org

Automatic language identification in texts: A survey

T Jauhiainen, M Lui, M Zampieri, T Baldwin… - Journal of Artificial …, 2019 - jair.org

Language identification (" LI") is the problem of determining the natural language that a
document or part thereof is written in. Automatic LI has been extensively researched for over …

Opslaan Citeren Geciteerd door 257 Verwante artikelen Alle 11 versies HTML-versie

[Free GPT-4]

[PDF] arxiv.org

GLUECoS: An evaluation benchmark for code-switched NLP

S Khanuja, S Dandapat, A Srinivasan… - arxiv preprint arxiv …, 2020 - arxiv.org

Code-switching is the use of more than one language in the same conversation or utterance.
Recently, multilingual contextual embedding models, trained on multiple monolingual …

Opslaan Citeren Geciteerd door 133 Verwante artikelen Alle 4 versies HTML-versie

[Free GPT-4]

[PDF] aclanthology.org

Language modeling for code-mixing: The role of linguistic theory based synthetic data

A Pratapa, G Bhat, M Choudhury… - Proceedings of the …, 2018 - aclanthology.org

Training language models for Code-mixed (CM) language is known to be a difficult problem
because of lack of data compounded by the increased confusability due to the presence of …

Opslaan Citeren Geciteerd door 170 Verwante artikelen Alle 2 versies HTML-versie

[Free GPT-4]

[PDF] arxiv.org

A survey of code-switched speech and language processing

S Sitaram, KR Chandu, SK Rallabandi… - arxiv preprint arxiv …, 2019 - arxiv.org

Code-switching, the alternation of languages within a conversation or utterance, is a
common communicative phenomenon that occurs in multilingual communities across the …

Opslaan Citeren Geciteerd door 142 Verwante artikelen Alle 2 versies HTML-versie

[Free GPT-4]

[PDF] adityavashistha.com

Learning from tweets: opportunities and challenges to inform policy making during dengue epidemic

F Shahid, SH Ony, TR Albi, S Chellappan… - Proceedings of the …, 2020 - dl.acm.org

Social media platforms are widely used by people to report, access, and share information
during outbreaks and epidemics. Although government agencies and healthcare institutions …

Opslaan Citeren Geciteerd door 27 Verwante artikelen Alle 8 versies

[Free GPT-4]

[PDF] aclanthology.org

Incorporating dialectal variability for socially equitable language identification

D Jurgens, Y Tsvetkov, D Jurafsky - … of the 55th Annual Meeting of …, 2017 - aclanthology.org

Abstract Language identification (LID) is a critical first step for processing multilingual text.
Yet most LID systems are not designed to handle the linguistic diversity of global platforms …

Opslaan Citeren Geciteerd door 115 Verwante artikelen Alle 6 versies HTML-versie

[Free GPT-4]

[PDF] aclanthology.org

BERTologiCoMix: How does code-mixing interact with multilingual BERT?

S Santy, A Srinivasan, M Choudhury - Proceedings of the Second …, 2021 - aclanthology.org

Abstract Models such as mBERT and XLMR have shown success in solving Code-Mixed
NLP tasks even though they were not exposed to such text during pretraining. Code-Mixed …

Opslaan Citeren Geciteerd door 42 Verwante artikelen HTML-versie

[Free GPT-4]

[PDF] arxiv.org

Hinge: A dataset for generation and evaluation of code-mixed hinglish text

V Srivastava, M Singh - arxiv preprint arxiv:2107.03760, 2021 - arxiv.org

Text generation is a highly active area of research in the computational linguistic community.
The evaluation of the generated text is a challenging task and multiple theories and metrics …

Opslaan Citeren Geciteerd door 45 Verwante artikelen Alle 6 versies HTML-versie

Melding maken

Citeren

Geavanceerd zoeken

Opgeslagen in Mijn bibliotheek

Estimating code-switching on twitter with a novel generalized word-level language detection...

Muril: Multilingual representations for indian languages

A systematic review on language identification of code-mixed text: techniques, data availability, challenges, and framework development

Automatic language identification in texts: A survey

GLUECoS: An evaluation benchmark for code-switched NLP

Language modeling for code-mixing: The role of linguistic theory based synthetic data

A survey of code-switched speech and language processing

Learning from tweets: opportunities and challenges to inform policy making during dengue epidemic

Incorporating dialectal variability for socially equitable language identification

BERTologiCoMix: How does code-mixing interact with multilingual BERT?

Hinge: A dataset for generation and evaluation of code-mixed hinglish text