IndicNLPSuite: Monolingual corpora, evaluation benchmarks and pre-trained multilingual language models for Indian languages
In this paper, we introduce NLP resources for 11 major Indian languages from two major
language families. These resources include:(a) large-scale sentence-level monolingual …
language families. These resources include:(a) large-scale sentence-level monolingual …
[PDF][PDF] Findings of the 2014 workshop on statistical machine translation
This paper presents the results of the WMT14 shared tasks, which included a standard news
translation task, a separate medical translation task, a task for run-time estimation of …
translation task, a separate medical translation task, a task for run-time estimation of …
The iit bombay english-hindi parallel corpus
We present the IIT Bombay English-Hindi Parallel Corpus. The corpus is a compilation of
parallel corpora previously available in the public domain as well as new parallel corpora …
parallel corpora previously available in the public domain as well as new parallel corpora …
Overview of the 8th workshop on Asian translation
This paper presents the results of the shared tasks from the 8th workshop on Asian
translation (WAT2021). For the WAT2021, 28 teams participated in the shared tasks and 24 …
translation (WAT2021). For the WAT2021, 28 teams participated in the shared tasks and 24 …
Ai4bharat-indicnlp corpus: Monolingual corpora and word embeddings for indic languages
We present the IndicNLP corpus, a large-scale, general-domain corpus containing 2.7
billion words for 10 Indian languages from two language families. We share pre-trained …
billion words for 10 Indian languages from two language families. We share pre-trained …
Recent advances of low-resource neural machine translation
In recent years, neural network-based machine translation (MT) approaches have steadily
superseded the statistical MT (SMT) methods, and represents the current state-of-the-art in …
superseded the statistical MT (SMT) methods, and represents the current state-of-the-art in …
Neural machine translation: English to hindi
Machine Translation (MT) attempts to minimize the communication gap among people from
various linguistic backgrounds. Automatic translation between pair of different natural …
various linguistic backgrounds. Automatic translation between pair of different natural …
A new language-independent deep CNN for scene text detection and style transfer in social media images
P Shivakumara, A Banerjee, U Pal… - … on Image Processing, 2023 - ieeexplore.ieee.org
Due to the adverse effect of quality caused by different social media and arbitrary languages
in natural scenes, detecting text from social media images and transferring its style is …
in natural scenes, detecting text from social media images and transferring its style is …
A comparative analysis on Hindi and English extractive text summarization
Text summarization is the process of transfiguring a large documental information into a
clear and concise form. In this article, we present a detailed comparative study of various …
clear and concise form. In this article, we present a detailed comparative study of various …
Universal Dependency parsing for Hindi-English code-switching
Code-switching is a phenomenon of mixing grammatical structures of two or more
languages under varied social constraints. The code-switching data differ so radically from …
languages under varied social constraints. The code-switching data differ so radically from …