[HTML][HTML] Correcting diacritics and typos with a ByT5 transformer model

L Stankevičius, M Lukoševičius, J Kapočiūtė-Dzikienė… - Applied Sciences, 2022 - mdpi.com
Due to the fast pace of life and online communications and the prevalence of English and
the QWERTY keyboard, people tend to forgo using diacritics, make typographical errors …

Attentive Sequence-to-Sequence Learning for Diacritic Restoration of Yor\ub\'a Language Text

I Orife - arxiv preprint arxiv:1804.00832, 2018 - arxiv.org
Yor\ub\'a is a widely spoken West African language with a writing system rich in tonal and
orthographic diacritics. With very few exceptions, diacritics are omitted from electronic texts …

Arabic diacritic recovery using a feature-rich bilstm model

K Darwish, A Abdelali, H Mubarak… - Transactions on Asian and …, 2021 - dl.acm.org
Diacritics (short vowels) are typically omitted when writing Arabic text, and readers have to
reintroduce them to correctly pronounce words. There are two types of Arabic diacritics: The …

Diacritics generation and application in hate speech detection on Vietnamese social networks

P Le-Hong - Knowledge-Based Systems, 2021 - Elsevier
One of the challenging problems in text processing is diacritics generation where one needs
to generate diacritic marks for non-accented text. With an ever increasing amount of informal …

Vietnamese diacritics restoration using deep learning approach

BT Hung - 2018 10th International Conference on Knowledge …, 2018 - ieeexplore.ieee.org
This paper presents a solution for the insertion of diacritics into a text where they are
missing, especially for Vietnamese text. Missing diacritics on text is making it difficult for the …

On the use of machine translation-based approaches for vietnamese diacritic restoration

TH Pham, XK Pham, P Le-Hong - … International Conference on …, 2017 - ieeexplore.ieee.org
This paper presents an empirical study of two machine translation-based approaches for
Vietnamese diacritic restoration problem, including phrase-based and neural-based …

Deep learning based Vietnamese diacritics restoration

CH Nga, NK Thinh, PC Chang… - 2019 IEEE international …, 2019 - ieeexplore.ieee.org
Diacritics are very important in diacritical languages, because the meaning of sentences can
be changed in accordance to diacritics. Writing without diacritics makes the sentences …

[PDF][PDF] Integrating diacritics restoration and question classification into vietnamese question answering system

BT Hung - Adv. Sci. Technol. Eng. Syst. J, 2019 - researchgate.net
This paper presents a solution for question answering system for Vietnamese language by
integrating diacritics restoration and question classification via deep learning approach. It …

Machine translation approach for vietnamese diacritic restoration

TND Do, DB Nguyen, DK Mac… - … conference on asian …, 2013 - ieeexplore.ieee.org
The diacritic marks exist in many languages such as French, German, Slovak, Vietnamese,
etc. However for some reasons, sometime they are omitted in writing. This phenomenon may …

[KSIĄŻKA][B] Full and partial diacritic restoration: Development and impact on downstream applications

S Alqahtani - 2020 - search.proquest.com
Languages that include diacritics in speech but omit diacritics in writing to a certain degree
result in written texts that are even more ambiguous than typically expected. Not including …