Automatic speech recognition for under-resourced languages: A survey

L Besacier, E Barnard, A Karpov, T Schultz - Speech communication, 2014 - Elsevier
Speech processing for under-resourced languages is an active field of research, which has
experienced significant progress during the past decade. We propose, in this paper, a …

Arabic speech recognition using end‐to‐end deep learning

HA Alsayadi, AA Abdelhamid, I Hegazy… - IET Signal …, 2021 - Wiley Online Library
Arabic automatic speech recognition (ASR) methods with diacritics have the ability to be
integrated with other systems better than Arabic ASR methods without diacritics. In this work …

[PDF][PDF] Inducing the morphological lexicon of a natural language from unannotated text

MJP Creutz, KH Lagus - International and Interdisciplinary …, 2005 - researchportal.helsinki.fi
This work presents an algorithm for the unsupervised learning, or induction, of a simple
morphology of a natural language. A probabilistic maximum a posteriori model is utilized …

Unlimited vocabulary speech recognition with morph language models applied to Finnish

T Hirsimäki, M Creutz, V Siivola, M Kurimo… - Computer Speech & …, 2006 - Elsevier
In the speech recognition of highly inflecting or compounding languages, the traditional
word-based language modeling is problematic. As the number of distinct word forms can …

[PDF][PDF] Factored neural language models

A Alexandrescu, K Kirchhoff - Proceedings of the Human …, 2006 - aclanthology.org
We present a new type of neural probabilistic language model that learns a map** from
both words and explicit word features into a continuous space that is then used for word …

Cross-domain and cross-language portability of acoustic features estimated by multilayer perceptrons

A Stolcke, F Grezl, MY Hwang, X Lei… - … on Acoustics Speech …, 2006 - ieeexplore.ieee.org
Recent results with phone-posterior acoustic features estimated by multilayer perceptrons
(MLPs) have shown that such features can effectively improve the accuracy of state-of-the …

[PDF][PDF] Hindi POS tagger using naive stemming: harnessing morphological information without extensive linguistic knowledge

M Shrivastava, P Bhattacharyya - International Conference on …, 2008 - researchgate.net
Part of Speech tagging for Indian Languages in general and Hindi in particular is not a very
widely explored territory. There have been many attempts at develo** a good POS tagger …

Multilingual native language identification

S Malmasi, M Dras - Natural Language Engineering, 2017 - cambridge.org
We present the first comprehensive study of Native Language Identification (NLI) applied to
text written in languages other than English, using data from six languages. NLI is the task of …

Alternative structures for character-level RNNs

P Bojanowski, A Joulin, T Mikolov - arxiv preprint arxiv:1511.06303, 2015 - arxiv.org
Recurrent neural networks are convenient and efficient models for language modeling.
However, when applied on the level of characters instead of words, they suffer from several …

RENAR: A rule-based Arabic named entity recognition system

W Zaghouani - ACM Transactions on Asian Language Information …, 2012 - dl.acm.org
Named entity recognition has served many natural language processing tasks such as
information retrieval, machine translation, and question answering systems. Many …