Parp: Prune, adjust and re-prune for self-supervised speech recognition

CIJ Lai, Y Zhang, AH Liu, S Chang… - Advances in …, 2021 - proceedings.neurips.cc
Self-supervised speech representation learning (speech SSL) has demonstrated the benefit
of scale in learning rich representations for Automatic Speech Recognition (ASR) with …

[PDF][PDF] Language Identification of Short Text Segments with N-gram Models.

T Vatanen, JJ Väyrynen, S Virpioja - LREC, 2010 - lrec-conf.org
There are many accurate methods for language identification of long text samples, but
identification of very short strings still presents a challenge. This paper studies a language …

Democratizing neural machine translation with OPUS-MT

J Tiedemann, M Aulamo, D Bakshandaeva… - Language Resources …, 2024 - Springer
This paper presents the OPUS ecosystem with a focus on the development of open machine
translation models and tools, and their integration into end-user applications, development …

Importance of high-order n-gram models in morph-based speech recognition

T Hirsimaki, J Pylkkonen… - IEEE Transactions on …, 2009 - ieeexplore.ieee.org
Speech recognition systems trained for morphologically rich languages face the problem of
vocabulary growth caused by prefixes, suffixes, inflections, and compound words. Solutions …

[HTML][HTML] Advances in subword-based HMM-DNN speech recognition across languages

P Smit, S Virpioja, M Kurimo - Computer Speech & Language, 2021 - Elsevier
We describe a novel way to implement subword language models in speech recognition
systems based on weighted finite state transducers, hidden Markov models, and deep …

Improved subword modeling for WFST-based speech recognition

P Smit, S Virpioja, M Kurimo - Interspeech, 2017 - research.aalto.fi
Because in agglutinative languages the number of observed word forms is very high,
subword units are often utilized in speech recognition. However, the proper use of subword …

Principled comparisons for end-to-end speech recognition: Attention vs hybrid at the 1000-hour scale

A Rouhe, T Grósz, M Kurimo - IEEE/ACM Transactions on …, 2023 - ieeexplore.ieee.org
End-to-End speech recognition has become the center of attention for speech recognition
research, but Hybrid Hidden Markov Model Deep Neural Network (HMM/DNN)-systems …

[PDF][PDF] Morphology-aware statistical machine translation based on morphs induced in an unsupervised manner

S Virpioja, J Väyrynen, MJP Creutz… - Machine Translation …, 2007 - researchportal.helsinki.fi
In this paper, we apply a method of unsupervised morphology learning to a state-of-the-art
phrase-based statistical machine translation (SMT) system. In SMT, words are traditionally …

Computational intelligence in processing of speech acoustics: a survey

A Singh, N Kaur, V Kukreja, V Kadyan… - Complex & Intelligent …, 2022 - Springer
Speech recognition of a language is a key area in the field of pattern recognition. This paper
presents a comprehensive survey on the speech recognition techniques for non-Indian and …

Converting neural network language models into back-off language models for efficient decoding in automatic speech recognition

E Arısoy, SF Chen, B Ramabhadran… - IEEE/ACM Transactions …, 2013 - ieeexplore.ieee.org
Neural network language models (NNLMs) have achieved very good performance in large-
vocabulary continuous speech recognition (LVCSR) systems. Because decoding with …