Speech recognition using domain knowledge

F Peng, B Shahshahani, HS Roy - US Patent 9,646,606, 2017 - Google Patents
GIOL I5/08(2006.01) In some implementations, data that indicates multiple can GIOL
15/18(2013.01) didate transcriptions for an utterance is received. For each of GIOL …

Improving deep learning based automatic speech recognition for Gujarati

D Raval, V Pathak, M Patel, B Bhatt - Transactions on Asian and Low …, 2021 - dl.acm.org
We present a novel approach for improving the performance of an End-to-End speech
recognition system for the Gujarati language. We follow a deep learning-based approach …

Large margin neural language model

J Huang, Y Li, W **, L Huang - arxiv preprint arxiv:1808.08987, 2018 - arxiv.org
We propose a large margin criterion for training neural language models. Conventionally,
neural language models are trained by minimizing perplexity (PPL) on grammatical …

Language modeling for code-switching: Evaluation, integration of monolingual data, and discriminative training

H Gonen, Y Goldberg - arxiv preprint arxiv:1810.11895, 2018 - arxiv.org
We focus on the problem of language modeling for code-switched language, in the context
of automatic speech recognition (ASR). Language modeling for code-switched language is …

Improving spoken language understanding by exploiting asr n-best hypotheses

M Li, W Ruan, X Liu, L Soldaini, W Hamza… - arxiv preprint arxiv …, 2020 - arxiv.org
In a modern spoken language understanding (SLU) system, the natural language
understanding (NLU) module takes interpretations of a speech from the automatic speech …

[PDF][PDF] Discriminative methods for noise robust speech recognition: A CHiME challenge benchmark

Y Tachioka, S Watanabe, J Le Roux… - The 2nd International …, 2013 - shadow.merl.com
The recently introduced second CHiME challenge is a difficult two-microphone speech
recognition task with non-stationary interference. Current approaches in the source …

Discriminative method for recurrent neural network language models

Y Tachioka, S Watanabe - 2015 IEEE International Conference …, 2015 - ieeexplore.ieee.org
A recurrent neural network language model (RNN-LM) can use a long word context more
than can an n-gram language model, and its effective has recently been shown in its …

Search results based n-best hypothesis rescoring with maximum entropy classification

F Peng, S Roy, B Shahshahani… - 2013 IEEE Workshop …, 2013 - ieeexplore.ieee.org
We propose a simple yet effective method for improving speech recognition by reranking the
N-best speech recognition hypotheses using search results. We model N-best reranking as …

Whole sentence neural language models

Y Huang, A Sethy, K Audhkhasi… - … on Acoustics, Speech …, 2018 - ieeexplore.ieee.org
Recurrent neural networks have become increasingly popular for the task of language
modeling achieving impressive gains in state-of-the-art speech recognition and natural …

[PDF][PDF] Turkish Resources for Visual Word Recognition.

B Erten, C Bozsahin, D Zeyrek - LREC, 2014 - academia.edu
We report two tools to conduct psycholinguistic experiments on Turkish words. KelimetriK
allows experimenters to choose words based on desired orthographic scores of word …