Unsupervised training of acoustic models for large vocabulary continuous speech recognition

F Wessel, H Ney - IEEE Transactions on Speech and Audio …, 2004 - ieeexplore.ieee.org
For large vocabulary continuous speech recognition systems, the amount of acoustic
training data is of crucial importance. In the past, large amounts of speech were thus …

[PDF][PDF] Semi-supervised gmm and dnn acoustic model training with multi-system combination and confidence re-calibration.

Y Huang, D Yu, Y Gong, C Liu - Interspeech, 2013 - microsoft.com
We present our study on semi-supervised Gaussian mixture model (GMM) hidden Markov
model (HMM) and deep neural network (DNN) HMM acoustic model training. We analyze …

Quantifying the value of pronunciation lexicons for keyword search in lowresource languages

G Chen, S Khudanpur, D Povey, J Trmal… - … , Speech and Signal …, 2013 - ieeexplore.ieee.org
This paper quantifies the value of pronunciation lexicons in large vocabulary continuous
speech recognition (LVCSR) systems that support keyword search (KWS) in low resource …

Unsupervised acoustic model training

L Lamel, JL Gauvain, G Adda - 2002 IEEE International …, 2002 - ieeexplore.ieee.org
This paper describes some recent experiments using unsupervised techniques for acoustic
model training in order to reduce the system development cost. The approach uses a …

Unsupervised training on large amounts of broadcast news data

J Ma, S Matsoukas, O Kimball… - 2006 IEEE International …, 2006 - ieeexplore.ieee.org
This paper presents our recent effort that aims at improving our Arabic broadcast news (BN)
recognition system by using thousands of hours of un-transcribed Arabic audio in the way of …

Light supervision in acoustic model training

L Nguyen, B **ang - 2004 IEEE International Conference on …, 2004 - ieeexplore.ieee.org
We present a new light supervision method to derive additional acoustic training data
automatically for broadcast news transcription systems. A subset of the TDT corpus, which …

Indexing business processes based on annotated finite state automata

B Mahleko, A Wombacher - 2006 IEEE International …, 2006 - ieeexplore.ieee.org
The existing service discovery infrastructure with UDDI as the de facto standard, is limited in
that it does not support more complex searching based on matching business processes …

Graph-based semisupervised learning for acoustic modeling in automatic speech recognition

Y Liu, K Kirchhoff - IEEE/ACM Transactions on Audio, Speech …, 2016 - ieeexplore.ieee.org
In this paper, we investigate how to apply graph-based semisupervised learning to acoustic
modeling in speech recognition. Graph-based semisupervised learning is a widely used …

Transcription system using automatic speech recognition for the Japanese Parliament (Diet)

T Kawahara - Proceedings of the AAAI Conference on Artificial …, 2012 - ojs.aaai.org
This article describes a new automatic transcription system in the Japanese Parliament
which deploys our automatic speech recognition (ASR) technology. To achieve high …

An end-to-end model from speech to clean transcript for parliamentary meetings

M Mimura, S Sakai, T Kawahara - 2021 Asia-Pacific Signal and …, 2021 - ieeexplore.ieee.org
This paper presents an end-to-end approach for generating readable and clean text directly
from speech signal. While conventional automatic speech recognition (ASR) systems are …