[KİTAP][B] Supervised sequence labelling

A Graves, A Graves - 2012 - Springer
This chapter provides the background material and literature review for supervised
sequence labelling. Section 2.1 briefly reviews supervised learning in general. Section 2.2 …

[PDF][PDF] Comparison of four approaches to automatic language identification of telephone speech

MA Zissman - IEEE Transactions on speech and audio …, 1996 - researchgate.net
We have compared the performance of four approaches for automatic language
identification of speech utterances: Gaussian mixture model (GMM) classification; single …

A probabilistic framework for segment-based speech recognition

JR Glass - Computer Speech & Language, 2003 - Elsevier
Most current speech recognizers use an observation space based on a temporal sequence
of measurements extracted from fixed-length “frames”(eg, Mel-cepstra). Given a hypothetical …

[PDF][PDF] Heterogeneous acoustic measurements and multiple classifiers for speech recognition

AK Halberstadt - 1999 - dspace.mit.edu
The acoustic-phonetic modeling component of most current speech recognition systems
calculates a small set of homogeneous frame-based measurements at a single, fixed time …

[PDF][PDF] Automatic pitch contour stylization using a model of tonal perception

C d'Alessandro, P Mertens - Computer Speech and Language, 1995 - researchgate.net
A new quantitative model of tonal perception for continuous speech is described. The paper
illustrates its ability for automatic stylization of pitch contors, with applicatios to prosodic …

Phone recognition on the TIMIT database

C Lopes, F Perdigao - Speech Technologies/Book, 2011 - books.google.com
In the information age, computer applications have become part of modern life and this has
in turn encouraged the expectations of friendly interaction with them. Speech, as “the” …

A probabilistic framework for feature-based speech recognition

J Glass, J Chang, M McCandless - Proceeding of Fourth …, 1996 - ieeexplore.ieee.org
Most current speech recognizers use an observation space which is based on a temporal
sequence of" frames"(eg Mel-cepstra). There is another class of recognizer which further …

Exemplar-based sparse representation features: From TIMIT to LVCSR

TN Sainath, B Ramabhadran, M Picheny… - … on Audio, Speech …, 2011 - ieeexplore.ieee.org
The use of exemplar-based methods, such as support vector machines (SVMs), k-nearest
neighbors (kNNs) and sparse representations (SRs), in speech recognition has thus far …

[KİTAP][B] Automatic time alignment of phonemes using acoustic-phonetic information

JP Hosom - 2000 - search.proquest.com
One requirement for researching and building spoken language systems is the availability of
speech data that have been labeled and time-aligned at the phonetic level. Although …

[PDF][PDF] Sparse connection and pruning in large dynamic artificial neural networks

N Ström - Fifth European Conference on Speech Communication …, 1997 - Citeseer
This paper presents new methods for training large neural networks for phoneme probability
estimation. A combination of the time-delay architecture and the recurrent network …