Automatic segmentation and labeling of speech based on Hidden Markov Models

F Brugnara, D Falavigna, M Omologo - Speech Communication, 1993 - Elsevier
An accurate database documentation at phonetic level is very important for speech
research: however, manual segmentation and labeling is a time consuming and error prone …

Automatic phonetic segmentation

DT Toledano, LAH Gómez… - IEEE transactions on …, 2003 - ieeexplore.ieee.org
This paper presents the results and conclusions of a thorough study on automatic phonetic
segmentation. It starts with a review of the state of the art in this field. Then, it analyzes the …

Assessing the accuracy of existing forced alignment software on varieties of British English

L MacKenzie, D Turton - Linguistics Vanguard, 2020 - degruyter.com
This paper presents an analysis of the performance and usability of automatic speech
processing tools on six different varieties of English spoken in the British Isles. The tools …

Comparing the performance of forced aligners used in sociophonetic research

S Gonzalez, J Grama, CE Travis - Linguistics Vanguard, 2020 - degruyter.com
Forced aligners have revolutionized sociophonetics, but while there are several forced
aligners available, there are few systematic comparisons of their performance. Here, we …

An optimal feature parameter set based on gated recurrent unit recurrent neural networks for speech segment detection

Ö BATUR DİNLER, N Aydin - Applied Sciences, 2020 - mdpi.com
Speech segment detection based on gated recurrent unit (GRU) recurrent neural networks
for the Kurdish language was investigated in the present study. The novelties of the current …

Forced alignment for Nordic languages: Rapidly constructing a high-quality prototype

NJ Young, M McGarrah - Nordic Journal of Linguistics, 2023 - cambridge.org
We propose a rapid adaptation of FAVE-Align to the Nordic languages, and we offer our own
adaptation to Swedish as a template. This study is motivated by the fact that researchers of …

Speaker-independent phoneme alignment using transition-dependent states

JP Hosom - Speech communication, 2009 - Elsevier
Determining the location of phonemes is important to a number of speech applications,
including training of automatic speech recognition systems, building text-to-speech systems …

HMM-based speech segmentation: Improvements of fully automatic approaches

S Brognaux, T Drugman - IEEE/ACM Transactions on Audio …, 2015 - ieeexplore.ieee.org
Speech segmentation refers to the problem of determining the phoneme boundaries from an
acoustic recording of an utterance together with its orthographic transcription. This paper …

[HTML][HTML] Analysis of forced aligner performance on L2 English speech

S Williams, P Foulkes, V Hughes - Speech Communication, 2024 - Elsevier
There is growing interest in how speech technologies perform on L2 speech. Largely
omitted from this discussion are tools used in the early data processing steps, such as forced …

Automatic time alignment of phonemes using acoustic-phonetic information

JP Hosom - 2000 - search.proquest.com
One requirement for researching and building spoken language systems is the availability of
speech data that have been labeled and time-aligned at the phonetic level. Although …