Air traffic control communication (ATCC) speech corpora and their use for ASR and TTS development

L Šmídl, J Švec, D Tihelka, J Matoušek… - Language Resources …, 2019 - Springer
The paper introduces the motivation for creating dedicated speech corpora of air traffic
control communication, describes in detail the process of preparation of corpora for both …

Classification-based detection of glottal closure instants from speech signals

J Matoušek, D Tihelka - INTERSPEECH, Stockholm, Sweden, 2017 - isca-archive.org
In this paper a classification-based method for the automatic detection of glottal closure
instants (GCIs) from the speech signal is proposed. Peaks in the speech waveforms are …

Using extreme gradient boosting to detect glottal closure instants in speech signal

J Matoušek, D Tihelka - ICASSP 2019-2019 IEEE International …, 2019 - ieeexplore.ieee.org
In this paper, we continue to investigate the use of classifiers for the automatic detection of
glottal closure instants (GCIs) from the speech signal. We focus on extreme gradient …

Detection of glottal closure instant and glottal open region from speech signals using spectral flatness measure

SR Kadiri, RS Prasad, B Yegnanarayana - Speech Communication, 2020 - Elsevier
This paper proposes an approach using spectral flatness measure to detect the glottal
closure instant (GCI) and the glottal open region (GOR) within each glottal cycle in voiced …

A comparison of convolutional neural networks for glottal closure instant detection from raw speech

J Matoušek, D Tihelka - ICASSP 2021-2021 IEEE International …, 2021 - ieeexplore.ieee.org
In this paper, we continue to investigate the use of machine learning for the automatic
detection of glottal closure instants (GCIs) from raw speech. We compare several deep one …

Glottal Closure Instant Detection from Speech Signal Using Voting Classifier and Recursive Feature Elimination

J Matoušek, D Tihelka - Interspeech, 2018 - researchgate.net
In our previous work, we introduced a classification-based method for the automatic
detection of glottal closure instants (GCIs) from the speech signal and we showed it was …

System and method for speech recognition using pitch-synchronous spectral parameters

CJ Chen - US Patent 8,942,977, 2015 - Google Patents
The present invention defines a pitch-synchronous parametrical representation of speech
signals as the basis of speech recognition, and discloses methods of generating the said …

Modelling F0 Dynamics in Unit Selection Based Speech Synthesis

D Tihelka, J Matoušek, Z Hanzlíček - International Conference on Text …, 2014 - Springer
In the common unit selection implementations, F0 continuity is measured as one of
concatenation cost features with the expectation that smooth units transition (regarding …

Post-Stress Rise Trends Consideration in Unit Selection TTS

M Jůzová, J Volín - International Conference on Text, Speech, and …, 2018 - Springer
In spoken Czech language, the stress and post-stress syllables in human speech are
usually characterized by an increase in fundamental frequency F0 (except for phrase-final …

Pitch contours as predictors of audible concatenation artifacts

M Legát, J Matoušek - 2011 - otik.uk.zcu.cz
This paper deals with the traditional problem of the occurrence of audible discontinuities at
concatenation points at diphone boundaries in the concatenative speech synthesis. While …