Epoch extraction from telephone quality speech using single pole filter
CM Vikram, SRM Prasanna - IEEE/ACM Transactions on Audio …, 2017 - ieeexplore.ieee.org
Epoch extraction from speech involves the suppression of vocal tract resonances, either by
linear prediction based inverse filtering or filtering at very low frequency. Degradations due …
linear prediction based inverse filtering or filtering at very low frequency. Degradations due …
Epoch-synchronous overlap-add (ESOLA) for time-and pitch-scale modification of speech signals
GMAT: Glottal closure instants detection based on the multiresolution absolute Teager–Kaiser energy operator
Abstract Glottal Closure Instants (GCIs) detection is important to many speech applications.
However, most existing algorithms cannot achieve computational efficiency and accuracy …
However, most existing algorithms cannot achieve computational efficiency and accuracy …
Improving the flexibility of dynamic prosody modification using instants of significant excitation
Modification of suprasegmental features such as pitch and duration of original speech by
fixed scaling factors is referred to as static prosody modification. In dynamic prosody …
fixed scaling factors is referred to as static prosody modification. In dynamic prosody …
Improved epoch based prosody modification by zero frequency filtering of gabor filtered telephonic speech
MR Rajeswari, D Govind… - 2023 National …, 2023 - ieeexplore.ieee.org
Prosodic features are supra-segmental features which span over longer segments in speech
signals. Modification of prosodic parameters is essential for some of the applications such as …
signals. Modification of prosodic parameters is essential for some of the applications such as …
Improved method for epoch estimation in telephonic speech signals using zero frequency filtering
Epochs are the locations correspond to glottal closure instants for voiced speech segments
and onset of bursts or frication in unvoiced segments. In the recent years, the zero frequency …
and onset of bursts or frication in unvoiced segments. In the recent years, the zero frequency …
Importance of non-uniform prosody modification for speech recognition in emotion conditions
A mismatch in training and operating environments causes a performance degradation in
speech recognition systems (ASR). One major reason for this mismatch is due to the …
speech recognition systems (ASR). One major reason for this mismatch is due to the …