Epoch extraction from telephone quality speech using single pole filter

CM Vikram, SRM Prasanna - IEEE/ACM Transactions on Audio …, 2017 - ieeexplore.ieee.org
Epoch extraction from speech involves the suppression of vocal tract resonances, either by
linear prediction based inverse filtering or filtering at very low frequency. Degradations due …

Epoch-synchronous overlap-add (ESOLA) for time-and pitch-scale modification of speech signals

S Rudresh, A Vasisht, K Vijayan… - ar** Distortions
MR Rajeswari, D Govind, SV Gangashetty… - IEEE …, 2024 - ieeexplore.ieee.org
Clip** is one of the non-linear distortions commonly introduced due to microphone
saturation during speech recording. Present work focuses on the effect of clip** in the task …

GMAT: Glottal closure instants detection based on the multiresolution absolute Teager–Kaiser energy operator

K Wu, D Zhang, G Lu - Digital Signal Processing, 2017 - Elsevier
Abstract Glottal Closure Instants (GCIs) detection is important to many speech applications.
However, most existing algorithms cannot achieve computational efficiency and accuracy …

Improving the flexibility of dynamic prosody modification using instants of significant excitation

D Govind, TT Joy - Circuits, Systems, and Signal Processing, 2016 - Springer
Modification of suprasegmental features such as pitch and duration of original speech by
fixed scaling factors is referred to as static prosody modification. In dynamic prosody …

Improved epoch based prosody modification by zero frequency filtering of gabor filtered telephonic speech

MR Rajeswari, D Govind… - 2023 National …, 2023 - ieeexplore.ieee.org
Prosodic features are supra-segmental features which span over longer segments in speech
signals. Modification of prosodic parameters is essential for some of the applications such as …

Improved method for epoch estimation in telephonic speech signals using zero frequency filtering

D Govind, R Vishnu, D Pravena - 2015 IEEE International …, 2015 - ieeexplore.ieee.org
Epochs are the locations correspond to glottal closure instants for voiced speech segments
and onset of bursts or frication in unvoiced segments. In the recent years, the zero frequency …

Importance of non-uniform prosody modification for speech recognition in emotion conditions

VVV Raju, HK Vydana, SV Gangashetty… - 2017 Asia-Pacific …, 2017 - ieeexplore.ieee.org
A mismatch in training and operating environments causes a performance degradation in
speech recognition systems (ASR). One major reason for this mismatch is due to the …