Detection of glottal closure instants from speech signals: A quantitative review
The pseudo-periodicity of voiced speech can be exploited in several speech processing
applications. This requires however that the precise locations of the glottal closure instants …
applications. This requires however that the precise locations of the glottal closure instants …
Empirical mode decomposition for adaptive AM-FM analysis of speech: A review
This work reviews the advancements in the non-conventional analysis of speech signals,
particularly from an AM-FM analysis point of view. The benefits of such an analysis, as …
particularly from an AM-FM analysis point of view. The benefits of such an analysis, as …
COVAREP—A collaborative voice analysis repository for speech technologies
Speech processing algorithms are often developed demonstrating improvements over the
state-of-the-art, but sometimes at the cost of high complexity. This makes algorithm …
state-of-the-art, but sometimes at the cost of high complexity. This makes algorithm …
Glottal source processing: From analysis to applications
The great majority of current voice technology applications rely on acoustic features, such as
the widely used MFCC or LP parameters, which characterize the vocal tract response …
the widely used MFCC or LP parameters, which characterize the vocal tract response …
Epoch extraction based on integrated linear prediction residual using plosion index
AP Prathosh, TV Ananthapadmanabha… - … on Audio, Speech …, 2013 - ieeexplore.ieee.org
Epoch is defined as the instant of significant excitation within a pitch period of voiced
speech. Epoch extraction continues to attract the interest of researchers because of its …
speech. Epoch extraction continues to attract the interest of researchers because of its …
A comparative study of glottal source estimation techniques
Source-tract decomposition (or glottal flow estimation) is one of the basic problems of
speech processing. For this, several techniques have been proposed in the literature …
speech processing. For this, several techniques have been proposed in the literature …
The deterministic plus stochastic model of the residual signal and its applications
The modeling of speech production often relies on a source-filter approach. Although
methods parameterizing the filter have nowadays reached a certain maturity, there is still a …
methods parameterizing the filter have nowadays reached a certain maturity, there is still a …
Wavelet maxima dispersion for breathy to tense voice discrimination
J Kane, C Gobl - IEEE Transactions on Audio, Speech, and …, 2013 - ieeexplore.ieee.org
This paper proposes a new parameter, the Maxima Dispersion Quotient (MDQ), for
differentiating breathy to tense voice. Maxima derived following wavelet decomposition are …
differentiating breathy to tense voice. Maxima derived following wavelet decomposition are …
A deterministic plus stochastic model of the residual signal for improved parametric speech synthesis
Speech generated by parametric synthesizers generally suffers from a typical buzziness,
similar to what was encountered in old LPC-like vocoders. In order to alleviate this problem …
similar to what was encountered in old LPC-like vocoders. In order to alleviate this problem …
Traditional machine learning for pitch detection
Pitch detection is a fundamental problem in speech processing as F0 is used in a large
number of applications. Recent papers have proposed deep learning for robust pitch …
number of applications. Recent papers have proposed deep learning for robust pitch …