Acoustic–phonetic analysis for speech recognition: A review
This paper reviews the literature related to the acoustic–phonetic analysis of speech and the
speech recognition approaches that use these types of knowledge. At first, acoustic …
speech recognition approaches that use these types of knowledge. At first, acoustic …
Speech/music classification using speech-specific features
This paper proposes the use of speech-specific features for speech/music classification.
Features representing the excitation source, vocal tract system and syllabic rate of speech …
Features representing the excitation source, vocal tract system and syllabic rate of speech …
[PDF][PDF] Estimation of Hypernasality Scores from Cleft Lip and Palate Speech.
Hypernasality refers to the perception of excessive nasal resonances in vowels and voiced
consonants. Existing speech processing based approaches concentrate only on the …
consonants. Existing speech processing based approaches concentrate only on the …
Importance of supra-segmental information and self-supervised framework for spoken language diarization task
Spoken language diarization (LD) is a task of automatically extracting the monolingual
segments present in a given code-switched utterance. Generally in the bilingual code …
segments present in a given code-switched utterance. Generally in the bilingual code …
Glottal activity detection from the speech signal using multifractal analysis
This work proposes a novel method for the detection of glottal activity regions from the
speech signal. Glottal activity detection refers to the problem of discriminating voiced and …
speech signal. Glottal activity detection refers to the problem of discriminating voiced and …
Effect of Modeling Glottal Activity Parameters on Zero-Shot Children's ASR
S Shahnawazuddin - IEEE/ACM Transactions on Audio …, 2024 - ieeexplore.ieee.org
The primary objective of this study is to enhance the recognition performance of zero-shot
children's automatic speech recognition (ASR) task. In such a setup, statistical models are …
children's automatic speech recognition (ASR) task. In such a setup, statistical models are …
Analyzing the vocal tract characteristics for out-of-breath speech
In this work, vocal tract characteristic changes under the out-of-breath condition are
explored. Speaking under the influence of physical exercise is called out-of-breath speech …
explored. Speaking under the influence of physical exercise is called out-of-breath speech …
[PDF][PDF] Zero Frequency Filter Based Analysis of Voice Disorders.
Pitch period and amplitude perturbations are widely used parameters to discriminate normal
and voice disorder speech. Instantaneous pitch period and amplitude of glottal vibrations …
and voice disorder speech. Instantaneous pitch period and amplitude of glottal vibrations …
Effect of Modeling Glottal Activity Parameters on Zero-Shot Children's ASR
The primary objective of this study is to enhance the recognition performance of zero-shot
children's automatic speech recognition (ASR) task. In such a setup, statistical models are …
children's automatic speech recognition (ASR) task. In such a setup, statistical models are …
Improved voicing decision using glottal activity features for statistical parametric speech synthesis
A method to improve voicing decision using glottal activity features proposed for statistical
parametric speech synthesis. In existing methods, voicing decision relies mostly on …
parametric speech synthesis. In existing methods, voicing decision relies mostly on …