A speech envelope landmark for syllable encoding in human superior temporal gyrus
The most salient acoustic features in speech are the modulations in its intensity, captured by
the amplitude envelope. Perceptually, the envelope is necessary for speech …
the amplitude envelope. Perceptually, the envelope is necessary for speech …
Interdependence of “What” and “When” in the Brain
From a brain's-eye-view, when a stimulus occurs and what it is are interrelated aspects of
interpreting the perceptual world. Yet in practice, the putative perceptual inferences about …
interpreting the perceptual world. Yet in practice, the putative perceptual inferences about …
Use of temporal information: Detection of periodicity, aperiodicity, and pitch in speech
In this paper, we present a time domain aperiodicity, periodicity, and pitch (APP) detector
that estimates 1) the proportion of periodic and aperiodic energy in a speech signal and 2) …
that estimates 1) the proportion of periodic and aperiodic energy in a speech signal and 2) …
Acoustic–phonetic analysis for speech recognition: A review
This paper reviews the literature related to the acoustic–phonetic analysis of speech and the
speech recognition approaches that use these types of knowledge. At first, acoustic …
speech recognition approaches that use these types of knowledge. At first, acoustic …
Acoustic parameters for automatic detection of nasal manner
Of all the sounds in any language, nasals are the only class of sounds with dominant speech
output from the nasal cavity as opposed to the oral cavity. This gives nasals some special …
output from the nasal cavity as opposed to the oral cavity. This gives nasals some special …
Detection of the closure-burst transitions of stops and affricates in continuous speech using the plosion index
TV Ananthapadmanabha, AP Prathosh… - The Journal of the …, 2014 - pubs.aip.org
Automatic and accurate detection of the closure-burst transition events of stops and
affricates serves many applications in speech processing. A temporal measure named the …
affricates serves many applications in speech processing. A temporal measure named the …
Investigation of different time–frequency representations for detection of fricatives
Fricatives are an important class of speech sounds which exhibit noisy characteristics with
dominant high-frequency components. Accurate detection of fricative segments in …
dominant high-frequency components. Accurate detection of fricative segments in …
A probabilistic framework for landmark detection based on phonetic features for automatic speech recognition
A Juneja, C Espy-Wilson - The Journal of the Acoustical Society of …, 2008 - pubs.aip.org
A probabilistic framework for a landmark-based approach to speech recognition is
presented for obtaining multiple landmark sequences in continuous speech. The landmark …
presented for obtaining multiple landmark sequences in continuous speech. The landmark …
[KSIĄŻKA][B] Speech recognition based on phonetic features and acoustic landmarks
A Juneja - 2004 - search.proquest.com
A probabilistic and statistical framework is presented for automatic speech recognition
based on a phonetic feature representation of speech sounds. In this acoustic-phonetic …
based on a phonetic feature representation of speech sounds. In this acoustic-phonetic …
Psst! prosodic speech segmentation with transformers
Self-attention mechanisms have enabled transformers to achieve superhuman-level
performance on many speech-to-text (STT) tasks, yet the challenge of automatic prosodic …
performance on many speech-to-text (STT) tasks, yet the challenge of automatic prosodic …