Power-normalized cepstral coefficients (PNCC) for robust speech recognition

C Kim, RM Stern - IEEE/ACM Transactions on audio, speech …, 2016 - ieeexplore.ieee.org
This paper presents a new feature extraction algorithm called power-normalized cepstral
coefficients (PNCC) that is motivated by auditory processing. Major new features of PNCC …

Multiresolution spectrotemporal analysis of complex sounds

T Chi, P Ru, SA Shamma - The Journal of the Acoustical Society of …, 2005 - pubs.aip.org
A computational model of auditory analysis is described that is inspired by psychoacoustical
and neurophysiological findings in early and central stages of the auditory system. The …

Cochlear implants: some likely next steps

BS Wilson, DT Lawson, JM Müller… - Annual Review of …, 2003 - annualreviews.org
The history of cochlear implants is marked by large improvements in performance,
especially over the past two decades and especially due to the development of ever-better …

Modeling auditory-nerve responses for high sound pressure levels in the normal and impaired auditory periphery

MSA Zilany, IC Bruce - The Journal of the Acoustical Society of …, 2006 - pubs.aip.org
This paper presents a computational model to simulate normal and impaired auditory-nerve
(AN) fiber responses in cats. The model responses match physiological data over a wider …

A computational model of human auditory signal processing and perception

ML Jepsen, SD Ewert, T Dau - The Journal of the Acoustical Society of …, 2008 - pubs.aip.org
A model of computational auditory signal-processing and perception that accounts for
various aspects of simultaneous and nonsimultaneous masking in human listeners is …

Neural Network Bottleneck Features for Language Identification

P Matejka, Le Zhang, T Ng, O Glembek, JZ Ma… - Odyssey, 2014 - isca-archive.org
This paper presents the application of Neural Network Bottleneck (BN) features in Language
Identification (LID). BN features are generally used for Large Vocabulary Speech …

Two new directions in speech processor design for cochlear implants

BS Wilson, R Schatzer, EA Lopez-Poveda, X Sun… - Ear and …, 2005 - journals.lww.com
Two new approaches to the design of speech processors for cochlear implants are
described. The first aims to represent “fine structure” or “fine frequency” information in a way …

SNR estimation based on amplitude modulation analysis with applications to noise suppression

J Tchorz, B Kollmeier - IEEE Transactions on Speech and …, 2003 - ieeexplore.ieee.org
A single-microphone noise suppression algorithm is described that is based on a novel
approach for the estimation of the signal-to-noise ratio (SNR) in different frequency …

Representation of the vowel /ε/ in normal and impaired auditory nerve fibers: model predictions of responses in cats

MSA Zilany, IC Bruce - The Journal of the Acoustical Society of …, 2007 - pubs.aip.org
The temporal response of auditory-nerve (AN) fibers to a steady-state vowel is investigated
using a computational auditory-periphery model. The model predictions are validated …

Perception of speech and sound

B Kollmeier, T Brand, B Meyer - Springer handbook of speech processing, 2008 - Springer
The transformation of acoustical signals into auditory sensations can be characterized by
psychophysical quantities such as loudness, tonality, or perceived pitch. The resolution …