A review of physical and perceptual feature extraction techniques for speech, music and environmental sounds

F Alías, JC Socoró, X Sevillano - Applied Sciences, 2016 - mdpi.com
Endowing machines with sensing capabilities similar to those of humans is a prevalent
quest in engineering and computer science. In the pursuit of making computers sense their …

A survey on automatic speech recognition systems for Portuguese language and its variations

TA de Lima, M Da Costa-Abreu - Computer Speech & Language, 2020 - Elsevier
Communication has been an essential part of being human and living in society. There are
several different languages and variations of them, so you can speak English in one place …

Multiresolution spectrotemporal analysis of complex sounds

T Chi, P Ru, SA Shamma - The Journal of the Acoustical Society of …, 2005 - pubs.aip.org
A computational model of auditory analysis is described that is inspired by psychoacoustical
and neurophysiological findings in early and central stages of the auditory system. The …

Modulation spectral features for speech emotion recognition using deep neural networks

P Singh, M Sahidullah, G Saha - Speech Communication, 2023 - Elsevier
This work explores the use of constant-Q transform based modulation spectral features
(CQT-MSF) for speech emotion recognition (SER). The human perception and analysis of sound …

Signal processing for music analysis

M Müller, DPW Ellis, A Klapuri… - IEEE Journal of Selected …, 2011 - ieeexplore.ieee.org
Music signal processing may appear to be the junior relation of the large and mature field of
speech signal processing, not least because many techniques and representations …

Spectro-temporal modulation transfer functions and speech intelligibility

T Chi, Y Gao, MC Guyton, P Ru… - The Journal of the …, 1999 - pubs.aip.org
Detection thresholds for spectral and temporal modulations are measured using broadband
spectra with sinusoidally rippled profiles that drift up or down the log-frequency axis at …

Features for content-based audio retrieval

D Mitrović, M Zeppelzauer, C Breiteneder - Advances in Computers, 2010 - Elsevier
Today, a large number of audio features exists in audio retrieval for different purposes, such
as automatic speech recognition, music information retrieval, audio segmentation, and …

Prediction of various soil properties for a national spatial dataset of Scottish soils based on four different chemometric approaches: A comparison of near infrared and …

RK Haghi, E Pérez-Fernández, AHJ Robertson - Geoderma, 2021 - Elsevier
Infrared spectroscopic techniques, in combination with chemometric approaches, have been
widely used to estimate different physical and chemical properties in soil samples. This …

Robust speech recognition using the modulation spectrogram

BED Kingsbury, N Morgan, S Greenberg - Speech Communication, 1998 - Elsevier
The performance of present-day automatic speech recognition (ASR) systems is seriously
compromised by levels of acoustic interference (such as additive noise and room …

What HMMs can do

JA Bilmes - IEICE Transactions on Information and Systems, 2006 - search.ieice.org
Since their inception almost fifty years ago, hidden Markov models (HMMs) have
become the predominant methodology for automatic speech recognition (ASR) systems …