Histogram equalization of speech representation for robust speech recognition

A De La Torre, AM Peinado, JC Segura… - … on Speech and …, 2005 - ieeexplore.ieee.org
This paper describes a method of compensating for nonlinear distortions in speech
representation caused by noise. The method described here is based on the histogram …

Enhancement of log mel power spectra of speech using a phase-sensitive model of the acoustic environment and sequential estimation of the corrupting noise

L Deng, J Droppo, A Acero - IEEE Transactions on Speech and …, 2004 - ieeexplore.ieee.org
This paper presents a novel speech feature enhancement technique based on a
probabilistic, nonlinear acoustic environment model that effectively incorporates the phase …

[PDF][PDF] One-Pass Single-Channel Noisy Speech Recognition Using a Combination of Noisy and Enhanced Features.

M Fujimoto, H Kawai - Interspeech, 2019 - isca-archive.org
This paper introduces a method of noise-robust automatic speech recognition (ASR) that
remains effective under one-pass single-channel processing. Under these constraints, the …

DNN-based feature enhancement using DOA-constrained ICA for robust speech recognition

HY Lee, JW Cho, M Kim… - IEEE Signal Processing …, 2016 - ieeexplore.ieee.org
The performance of automatic speech recognition (ASR) system is often degraded in
adverse real-world environments. In recent times, deep learning has successfully emerged …

Speech recognition using HMM with MFCC-An analysis using frequency specral decomposion technique

I Patel - Signal & Image Processing: An International Journal …, 2010 - papers.ssrn.com
This paper presents an approach to the recognition of speech signal using frequency
spectral information with Mel frequency for the improvement of speech feature …

Boltzmann–Dirichlet process mixture: a mathematical model for speech recognition

TR Kumar, DV Babu, P Malarvezhi… - Journal of Physics …, 2021 - iopscience.iop.org
This article deliberates a mathematical model for the estimation of speech signals probability
density function. Speech recognition is analyzed using an integration of Boltzmann …

Speech enhancement using hidden Markov models in Mel-frequency domain

H Veisi, H Sameti - Speech Communication, 2013 - Elsevier
Hidden Markov model (HMM)-based minimum mean square error speech enhancement
method in Mel-frequency domain is focused on and a parallel cepstral and spectral (PCS) …

Estimating cepstrum of speech under the presence of noise using a joint prior of static and dynamic features

L Deng, J Droppo, A Acero - IEEE Transactions on Speech and …, 2004 - ieeexplore.ieee.org
In this paper, we present a new algorithm for statistical speech feature enhancement in the
cepstral domain. The algorithm exploits joint prior distributions (in the form of Gaussian …

Independent vector analysis followed by HMM-based feature enhancement for robust speech recognition

JW Cho, HM Park - Signal Processing, 2016 - Elsevier
This paper presents a feature-enhancement method that uses the outputs of independent
vector analysis (IVA) for robust speech recognition. Although frequency-domain (FD) …

Noise robust voice activity detection based on switching Kalman filter

M Fujimoto, K Ishizuka - IEICE transactions on information and …, 2008 - search.ieice.org
This paper addresses the problem of voice activity detection (VAD) in noisy environments.
The VAD method proposed in this paper is based on a statistical model approach, and …