An overview of text-independent speaker recognition: From features to supervectors

T Kinnunen, H Li - Speech communication, 2010 - Elsevier
This paper gives an overview of automatic speaker recognition technology, with an
emphasis on text-independent recognition. Speaker recognition has been studied actively …

Ssd-6d: Making rgb-based 3d detection and 6d pose estimation great again

W Kehl, F Manhardt, F Tombari… - Proceedings of the …, 2017 - openaccess.thecvf.com
We present a novel method for detecting 3D model instances and estimating their 6D poses
from RGB data in a single shot. To this end, we extend the popular SSD paradigm to cover …

Healthcare audio event classification using hidden Markov models and hierarchical hidden Markov models

YT Peng, CY Lin, MT Sun… - 2009 IEEE International …, 2009 - ieeexplore.ieee.org
Audio is a useful modality complement to video for healthcare monitoring. In this paper, we
investigate the use of Hierarchical Hidden Markov Models (HHMMs) for healthcare audio …

Feature selection using singular value decomposition and QR factorization with column pivoting for text-independent speaker identification

S Chakroborty, G Saha - Speech Communication, 2010 - Elsevier
Selection of features is one of the important tasks in the application like Speaker
Identification (SI) and other pattern recognition problems. When multiple features are …

An investigation on the accuracy of truncated DKLT representation for speaker identification with short sequences of speech frames

G Biagetti, P Crippa, L Falaschetti… - IEEE transactions on …, 2016 - ieeexplore.ieee.org
Speaker identification plays a crucial role in biometric person identification as systems
based on human speech are increasingly used for the recognition of people. Mel frequency …

A sequence-to-sequence model for online signal detection and format recognition

L Cheng, H Zhu, Z Hu, B Luo - IEEE Signal Processing Letters, 2024 - ieeexplore.ieee.org
Signal detection and format recognition are critical and challenging tasks across civil and
military sectors. However, they often encounter signal truncation issues during online signal …

Regularized linear prediction of speech

LA Ekman, WB Kleijn, MN Murthi - IEEE transactions on audio …, 2008 - ieeexplore.ieee.org
All-pole spectral envelope estimates based on linear prediction (LP) for speech signals often
exhibit unnaturally sharp peaks, especially for high-pitch speakers. In this paper …

[PDF][PDF] Improving speech recognition rate through analysis parameters

D Eringis, G Tamulevičius - Electrical, Control and Communication …, 2014 - sciendo.com
Speech signal is redundant and non-stationary by nature. Because of vocal tract inertness
these variations are not very rapid and the signal can be considered as stationary in short …

Speaker identification with short sequences of speech frames

G Biagetti, P Crippa, A Curzi, S Orcioni… - … Conference on Pattern …, 2015 - scitepress.org
In biometric person identification systems, speaker identification plays a crucial role as the
voice is the more natural signal to produce and the simplest to acquire. Mel frequency …

GCI identification from voiced speech using the eigen value decomposition of Hankel matrix

P Jain, RB Pachori - … on Image and Signal Processing and …, 2013 - ieeexplore.ieee.org
In this paper, we present a novel method for robust and accurate identification of glottal
closure instants (GCIs) from the voiced speech signal. The proposed method employs a new …