Features for content-based audio retrieval

D Mitrović, M Zeppelzauer, C Breiteneder - Advances in computers, 2010 - Elsevier
Today, a large number of audio features exists in audio retrieval for different purposes, such
as automatic speech recognition, music information retrieval, audio segmentation, and …

[PDF][PDF] Automated speech recognition system—A literature review

M Manjutha, J Gracy, P Subashini… - Computational …, 2017 - researchgate.net
Most natural form of human communication depends on speech. In order to understand
human speech by enabling machines, computers can act as an intermediate for human …

Bird call classification using dnn-based acoustic modelling

R Rajan, J Johnson, N Abdul Kareem - Circuits, Systems, and Signal …, 2022 - Springer
Bird call recognition using deep neural network-hidden Markov model (DNN-HMM)-based
transcription is proposed. The work is an attempt to adapt the human speech recognition …

Significance of spectral cues in automatic speech segmentation for Indian language speech synthesizers

A Baby, JJ Prakash, AS Subramanian… - Speech Communication, 2020 - Elsevier
Building speech synthesis systems for Indian languages is challenging owing to the fact that
digital resources for these languages are hardly available. Vocabulary independent speech …

[PDF][PDF] A hybrid approach to segmentation of speech using group delay processing and HMM based embedded reestimation.

SA Shanmugam, HA Murthy - INTERSPEECH, 2014 - isca-archive.org
The most popular method for automatic segmentation is embedded reestimation of
monophone HMMs after flat start initialization, followed by forced alignment. This method …

[PDF][PDF] A syllable based continuous speech recognizer for Tamil.

A Lakshmi, HA Murthy - Interspeech, 2006 - researchgate.net
This paper presents a novel technique for building a syllable based continuous speech
recognizer when unannotated transcribed train data is available. We present two different …

Phoneme boundary detection from speech: A rule based approach

PB Ramteke, SG Koolagudi - Speech Communication, 2019 - Elsevier
In this paper, a novel approach has been proposed for the automatic segmentation of
speech signal into phonemes. In a well spoken word, phonemes can be characterized by …

Natural sounding TTS based on syllable-like units

S Thomas, MN Rao, HA Murthy… - 2006 14th European …, 2006 - ieeexplore.ieee.org
In this work we describe a new “syllable-like” speech unit that is suitable for concatenative
speech synthesis. These units are automatically generated using a group delay based …

Using polysyllabic units for text to speech synthesis in indian languages

MV Vinodh, A Bellur, KB Narayan… - 2010 National …, 2010 - ieeexplore.ieee.org
This paper describes the design and development of Indian language Text-To-Speech (TTS)
synthesis systems, using polysyllabic units. Firstly, a phone based TTS is built. Later, a …

Vowel detection using a perceptually-enhanced spectrum matching conditioned to phonetic context and speaker identity

HB Kashani, A Sayadiyan, H Sheikhzadeh - Speech Communication, 2017 - Elsevier
Vowel detection methods usually adopt a two-stage procedure for detecting vowel
landmarks. First, a temporal objective contour (TOC), as a time-varying measure of vowel …