- Academic Search

E Cooper - 2019 - search.proquest.com

Text-to-speech synthesis is a key component of interactive, speech-based systems.
Typically, building a high-quality voice requires collecting dozens of hours of speech from a …

Cited by 28

[PDF] researchgate.net

[PDF] Subjective and Objective Evaluation of Speech Intelligibility Enhancement Under Constant Energy and Duration Constraints

Y Tang, M Cooke - Interspeech, 2011 - researchgate.net

Speakers appear to adopt strategies to improve speech intelligibility for interlocutors in
adverse acoustic conditions. Generated speech, whether synthetic, recorded or live, may …

Cited by 55

[PDF] nsf.gov

Utterance selection for optimizing intelligibility of TTS voices trained on ASR data

E Cooper, X Wang - Interspeech 2017, 2017 - par.nsf.gov

This paper describes experiments in training HMM-based text-to-speech (TTS) voices on
data collected for Automatic Speech Recognition (ASR) training. We compare a number of …

Cited by 27

[PDF] ed.ac.uk

Can objective measures predict the intelligibility of modified HMM-based synthetic speech in noise?

C Valentini-Botinhao, J Yamagishi… - Interspeech 2011: 12th …, 2011 - research.ed.ac.uk

Synthetic speech can be modified to improve intelligibility in noise. In order to perform
modifications automatically, it would be useful to have an objective measure that could …

Cited by 49

[PDF] ed.ac.uk

Intelligibility enhancement of HMM-generated speech in additive noise by modifying Mel cepstral coefficients to increase the glimpse proportion

C Valentini-Botinhao, J Yamagishi, S King… - Computer Speech & …, 2014 - Elsevier

This paper describes speech intelligibility enhancement for Hidden Markov Model (HMM)
generated synthetic speech in noise. We present a method for modifying the Mel cepstral …

Cited by 38

Multimodal physiological quality-of-experience assessment of text-to-speech systems

R Gupta, HJ Banville, TH Falk - IEEE Journal of Selected Topics …, 2016 - ieeexplore.ieee.org

With the growing complexity of various text-to-speech systems, it is becoming more
important to understand the underlying perceptual and judgement processes that drive user …

Cited by 20

[PDF] ed.ac.uk

Cepstral analysis based on the Glimpse proportion measure for improving the intelligibility of HMM-based synthetic speech in noise

C Valentini-Botinhao, R Maia… - … , Speech and Signal …, 2012 - ieeexplore.ieee.org

In this paper we introduce a new cepstral coefficient extraction method based on an
intelligibility measure for speech in noise, the Glimpse Proportion measure. This new …

Cited by 27

[PDF] arxiv.org

The Art of Storytelling: Multi-Agent Generative AI for Dynamic Multimodal Narratives

S Arif, T Arif, MS Haroon, AJ Khan, AA Raza… - arXiv preprint arXiv …, 2024 - arxiv.org

This paper introduces the concept of an education tool that utilizes Generative Artificial
Intelligence (GenAI) to enhance storytelling for children. The system combines GenAI-driven …

[HTML] uba.ar

Emilia: a speech corpus for Argentine Spanish text to speech synthesis

HM Torres, JA Gurlekian, DA Evin… - Language Resources …, 2019 - Springer

This paper introduces Emilia, a speech corpus created to build a female voice in Spanish
spoken in Buenos Aires for the Aromo text-to-speech system. Aromo is a unit selection text …

Cited by 8

Fusion of magnitude and phase-based features for objective evaluation of TTS voice

HB Sailor, HA Patil - The 9th International Symposium on …, 2014 - ieeexplore.ieee.org

This paper analyzes the distance-based objective measures for evaluation of Text-to-
Speech (TTS) systems (which is generally used objective measures). In this paper, we …

Cited by 9

Evaluation of objective measures for intelligibility prediction of HMM-based synthetic speech...

[BOOK][B] Text-to-speech synthesis using found data for low-resource languages
