The listening talker: A review of human and algorithmic context-induced modifications of speech

M Cooke, S King, M Garnier, V Aubanel - Computer Speech & Language, 2014 - Elsevier
Speech output technology is finding widespread application, including in scenarios where
intelligibility might be compromised – at least for some listeners – by adverse conditions …

Clear speech perception: Linguistic and cognitive benefits

R Smiljanic - The handbook of speech perception, 2021 - Wiley Online Library
This chapter focuses on the characteristics and effectiveness of clear speech (CS) aimed at
enhancing intelligibility for adult interlocutors with perceptual difficulties arising from hearing …

Intelligibility-enhancing speech modifications: the hurricane challenge.

M Cooke, C Mayo, C Valentini-Botinhao - Interspeech, 2013 - isca-archive.org
Speech output is used extensively, including in situations where correct message reception
is threatened by adverse listening conditions. Recently, there has been a growing interest in …

Multimodal age and gender estimation for adaptive human-robot interaction: A systematic literature review

HA Younis, NIR Ruhaiyem, AA Badr, AK Abdul-Hassan… - Processes, 2023 - mdpi.com
Identifying the gender of a person and his age by way of speaking is considered a crucial
task in computer vision. It is a very important and active research topic with many areas of …

Enhancing speech intelligibility in text-to-speech synthesis using speaking style conversion

D Paul, MPV Shifas, Y Pantazis, Y Stylianou - arXiv preprint arXiv …, 2020 - arxiv.org
The increased adoption of digital assistants makes text-to-speech (TTS) synthesis systems
an indispensable feature of modern mobile devices. It is hence desirable to build a system …

Autoscore: An open-source automated tool for scoring listener perception of speech

SA Borrie, TS Barrett, SE Yoho - The Journal of the Acoustical Society …, 2019 - pubs.aip.org
Speech perception studies typically rely on trained research assistants to score orthographic
listener transcripts for words correctly identified. While the accuracy of the human scoring …

Speech intelligibility prediction using spectro-temporal modulation analysis

A Edraki, WY Chan, J Jensen… - IEEE/ACM transactions …, 2020 - ieeexplore.ieee.org
Spectro-temporal modulations are believed to mediate the analysis of speech sounds in the
human primary auditory cortex. Inspired by humans' robustness in comprehending speech …

AVSE challenge: Audio-visual speech enhancement challenge

ALA Blanco, C Valentini-Botinhao… - 2022 IEEE Spoken …, 2023 - ieeexplore.ieee.org
Audio-visual speech enhancement is the task of improving the quality of a speech signal
when video of the speaker is available. It opens up the opportunity of improving speech …

Acoustic correlates of speech intelligibility. The usability of the eGeMAPS feature set for atypical speech

W Xue, C Cucchiarini, R van Hout, H Strik - 2019 - repository.ubn.ru.nl
Although speech intelligibility has been studied in different fields such as speech pathology,
language learning, psycholinguistics, and speech synthesis, it is still unclear which concrete …

A uniform phase representation for the harmonic model in speech synthesis applications

G Degottex, D Erro - EURASIP Journal on Audio, Speech, and Music …, 2014 - Springer
Feature-based vocoders, e.g., STRAIGHT, offer a way to manipulate the perceived
characteristics of the speech signal in speech transformation and synthesis. For the …