A comprehensive survey of ai-generated content (aigc): A history of generative ai from gan to chatgpt

Y Cao, S Li, Y Liu, Z Yan, Y Dai, PS Yu… - arxiv preprint arxiv …, 2023 - arxiv.org
Recently, ChatGPT, along with DALL-E-2 and Codex, has been gaining significant attention
from society. As a result, many individuals have become interested in related resources and …

A survey of ai-generated content (aigc)

Y Cao, S Li, Y Liu, Z Yan, Y Dai, P Yu, L Sun - ACM Computing Surveys, 2024 - dl.acm.org
Recently, Artificial Intelligence Generated Content (AIGC) has gained significant attention
from society, especially with the rise of Generative AI (GAI) techniques such as ChatGPT …

The listening talker: A review of human and algorithmic context-induced modifications of speech

M Cooke, S King, M Garnier, V Aubanel - Computer Speech & Language, 2014 - Elsevier
Speech output technology is finding widespread application, including in scenarios where
intelligibility might be compromised–at least for some listeners–by adverse conditions …

Evaluating the intelligibility benefit of speech modifications in known noise conditions

M Cooke, C Mayo, C Valentini-Botinhao… - Speech …, 2013 - Elsevier
The use of live and recorded speech is widespread in applications where correct message
reception is important. Furthermore, the deployment of synthetic speech in such applications …

An evaluation of intrusive instrumental intelligibility metrics

S Van Kuyk, WB Kleijn… - IEEE/ACM Transactions …, 2018 - ieeexplore.ieee.org
Instrumental intelligibility metrics are commonly used as an alternative to listening tests. This
paper evaluates 12 monaural intrusive intelligibility metrics: SII, HEGP, CSII, HASPI, NCM …

[PDF][PDF] Intelligibility-enhancing speech modifications: the hurricane challenge.

M Cooke, C Mayo, C Valentini-Botinhao - Interspeech, 2013 - isca-archive.org
Speech output is used extensively, including in situations where correct message reception
is threatened by adverse listening conditions. Recently, there has been a growing interest in …

Enhancing speech intelligibility in text-to-speech synthesis using speaking style conversion

D Paul, MPV Shifas, Y Pantazis, Y Stylianou - arxiv preprint arxiv …, 2020 - arxiv.org
The increased adoption of digital assistants makes text-to-speech (TTS) synthesis systems
an indispensable feature of modern mobile devices. It is hence desirable to build a system …

Speech intelligibility prediction using spectro-temporal modulation analysis

A Edraki, WY Chan, J Jensen… - IEEE/ACM transactions …, 2020 - ieeexplore.ieee.org
Spectro-temporal modulations are believed to mediate the analysis of speech sounds in the
human primary auditory cortex. Inspired by humans' robustness in comprehending speech …

[PDF][PDF] Intelligibility-Enhancing Speech Modifications-The Hurricane Challenge 2.0.

J Rennies, HF Schepker, C Valentini-Botinhao… - …, 2020 - researchgate.net
Understanding speech played back in noisy and reverberant conditions remains a
challenging task. This paper describes the Hurricane Challenge 2.0, the second large-scale …

Whispered and Lombard neural speech synthesis

Q Hu, T Bleisch, P Petkov, T Raitio… - 2021 IEEE Spoken …, 2021 - ieeexplore.ieee.org
It is desirable for a text-to-speech system to take into account the environment where
synthetic speech is presented, and provide appropriate context-dependent output to the …