A comprehensive survey of AI-generated content (AIGC): A history of generative AI from GAN to ChatGPT
Recently, ChatGPT, along with DALL-E-2 and Codex, has been gaining significant attention
from society. As a result, many individuals have become interested in related resources and …
A survey of AI-generated content (AIGC)
Recently, Artificial Intelligence Generated Content (AIGC) has gained significant attention
from society, especially with the rise of Generative AI (GAI) techniques such as ChatGPT …
The listening talker: A review of human and algorithmic context-induced modifications of speech
Speech output technology is finding widespread application, including in scenarios where
intelligibility might be compromised–at least for some listeners–by adverse conditions …
Evaluating the intelligibility benefit of speech modifications in known noise conditions
The use of live and recorded speech is widespread in applications where correct message
reception is important. Furthermore, the deployment of synthetic speech in such applications …
An evaluation of intrusive instrumental intelligibility metrics
Instrumental intelligibility metrics are commonly used as an alternative to listening tests. This
paper evaluates 12 monaural intrusive intelligibility metrics: SII, HEGP, CSII, HASPI, NCM …
Intelligibility-enhancing speech modifications: the hurricane challenge.
Speech output is used extensively, including in situations where correct message reception
is threatened by adverse listening conditions. Recently, there has been a growing interest in …
Enhancing speech intelligibility in text-to-speech synthesis using speaking style conversion
The increased adoption of digital assistants makes text-to-speech (TTS) synthesis systems
an indispensable feature of modern mobile devices. It is hence desirable to build a system …
Speech intelligibility prediction using spectro-temporal modulation analysis
Spectro-temporal modulations are believed to mediate the analysis of speech sounds in the
human primary auditory cortex. Inspired by humans' robustness in comprehending speech …
Intelligibility-Enhancing Speech Modifications-The Hurricane Challenge 2.0.
Understanding speech played back in noisy and reverberant conditions remains a
challenging task. This paper describes the Hurricane Challenge 2.0, the second large-scale …
Whispered and Lombard neural speech synthesis
It is desirable for a text-to-speech system to take into account the environment where
synthetic speech is presented, and provide appropriate context-dependent output to the …