Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
Speech technology progress based on new machine learning paradigm
Speech technologies have been developed for decades as a typical signal processing area,
while the last decade has brought a huge progress based on new machine learning …
while the last decade has brought a huge progress based on new machine learning …
Controllable emotion transfer for end-to-end speech synthesis
Emotion embedding space learned from references is a straight-forward approach for
emotion transfer in encoder-decoder structured emotional text to speech (TTS) systems …
emotion transfer in encoder-decoder structured emotional text to speech (TTS) systems …
Cross-speaker emotion disentangling and transfer for end-to-end speech synthesis
The cross-speaker emotion transfer task in text-to-speech (TTS) synthesis particularly aims
to synthesize speech for a target speaker with the emotion transferred from reference …
to synthesize speech for a target speaker with the emotion transferred from reference …
iemotts: Toward robust cross-speaker emotion transfer and control for speech synthesis based on disentanglement between prosody and timbre
Cross-speaker emotion transfer is a common approach to generating emotional speech
when speech data with emotion labels from target speakers is not available. This paper …
when speech data with emotion labels from target speakers is not available. This paper …
Controlling emotion strength with relative attribute for end-to-end speech synthesis
Recently, attention-based end-to-end speech synthesis has achieved superior performance
compared to traditional speech synthesis models, and several approaches like global style …
compared to traditional speech synthesis models, and several approaches like global style …
Multi-speaker emotional acoustic modeling for cnn-based speech synthesis
In this paper, we investigate multi-speaker emotional acoustic modeling methods for
convolutional neural network (CNN) based speech synthesis system. For emotion modeling …
convolutional neural network (CNN) based speech synthesis system. For emotion modeling …
Hierarchical multi-grained generative model for expressive speech synthesis
This paper proposes a hierarchical generative model with a multi-grained latent variable to
synthesize expressive speech. In recent years, fine-grained latent variables are introduced …
synthesize expressive speech. In recent years, fine-grained latent variables are introduced …
Model architectures to extrapolate emotional expressions in DNN-based text-to-speech
This paper proposes architectures that facilitate the extrapolation of emotional expressions
in deep neural network (DNN)-based text-to-speech (TTS). In this study, the meaning of …
in deep neural network (DNN)-based text-to-speech (TTS). In this study, the meaning of …
A review of affective generation models
Affective computing is an emerging interdisciplinary field where computational systems are
developed to analyze, recognize, and influence the affective states of a human. It can …
developed to analyze, recognize, and influence the affective states of a human. It can …
Controllable Multi-Speaker Emotional Speech Synthesis With Emotion Representation of High Generalization Capability
J Zheng, J Zhou, W Zheng, L Tao… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
The aim of multi-speaker emotional speech synthesis is to generate speech for a designated
speaker in a desired emotional state. The task is challenging due to the presence of speech …
speaker in a desired emotional state. The task is challenging due to the presence of speech …