Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
Deepfakes as a threat to a speaker and facial recognition: An overview of tools and attack vectors
Deepfakes present an emerging threat in cyberspace. Recent developments in machine
learning make deepfakes highly believable, and very difficult to differentiate between what is …
learning make deepfakes highly believable, and very difficult to differentiate between what is …
Usat: A universal speaker-adaptive text-to-speech approach
Conventional text-to-speech (TTS) research has predominantly focused on enhancing the
quality of synthesized speech for speakers in the training dataset. The challenge of …
quality of synthesized speech for speakers in the training dataset. The challenge of …
Tdass: Target domain adaptation speech synthesis framework for multi-speaker low-resource tts
Recently, synthesizing personalized speech by text-to-speech (TTS) application is highly
demanded. But the previous TTS models require a mass of target speaker speeches for …
demanded. But the previous TTS models require a mass of target speaker speeches for …
Metasid: Singer identification with domain adaptation for metaverse
Metaverse has stretched the real world into unlimited space. There will be more live concerts
in Metaverse. The task of singer identification is to identify the song belongs to which singer …
in Metaverse. The task of singer identification is to identify the song belongs to which singer …
Adaptive transformer-based conditioned variational autoencoder for incomplete social event classification
With the rapid development of the Internet and the expanding scale of social media,
incomplete social event classification has increasingly become a challenging task. The key …
incomplete social event classification has increasingly become a challenging task. The key …
Susing: Su-net for singing voice synthesis
Singing voice synthesis is a generative task that involves multi-dimensional control of the
singing model, including lyrics, pitch, and duration, and includes the timbre of the singer and …
singing model, including lyrics, pitch, and duration, and includes the timbre of the singer and …
[PDF][PDF] Fvtts: Face based voice synthesis for text-to-speech
A face is expressive of individual identity and used in various studies such as identification,
authentication, and personalization. Similarly, a voice is a means of expressing individuals …
authentication, and personalization. Similarly, a voice is a means of expressing individuals …
Pose guided human image synthesis with partially decoupled gan
Abstract Pose Guided Human Image Synthesis (PGHIS) is a challenging task of transforming
a human image from the reference pose to a target pose while preserving its style. Most …
a human image from the reference pose to a target pose while preserving its style. Most …
Semi-supervised learning based on reference model for low-resource tts
Most previous neural text-to-speech (TTS) methods are mainly based on supervised
learning methods, which means they depend on a large training dataset and hard to achieve …
learning methods, which means they depend on a large training dataset and hard to achieve …
Mdcnn-sid: Multi-scale dilated convolution network for singer identification
Most singer identification methods are processed in the frequency domain, which potentially
leads to information loss during the spectral transformation. In this paper, instead of the …
leads to information loss during the spectral transformation. In this paper, instead of the …