Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
Audio deepfakes: A survey
Z Khanjani, G Watson, VP Janeja - Frontiers in Big Data, 2023 - frontiersin.org
A deepfake is content or material that is synthetically generated or manipulated using
artificial intelligence (AI) methods, to be passed off as real and can include audio, video …
artificial intelligence (AI) methods, to be passed off as real and can include audio, video …
A comprehensive survey on deep music generation: Multi-level representations, algorithms, evaluations, and future directions
S Ji, J Luo, X Yang - arxiv preprint arxiv:2011.06801, 2020 - arxiv.org
The utilization of deep learning techniques in generating various contents (such as image,
text, etc.) has become a trend. Especially music, the topic of this paper, has attracted …
text, etc.) has become a trend. Especially music, the topic of this paper, has attracted …
The singing voice conversion challenge 2023
We present the latest iteration of the voice conversion challenge (VCC) series, a bi-annual
scientific event aiming to compare and understand different voice conversion (VC) systems …
scientific event aiming to compare and understand different voice conversion (VC) systems …
A review of differentiable digital signal processing for music and speech synthesis
The term “differentiable digital signal processing” describes a family of techniques in which
loss function gradients are backpropagated through digital signal processors, facilitating …
loss function gradients are backpropagated through digital signal processors, facilitating …
Generative adversarial networks for speech processing: A review
Generative adversarial networks (GANs) have seen remarkable progress in recent years.
They are used as generative models for all kinds of data such as text, images, audio, music …
They are used as generative models for all kinds of data such as text, images, audio, music …
Diffsvc: A diffusion probabilistic model for singing voice conversion
Singing voice conversion (SVC) is one promising technique that can enrich the way of
human-computer interaction by en-dowing a computer the ability to produce high-fidelity and …
human-computer interaction by en-dowing a computer the ability to produce high-fidelity and …
Transforming spectrum and prosody for emotional voice conversion with non-parallel training data
Emotional voice conversion aims to convert the spectrum and prosody to change the
emotional patterns of speech, while preserving the speaker identity and linguistic content …
emotional patterns of speech, while preserving the speaker identity and linguistic content …
Fastsvc: Fast cross-domain singing voice conversion with feature-wise linear modulation
This paper presents FastSVC, a light-weight cross-domain singing voice conversion (SVC)
system, which can achieve high conversion performance, with inference speed 4x faster …
system, which can achieve high conversion performance, with inference speed 4x faster …
Vaw-gan for disentanglement and recomposition of emotional elements in speech
Emotional voice conversion (EVC) aims to convert the emotion of speech from one state to
another while preserving the linguistic content and speaker identity. In this paper, we study …
another while preserving the linguistic content and speaker identity. In this paper, we study …
Singing voice conversion with disentangled representations of singer and vocal technique using variational autoencoders
We propose a flexible framework that deals with both singer conversion and singers vocal
technique conversion. The proposed model is trained on non-parallel corpora …
technique conversion. The proposed model is trained on non-parallel corpora …