Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
Parallel learning: Overview and perspective for computational learning across Syn2Real and Sim2Real
Q Miao, Y Lv, M Huang, X Wang… - IEEE/CAA Journal of …, 2023 - ieeexplore.ieee.org
The virtual-to-real paradigm, ie, training models on virtual data and then applying them to
solve real-world problems, has attracted more and more attention from various domains by …
solve real-world problems, has attracted more and more attention from various domains by …
On the adoption of modern technologies to fight the COVID-19 pandemic: a technical synthesis of latest developments
A Majeed, X Zhang - COVID, 2023 - mdpi.com
In the ongoing COVID-19 pandemic, digital technologies have played a vital role to minimize
the spread of COVID-19, and to control its pitfalls for the general public. Without such …
the spread of COVID-19, and to control its pitfalls for the general public. Without such …
A semi-supervised complementary joint training approach for low-resource speech recognition
Both unpaired speech and text have shown to be beneficial for low-resource automatic
speech recognition (ASR), which, however were either separately used for pre-training, self …
speech recognition (ASR), which, however were either separately used for pre-training, self …
Generating data with text-to-speech and large-language models for conversational speech recognition
Currently, a common approach in many speech processing tasks is to leverage large scale
pre-trained models by fine-tuning them on in-domain data for a particular application. Yet …
pre-trained models by fine-tuning them on in-domain data for a particular application. Yet …
On the Problem of Text-To-Speech Model Selection for Synthetic Data Generation in Automatic Speech Recognition
The rapid development of neural text-to-speech (TTS) systems enabled its usage in other
areas of natural language processing such as automatic speech recognition (ASR) or …
areas of natural language processing such as automatic speech recognition (ASR) or …
Text is all you need: Personalizing ASR models using controllable speech synthesis
Adapting generic speech recognition models to specific individuals is a challenging problem
due to the scarcity of personalized data. Recent works have proposed boosting the amount …
due to the scarcity of personalized data. Recent works have proposed boosting the amount …
Phoneme hallucinator: One-shot voice conversion via set expansion
Voice conversion (VC) aims at altering a person's voice to make it sound similar to the voice
of another person while preserving linguistic content. Existing methods suffer from a …
of another person while preserving linguistic content. Existing methods suffer from a …
Refining Synthesized Speech Using Speaker Information and Phone Masking for Data Augmentation of Speech Recognition
While end-to-end automatic speech recognition (ASR) has shown impressive performance,
it requires a huge amount of speech and transcription data. The conversion of domain …
it requires a huge amount of speech and transcription data. The conversion of domain …
Can we use Common Voice to train a Multi-Speaker TTS system?
Training of multi-speaker text-to-speech (TTS) systems relies on curated datasets based on
high-quality recordings or audiobooks. Such datasets often lack speaker diversity and are …
high-quality recordings or audiobooks. Such datasets often lack speaker diversity and are …
On the effect of purely synthetic training data for different automatic speech recognition architectures
In this work we evaluate the utility of synthetic data for training automatic speech recognition
(ASR). We use the ASR training data to train a text-to-speech (TTS) system similar to …
(ASR). We use the ASR training data to train a text-to-speech (TTS) system similar to …