Google Академія

A Mehrish, N Majumder, R Bharadwaj, R Mihalcea… - Information …, 2023 - Elsevier

The field of speech processing has undergone a transformative shift with the advent of deep
learning. The use of multiple processing layers has enabled the creation of models capable …

Зберегти Послатися Цитовано в 242 джерелах Пов’язані статті Кількість версій: 7

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Deepfakes generation and detection: State-of-the-art, open challenges, countermeasures, and way forward

M Masood, M Nawaz, KM Malik, A Javed, A Irtaza… - Applied …, 2023 - Springer

Easy access to audio-visual content on social media, combined with the availability of
modern tools such as Tensorflow or Keras, and open-source trained models, along with …

Зберегти Послатися Цитовано в 435 джерелах Пов’язані статті Кількість версій: 11

[Free GPT-4]
[DeepSeek]

[PDF] mlr.press

Contentvec: An improved self-supervised speech representation by disentangling speakers

K Qian, Y Zhang, H Gao, J Ni, CI Lai… - International …, 2022 - proceedings.mlr.press

Self-supervised learning in speech involves training a speech representation network on a
large-scale unannotated speech corpus, and then applying the learned representations to …

Зберегти Послатися Цитовано в 126 джерелах Пов’язані статті Кількість версій: 9 Показати у форматі HTML

[Free GPT-4]
[DeepSeek]

[PDF] ieee.org

An overview of voice conversion and its challenges: From statistical modeling to deep learning

B Sisman, J Yamagishi, S King… - IEEE/ACM Transactions …, 2020 - ieeexplore.ieee.org

Speaker identity is one of the important characteristics of human speech. In voice
conversion, we change the speaker identity from one to another, while kee** the linguistic …

Зберегти Послатися Цитовано в 419 джерелах Пов’язані статті Кількість версій: 9

[Free GPT-4]
[DeepSeek]

[HTML] mdpi.com

[HTML][HTML] A review of synthetic image data and its use in computer vision

K Man, J Chahl - Journal of Imaging, 2022 - mdpi.com

Development of computer vision algorithms using convolutional neural networks and deep
learning has necessitated ever greater amounts of annotated and labelled data to produce …

Зберегти Послатися Цитовано в 83 джерелах Пов’язані статті Кількість версій: 6 Кеш

Faulty rolling bearing digital twin model and its application in fault diagnosis with imbalanced samples

Y Qin, H Liu, Y Mao - Advanced Engineering Informatics, 2024 - Elsevier

The simulation signals generated by the bearing dynamics model have a big gap with the
actual signals, which limits their efficacy in bearing fault diagnosis. Therefore, it is valuable …

Зберегти Послатися Цитовано в 31 джерелах Пов’язані статті Кількість версій: 2

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

An overview of affective speech synthesis and conversion in the deep learning era

A Triantafyllopoulos, BW Schuller… - Proceedings of the …, 2023 - ieeexplore.ieee.org

Speech is the fundamental mode of human communication, and its synthesis has long been
a core priority in human–computer interaction research. In recent years, machines have …

Зберегти Послатися Цитовано в 67 джерелах Пов’язані статті Кількість версій: 12

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Stargan-vc2: Rethinking conditional methods for stargan-based voice conversion

T Kaneko, H Kameoka, K Tanaka, N Hojo - ar**s
among multiple domains without relying on parallel data. This is important but challenging …

Зберегти Послатися Цитовано в 188 джерелах Пов’язані статті Кількість версій: 6 Показати у форматі HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Again-vc: A one-shot voice conversion using activation guidance and adaptive instance normalization

YH Chen, DY Wu, TH Wu, H Lee - ICASSP 2021-2021 IEEE …, 2021 - ieeexplore.ieee.org

Recently, voice conversion (VC) has been widely studied. Many VC systems use
disentangle-based learning techniques to separate the speaker and the linguistic content …

Зберегти Послатися Цитовано в 128 джерелах Пов’язані статті Кількість версій: 4

[Free GPT-4]
[DeepSeek]

[PDF] frontiersin.org

Data augmentation for deep neural networks model in EEG classification task: a review

C He, J Liu, Y Zhu, W Du - Frontiers in Human Neuroscience, 2021 - frontiersin.org

Classification of electroencephalogram (EEG) is a key approach to measure the rhythmic
oscillations of neural activity, which is one of the core technologies of brain-computer …

Зберегти Послатися Цитовано в 74 джерелах Пов’язані статті Кількість версій: 7 Кеш

Створити сповіщення

Послатися

Розширений пошук

Збережено в моїй бібліотеці

Cyclegan-vc2: Improved cyclegan-based non-parallel voice conversion

A review of deep learning techniques for speech processing

Deepfakes generation and detection: State-of-the-art, open challenges, countermeasures, and way forward

Contentvec: An improved self-supervised speech representation by disentangling speakers

An overview of voice conversion and its challenges: From statistical modeling to deep learning

[HTML][HTML] A review of synthetic image data and its use in computer vision

Faulty rolling bearing digital twin model and its application in fault diagnosis with imbalanced samples

An overview of affective speech synthesis and conversion in the deep learning era

Stargan-vc2: Rethinking conditional methods for stargan-based voice conversion

Again-vc: A one-shot voice conversion using activation guidance and adaptive instance normalization

Data augmentation for deep neural networks model in EEG classification task: a review