A comprehensive survey on design and application of autoencoder in deep learning

P Li, Y Pei, J Li - Applied Soft Computing, 2023 - Elsevier
Autoencoder is an unsupervised learning model, which can automatically learn data
features from a large number of samples and can act as a dimensionality reduction method …

Deep learning approaches for data augmentation in medical imaging: a review

A Kebaili, J Lapuyade-Lahorgue, S Ruan - Journal of imaging, 2023 - mdpi.com
Deep learning has become a popular tool for medical image analysis, but the limited
availability of training data remains a major challenge, particularly in the medical field where …

[PDF][PDF] Understanding diffusion models: A unified perspective

C Luo - arxiv preprint arxiv:2208.11970, 2022 - papers.baulab.info
Given observed samples x from a distribution of interest, the goal of a generative model is to
learn to model its true data distribution p (x). Once learned, we can generate new samples …

Diffusion recommender model

W Wang, Y Xu, F Feng, X Lin, X He… - Proceedings of the 46th …, 2023 - dl.acm.org
Generative models such as Generative Adversarial Networks (GANs) and Variational Auto-
Encoders (VAEs) are widely utilized to model the generative process of user interactions …

Analog bits: Generating discrete data using diffusion models with self-conditioning

T Chen, R Zhang, G Hinton - arxiv preprint arxiv:2208.04202, 2022 - arxiv.org
We present Bit Diffusion: a simple and generic approach for generating discrete data with
continuous state and continuous time diffusion models. The main idea behind our approach …

Variational diffusion models

D Kingma, T Salimans, B Poole… - Advances in neural …, 2021 - proceedings.neurips.cc
Diffusion-based generative models have demonstrated a capacity for perceptually
impressive synthesis, but can they also be great likelihood-based models? We answer this …

Score-based generative modeling in latent space

A Vahdat, K Kreis, J Kautz - Advances in neural information …, 2021 - proceedings.neurips.cc
Score-based generative models (SGMs) have recently demonstrated impressive results in
terms of both sample quality and distribution coverage. However, they are usually applied …

NaturalSpeech: End-to-End Text-to-Speech Synthesis With Human-Level Quality

X Tan, J Chen, H Liu, J Cong, C Zhang… - … on Pattern Analysis …, 2024 - ieeexplore.ieee.org
Text-to-speech (TTS) has made rapid progress in both academia and industry in recent
years. Some questions naturally arise that whether a TTS system can achieve human-level …

A survey on neural speech synthesis

X Tan, T Qin, F Soong, TY Liu - arxiv preprint arxiv:2106.15561, 2021 - arxiv.org
Text to speech (TTS), or speech synthesis, which aims to synthesize intelligible and natural
speech given text, is a hot research topic in speech, language, and machine learning …