A comprehensive survey on design and application of autoencoder in deep learning

P Li, Y Pei, J Li - Applied Soft Computing, 2023 - Elsevier
Autoencoder is an unsupervised learning model, which can automatically learn data
features from a large number of samples and can act as a dimensionality reduction method …

Deep generative modelling: A comparative review of vaes, gans, normalizing flows, energy-based and autoregressive models

S Bond-Taylor, A Leach, Y Long… - IEEE transactions on …, 2021 - ieeexplore.ieee.org
Deep generative models are a class of techniques that train deep neural networks to model
the distribution of training samples. Research has fragmented into various interconnected …

Variational diffusion models

D Kingma, T Salimans, B Poole… - Advances in neural …, 2021 - proceedings.neurips.cc
Diffusion-based generative models have demonstrated a capacity for perceptually
impressive synthesis, but can they also be great likelihood-based models? We answer this …

Cascaded diffusion models for high fidelity image generation

J Ho, C Saharia, W Chan, DJ Fleet, M Norouzi… - Journal of Machine …, 2022 - jmlr.org
We show that cascaded diffusion models are capable of generating high fidelity images on
the class-conditional ImageNet generation benchmark, without any assistance from auxiliary …

Score-based generative modeling through stochastic differential equations

Y Song, J Sohl-Dickstein, DP Kingma, A Kumar… - arxiv preprint arxiv …, 2020 - arxiv.org
Creating noise from data is easy; creating data from noise is generative modeling. We
present a stochastic differential equation (SDE) that smoothly transforms a complex data …

Improved denoising diffusion probabilistic models

AQ Nichol, P Dhariwal - International conference on machine …, 2021 - proceedings.mlr.press
Denoising diffusion probabilistic models (DDPM) are a class of generative models which
have recently been shown to produce excellent samples. We show that with a few simple …

[HTML][HTML] Coarse-to-fine video instance segmentation with factorized conditional appearance flows

Z Qin, X Lu, X Nie, D Liu, Y Yin, W Wang - IEEE/CAA Journal of …, 2023 - ieee-jas.net
We introduce a novel method using a new generative model that automatically learns
effective representations of the target and background appearance to detect, segment and …

Videogpt: Video generation using vq-vae and transformers

W Yan, Y Zhang, P Abbeel, A Srinivas - arxiv preprint arxiv:2104.10157, 2021 - arxiv.org
We present VideoGPT: a conceptually simple architecture for scaling likelihood based
generative modeling to natural videos. VideoGPT uses VQ-VAE that learns downsampled …

Denoising diffusion probabilistic models

J Ho, A Jain, P Abbeel - Advances in neural information …, 2020 - proceedings.neurips.cc
We present high quality image synthesis results using diffusion probabilistic models, a class
of latent variable models inspired by considerations from nonequilibrium thermodynamics …

NVAE: A deep hierarchical variational autoencoder

A Vahdat, J Kautz - Advances in neural information …, 2020 - proceedings.neurips.cc
Normalizing flows, autoregressive models, variational autoencoders (VAEs), and deep
energy-based models are among competing likelihood-based frameworks for deep …