It's raw! audio generation with state-space models

K Goel, A Gu, C Donahue, C Ré - … conference on machine …, 2022 - proceedings.mlr.press
Develo** architectures suitable for modeling raw audio is a challenging problem due to
the high sampling rates of audio waveforms. Standard sequence modeling approaches like …

A survey on neural speech synthesis

X Tan, T Qin, F Soong, TY Liu - ar**20a/**20a.pdf" data-clk="hl=lt&sa=T&oi=gga&ct=gga&cd=9&d=15645705670677592172&ei=L2zDZ869L5qU6rQPm7LfqQI" data-clk-atid="bHgNuWm2INkJ" target="_blank">[PDF] mlr.press

Waveflow: A compact flow-based model for raw audio

W **, K Peng, K Zhao, Z Song - … Conference on Machine …, 2020 - proceedings.mlr.press
In this work, we propose WaveFlow, a small-footprint generative flow for raw audio, which is
directly trained with maximum likelihood. It handles the long-range structure of 1-D …