A review of differentiable digital signal processing for music and speech synthesis

B Hayes, J Shier, G Fazekas, A McPherson… - Frontiers in Signal …, 2024 - frontiersin.org
The term “differentiable digital signal processing” describes a family of techniques in which
loss function gradients are backpropagated through digital signal processors, facilitating …

Style transfer of audio effects with differentiable signal processing

CJ Steinmetz, NJ Bryan, JD Reiss - arxiv preprint arxiv:2207.08759, 2022 - arxiv.org
We present a framework that can impose the audio effects and production style from one
recording to another by example with the goal of simplifying the audio production process …

Differentiable artificial reverberation

S Lee, HS Choi, K Lee - IEEE/ACM Transactions on Audio …, 2022 - ieeexplore.ieee.org
Artificial reverberation (AR) models play a central role in various audio applications.
Therefore, estimating the AR model parameters (ARPs) of a reference reverberation is a …

[PDF][PDF] auraloss: Audio focused loss functions in PyTorch

CJ Steinmetz, JD Reiss - Digital music research network one-day …, 2020 - eecs.qmul.ac.uk
We present auraloss 1, a PyTorch package that implements time and frequency domain loss
functions designed for audio generation tasks. The package provides a straightforward …

DDSP-based singing vocoders: A new subtractive-based synthesizer and a comprehensive evaluation

DY Wu, WY Hsiao, FR Yang, O Friedman… - arxiv preprint arxiv …, 2022 - arxiv.org
A vocoder is a conditional audio generation model that converts acoustic features such as
mel-spectrograms into waveforms. Taking inspiration from Differentiable Digital Signal …

Automatic music mixing with deep learning and out-of-domain data

MA Martínez-Ramírez, WH Liao, G Fabbro… - arxiv preprint arxiv …, 2022 - arxiv.org
Music mixing traditionally involves recording instruments in the form of clean, individual
tracks and blending them into a final mixture using audio effects and expert knowledge (eg …

Efficient neural networks for real-time modeling of analog dynamic range compression

CJ Steinmetz, JD Reiss - arxiv preprint arxiv:2102.06200, 2021 - arxiv.org
Deep learning approaches have demonstrated success in modeling analog audio effects.
Nevertheless, challenges remain in modeling more complex effects that involve time-varying …

Hyper recurrent neural network: Condition mechanisms for black-box audio effect modeling

YT Yeh, WY Hsiao, YH Yang - arxiv preprint arxiv:2408.04829, 2024 - arxiv.org
Recurrent neural networks (RNNs) have demonstrated impressive results for virtual analog
modeling of audio effects. These networks process time-domain audio signals using a series …

Towards zero-shot amplifier modeling: One-to-many amplifier modeling via tone embedding control

YH Chen, YT Yeh, YC Cheng, JT Wu, YH Ho… - arxiv preprint arxiv …, 2024 - arxiv.org
Replicating analog device circuits through neural audio effect modeling has garnered
increasing interest in recent years. Existing work has predominantly focused on a one-to …

Reverse engineering of a recording mix with differentiable digital signal processing

JT Colonel, J Reiss - The Journal of the Acoustical Society of America, 2021 - pubs.aip.org
A method to retrieve the parameters used to create a multitrack mix using only raw tracks
and the stereo mixdown is presented. This method is able to model linear time-invariant …