An overview of lead and accompaniment separation in music

Z Rafii, A Liutkus, FR Stöter, SI Mimilakis… - … on Audio, Speech …, 2018 - ieeexplore.ieee.org
Popular music is often composed of an accompaniment and a lead component, the latter
typically consisting of vocals. Filtering such mixtures to extract one or both components has …

[PDF][PDF] Open-unmix-a reference implementation for music source separation

FR Stöter, S Uhlich, A Liutkus… - Journal of Open Source …, 2019 - joss.theoj.org
Music source separation is the task of decomposing music into its constitutive components,
eg, yielding separated stems for the vocals, bass, and drums. Such a separation has many …

Single-channel audio source separation with NMF: divergences, constraints and algorithms

C Févotte, E Vincent, A Ozerov - Audio Source Separation, 2018 - Springer
Spectral decomposition by nonnegative matrix factorisation (NMF) has become state-of-the-
art practice in many audio signal processing tasks, such as source separation, enhancement …

A consolidated perspective on multimicrophone speech enhancement and source separation

S Gannot, E Vincent… - … /ACM Transactions on …, 2017 - ieeexplore.ieee.org
Speech enhancement and separation are core problems in audio signal processing, with
commercial applications in devices as diverse as mobile phones, conference call systems …

Speech enhancement with LSTM recurrent neural networks and its application to noise-robust ASR

F Weninger, H Erdogan, S Watanabe, E Vincent… - Latent Variable Analysis …, 2015 - Springer
We evaluate some recent developments in recurrent neural network (RNN) based speech
enhancement in the light of noise-robust automatic speech recognition (ASR). The proposed …

Deep unfolding: Model-based inspiration of novel deep architectures

JR Hershey, JL Roux, F Weninger - arxiv preprint arxiv:1409.2574, 2014 - arxiv.org
Model-based methods and deep neural networks have both been tremendously successful
paradigms in machine learning. In model-based methods, problem domain knowledge can …

Multichannel audio source separation with deep neural networks

AA Nugraha, A Liutkus, E Vincent - IEEE/ACM Transactions on …, 2016 - ieeexplore.ieee.org
This article addresses the problem of multichannel audio source separation. We propose a
framework where deep neural networks (DNNs) are used to model the source spectra and …

Improving music source separation based on deep neural networks through data augmentation and network blending

S Uhlich, M Porcu, F Giron, M Enenkl… - … on acoustics, speech …, 2017 - ieeexplore.ieee.org
This paper deals with the separation of music into individual instrument tracks which is
known to be a challenging problem. We describe two different deep neural network …

Towards scaling up classification-based speech separation

Y Wang, DL Wang - IEEE Transactions on Audio, Speech, and …, 2013 - ieeexplore.ieee.org
Formulating speech separation as a binary classification problem has been shown to be
effective. While good separation performance is achieved in matched test conditions using …

Automatic music transcription: challenges and future directions

E Benetos, S Dixon, D Giannoulis, H Kirchhoff… - Journal of Intelligent …, 2013 - Springer
Automatic music transcription is considered by many to be a key enabling technology in
music signal processing. However, the performance of transcription systems is still …