Single-channel audio source separation with NMF: divergences, constraints and algorithms

C Févotte, E Vincent, A Ozerov - Audio Source Separation, 2018 - Springer
Spectral decomposition by nonnegative matrix factorisation (NMF) has become state-of-the-
art practice in many audio signal processing tasks, such as source separation, enhancement …

The 2013 signal separation evaluation campaign

N Ono, Z Koldovský, S Miyabe… - 2013 IEEE International …, 2013 - ieeexplore.ieee.org
This paper summarizes the 2013 community-based Signal Separation Evaluation Campaign
(SiSEC 2013). Five speech and music datasets were contributed, including two new …

Blind speech separation and enhancement with GCC-NMF

SUN Wood, J Rouat, S Dupont… - IEEE/ACM Transactions …, 2017 - ieeexplore.ieee.org
We present a blind source separation algorithm named GCC-NMF that combines
unsupervised dictionary learning via non-negative matrix factorization (NMF) with spatial …

Soft nonnegative matrix co-factorization

N Seichepine, S Essid, C Févotte… - IEEE Transactions on …, 2014 - ieeexplore.ieee.org
This work introduces a new framework for nonnegative matrix factorization (NMF) in
multisensor or multimodal data configurations, where taking into account the mutual …

Joint phoneme alignment and text-informed speech separation on highly corrupted speech

K Schulze-Forster, CSJ Doire… - ICASSP 2020-2020 …, 2020 - ieeexplore.ieee.org
Speech separation quality can be improved by exploiting textual information. However, this
usually requires text-to-speech alignment at phoneme level. Classical alignment methods …

[PDF][PDF] Text-informed speech enhancement with deep neural networks.

K Kinoshita, M Delcroix, A Ogawa, T Nakatani - INTERSPEECH, 2015 - kecl.ntt.co.jp
A speech signal captured by a distant microphone is generally contaminated by background
noise, which severely degrades the audible quality and intelligibility of the observed signal …

The flexible audio source separation toolbox version 2.0

Y Salaün, E Vincent, N Bertin, N Souviraa-Labastie… - ICASSP, 2014 - inria.hal.science
The Flexible Audio Source Separation Toolbox (FASST) is a toolbox for audio source
separation that relies on a general modeling and estimation framework that is applicable to …

Deep learning for audio and music

G Peeters, G Richard - Multi-faceted Deep Learning: Models and Data, 2021 - Springer
This chapter provides an overview of how deep learning techniques can be used for audio
signals. We first review the main DNN architectures, meta-architectures and training …

Unsupervised low latency speech enhancement with RT-GCC-NMF

SUN Wood, J Rouat - IEEE Journal of Selected Topics in Signal …, 2019 - ieeexplore.ieee.org
In this paper, we present RT-GCC-NMF: a real-time (RT), two-channel blind speech
enhancement algorithm that combines the non-negative matrix factorization (NMF) …

Cycle GAN-Based Audio Source Separation Using Time–Frequency Masking

S Joseph, R Rajan - Circuits, Systems, and Signal Processing, 2023 - Springer
Audio source separation is addressed using time–frequency filtering and conditional
adversarial networks. First, pitch tracks in the mixed audio are estimated using a multi-pitch …