Deep karaoke: Extracting vocals from musical mixtures using a convolutional deep neural network

AJR Simpson, G Roma, MD Plumbley - Latent Variable Analysis and …, 2015 - Springer
Identification and extraction of singing voice from within musical mixtures is a key challenge
in source separation and machine audition. Recently, deep neural networks (DNN) have …

Deep neural network based instrument extraction from music

S Uhlich, F Giron, Y Mitsufuji - 2015 IEEE International …, 2015 - ieeexplore.ieee.org
This paper deals with the extraction of an instrument from music by using a deep neural
network. As prior information, we only assume to know the instrument types that are present …

Deep neural networks for single channel source separation

EM Grais, MU Sen, H Erdogan - 2014 IEEE International …, 2014 - ieeexplore.ieee.org
In this paper, a novel approach for single channel source separation (SCSS) using a deep
neural network (DNN) architecture is introduced. Unlike previous studies in which DNN and …

Deep feature factorization for concept discovery

E Collins, R Achanta… - Proceedings of the …, 2018 - openaccess.thecvf.com
Abstract We propose Deep Feature Factorization (DFF), a method capable of localizing
similar semantic concepts within an image or a set of images. We use DFF to gain insight …

SEANet: A multi-modal speech enhancement network

M Tagliasacchi, Y Li, K Misiunas, D Roblek - arxiv preprint arxiv …, 2020 - arxiv.org
We explore the possibility of leveraging accelerometer data to perform speech enhancement
in very noisy conditions. Although it is possible to only partially reconstruct user's speech …

On the ideal ratio mask as the goal of computational auditory scene analysis

C Hummersone, T Stokes, T Brookes - Blind source separation: advances …, 2014 - Springer
The ideal binary mask (IBM) is widely considered to be the benchmark for time–frequency-
based sound source separation techniques such as computational auditory scene analysis …

NMF-based target source separation using deep neural network

TG Kang, K Kwon, JW Shin… - IEEE Signal Processing …, 2014 - ieeexplore.ieee.org
Non-negative matrix factorization (NMF) is one of the most well-known techniques that are
applied to separate a desired source from mixture data. In the NMF framework, a collection …

Monaural speech enhancement using deep neural networks by maximizing a short-time objective intelligibility measure

M Kolbæk, ZH Tan, J Jensen - 2018 IEEE International …, 2018 - ieeexplore.ieee.org
In this paper we propose a Deep Neural Network (D NN) based Speech Enhancement (SE)
system that is designed to maximize an approximation of the Short-Time Objective …

Monaural source separation in complex domain with long short-term memory neural network

Y Sun, Y **an, W Wang… - IEEE Journal of Selected …, 2019 - ieeexplore.ieee.org
In recent research, deep neural network (DNN) has been used to solve the monaural source
separation problem. According to the training objectives, DNN-based monaural speech …

Role of deep neural network in speech enhancement: A review

D Hepsiba, J Justin - … : Second International Conference, SLAAI-ICAI 2018 …, 2019 - Springer
This paper presents a review on different methodologies adopted in speech enhancement
and the role of Deep Neural Networks (DNN) in enhancement of speech. Mostly, a speech …