[LIBRO][B] Audio source separation and speech enhancement

E Vincent, T Virtanen, S Gannot - 2018 - books.google.com
Learn the technology behind hearing aids, Siri, and Echo Audio source separation and
speech enhancement aim to extract one or more source signals of interest from an audio …

Learning to separate object sounds by watching unlabeled video

R Gao, R Feris, K Grauman - Proceedings of the European …, 2018 - openaccess.thecvf.com
Perceiving a scene most fully requires all the senses. Yet modeling how objects look and
sound is challenging: most natural scenes and events contain multiple objects, and the …

Text-driven separation of arbitrary sounds

K Kilgour, B Gfeller, Q Huang, A Jansen… - arxiv preprint arxiv …, 2022 - arxiv.org
We propose a method of separating a desired sound source from a single-channel mixture,
based on either a textual description or a short audio sample of the target source. This is …

Joint phoneme alignment and text-informed speech separation on highly corrupted speech

K Schulze-Forster, CSJ Doire… - ICASSP 2020-2020 …, 2020 - ieeexplore.ieee.org
Speech separation quality can be improved by exploiting textual information. However, this
usually requires text-to-speech alignment at phoneme level. Classical alignment methods …

Motion informed audio source separation

S Parekh, S Essid, A Ozerov… - … , Speech and Signal …, 2017 - ieeexplore.ieee.org
In this paper we tackle the problem of single channel audio source separation driven by
descriptors of the sounding object's motion. As opposed to previous approaches, motion is …

Weakly informed audio source separation

K Schulze-Forster, C Doire, G Richard… - 2019 IEEE Workshop …, 2019 - ieeexplore.ieee.org
Prior information about the target source can improve audio source separation quality but is
usually not available with the necessary level of audio alignment. This has limited its …

An introduction to multichannel NMF for audio source separation

A Ozerov, C Févotte, E Vincent - Audio Source Separation, 2018 - Springer
This chapter introduces multichannel nonnegative matrix factorization (NMF) methods for
audio source separation. All the methods and some of their extensions are introduced within …

Optimal condition training for target source separation

E Tzinis, G Wichern, P Smaragdis… - ICASSP 2023-2023 …, 2023 - ieeexplore.ieee.org
Recent research has shown remarkable performance in leveraging multiple extraneous
conditional and non-mutually-exclusive semantic concepts for sound source separation …

Variational Bayesian inference for source separation and robust feature extraction

K Adiloğlu, E Vincent - IEEE/ACM Transactions on Audio …, 2016 - ieeexplore.ieee.org
We consider the task of separating and classifying individual sound sources mixed together.
The main challenge is to achieve robust classification despite residual distortion of the …

Multi-channel audio source separation using multiple deformed references

N Souviraà-Labastie, A Olivero… - … /ACM Transactions on …, 2015 - ieeexplore.ieee.org
We present a general multi-channel source separation framework where additional audio
references are available for one (or more) source (s) of a given mixture. Each audio …