UNSSOR: unsupervised neural speech separation by leveraging over-determined training mixtures
In reverberant conditions with multiple concurrent speakers, each microphone acquires a
mixture signal of multiple speakers at a different location. In over-determined conditions …
mixture signal of multiple speakers at a different location. In over-determined conditions …
Neural full-rank spatial covariance analysis for blind source separation
This paper describes aneural blind source separation (BSS) method based on amortized
variational inference (AVI) of a non-linear generative model of mixture signals. A classical …
variational inference (AVI) of a non-linear generative model of mixture signals. A classical …
Spatial loss for unsupervised multi-channel source separation
We propose a spatial loss for unsupervised multi-channel source separation. The proposed
loss exploits the duality of direction of arrival (DOA) and beamforming: the steering and …
loss exploits the duality of direction of arrival (DOA) and beamforming: the steering and …
Surrogate source model learning for determined source separation
We propose to learn surrogate functions of universal speech priors for determined blind
speech separation. Deep speech priors are highly desirable due to their superior modelling …
speech separation. Deep speech priors are highly desirable due to their superior modelling …
Self-remixing: Unsupervised speech separation via separation and remixing
We present Self-Remixing, a novel self-supervised speech separation method, which refines
a pre-trained separation model in an unsupervised manner. Self-Remixing consists of a …
a pre-trained separation model in an unsupervised manner. Self-Remixing consists of a …
[PDF][PDF] Weakly-Supervised Neural Full-Rank Spatial Covariance Analysis for a Front-End System of Distant Speech Recognition.
This paper presents a weakly-supervised multichannel neural speech separation method for
distant speech recognition (DSR) of real conversational speech mixtures. A blind source …
distant speech recognition (DSR) of real conversational speech mixtures. A blind source …
USDnet: Unsupervised Speech Dereverberation via Neural Forward Filtering
ZQ Wang - arxiv preprint arxiv:2402.00820, 2024 - arxiv.org
In reverberant conditions with a single speaker, each far-field microphone records a
reverberant version of the same speaker signal at a different location. In over-determined …
reverberant version of the same speaker signal at a different location. In over-determined …
Unsupervised multi-channel separation and adaptation
A key challenge in machine learning is to generalize from training data to an application
domain of interest. This work extends the recently-proposed mixture invariant training (MixIT) …
domain of interest. This work extends the recently-proposed mixture invariant training (MixIT) …
Location as supervision for weakly supervised multi-channel source separation of machine sounds
In this work, we are interested in learning a model to separate sources that cannot be
recorded in isolation, such as parts of a machine that must run simultaneously in order for …
recorded in isolation, such as parts of a machine that must run simultaneously in order for …
[PDF][PDF] Mentoring-Reverse Mentoring for Unsupervised Multi-Channel Speech Source Separation.
Mentoring-reverse mentoring, which is a novel knowledge transfer framework for
unsupervised learning, is introduced in multi-channel speech source separation. This …
unsupervised learning, is introduced in multi-channel speech source separation. This …