Applications, promises, and pitfalls of deep learning for fluorescence image reconstruction

C Belthangady, LA Royer - Nature methods, 2019 - nature.com
Deep learning is becoming an increasingly important tool for image reconstruction in
fluorescence microscopy. We review state-of-the-art applications such as image restoration …

[PDF][PDF] A systematic literature review of machine and deep learning-based detection and classification methods for diseases related to the respiratory system

QMM Zarandah, SM Daud, SS Abu-Naser - Journal of Theoretical and …, 2023 - jatit.org
Deep Learning (DL) is a sub field of Machine Learning (ML) that has considerable
prospective in many areas of study like computer vision, image, and audio processing. A …

Learning complex spectral map** with gated convolutional recurrent networks for monaural speech enhancement

K Tan, DL Wang - IEEE/ACM Transactions on Audio, Speech …, 2019 - ieeexplore.ieee.org
Phase is important for perceptual quality of speech. However, it seems intractable to directly
estimate phase spectra through supervised learning due to their lack of spectrotemporal …

Mosnet: Deep learning based objective assessment for voice conversion

CC Lo, SW Fu, WC Huang, X Wang… - arxiv preprint arxiv …, 2019 - arxiv.org
Existing objective evaluation metrics for voice conversion (VC) are not always correlated
with human perception. Therefore, training VC models with such criteria may not effectively …

Weighted speech distortion losses for neural-network-based real-time speech enhancement

Y **a, S Braun, CKA Reddy, H Dubey… - ICASSP 2020-2020 …, 2020 - ieeexplore.ieee.org
This paper investigates several aspects of training a RNN (recurrent neural network) that
impact the objective and subjective quality of enhanced speech for real-time single-channel …

Mmdenselstm: An efficient combination of convolutional and recurrent neural networks for audio source separation

N Takahashi, N Goswami… - 2018 16th International …, 2018 - ieeexplore.ieee.org
Deep neural networks have become an indispensable technique for audio source
separation (SS). It was recently reported that a variant of CNN architecture called MM …

FRCRN: Boosting feature representation using frequency recurrence for monaural speech enhancement

S Zhao, B Ma, KN Watcharasupat… - ICASSP 2022-2022 …, 2022 - ieeexplore.ieee.org
Convolutional recurrent networks (CRN) integrating a convolutional encoder-decoder (CED)
structure and a recurrent structure have achieved promising performance for monaural …

DPCRN: Dual-path convolution recurrent network for single channel speech enhancement

X Le, H Chen, K Chen, J Lu - arxiv preprint arxiv:2107.05429, 2021 - arxiv.org
The dual-path RNN (DPRNN) was proposed to more effectively model extremely long
sequences for speech separation in the time domain. By splitting long sequences to smaller …

Remixit: Continual self-training of speech enhancement models via bootstrapped remixing

E Tzinis, Y Adi, VK Ithapu, B Xu… - IEEE Journal of …, 2022 - ieeexplore.ieee.org
We present RemixIT, a simple yet effective self-supervised method for training speech
enhancement without the need of a single isolated in-domain speech nor a noise waveform …

Improving perceptual quality by phone-fortified perceptual loss using wasserstein distance for speech enhancement

TA Hsieh, C Yu, SW Fu, X Lu, Y Tsao - arxiv preprint arxiv:2010.15174, 2020 - arxiv.org
Speech enhancement (SE) aims to improve speech quality and intelligibility, which are both
related to a smooth transition in speech segments that may carry linguistic information, eg …