- Academic Search

Deep neural network techniques for monaural speech enhancement and separation: state of the art analysis

P Ochieng - Artificial Intelligence Review, 2023 - Springer

Deep neural networks (DNN) techniques have become pervasive in domains such as
natural language processing and computer vision. They have achieved great success in …

Tallenna Viittaa Viittausten määrä 27 Aiheeseen liittyviä artikkeleita Kaikki 10 versiota

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Dual-path rnn: efficient long sequence modeling for time-domain single-channel speech separation

Y Luo, Z Chen, T Yoshioka - ICASSP 2020-2020 IEEE …, 2020 - ieeexplore.ieee.org

Recent studies in deep learning-based speech separation have proven the superiority of
time-domain approaches to conventional time-frequency-based methods. Unlike the time …

Tallenna Viittaa Viittausten määrä 896 Aiheeseen liittyviä artikkeleita Kaikki 6 versiota

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Librimix: An open-source dataset for generalizable speech separation

J Cosentino, M Pariente, S Cornell, A Deleforge… - arxiv preprint arxiv …, 2020 - arxiv.org

In recent years, wsj0-2mix has become the reference dataset for single-channel speech
separation. Most deep learning-based speech separation models today are benchmarked …

Tallenna Viittaa Viittausten määrä 320 Aiheeseen liittyviä artikkeleita Kaikki 5 versiota HTML-versio

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Continuous speech separation: Dataset and analysis

Z Chen, T Yoshioka, L Lu, T Zhou… - ICASSP 2020-2020 …, 2020 - ieeexplore.ieee.org

This paper describes a dataset and protocols for evaluating continuous speech separation
algorithms. Most prior speech separation studies use pre-segmented audio signals, which …

Tallenna Viittaa Viittausten määrä 241 Aiheeseen liittyviä artikkeleita Kaikki 3 versiota

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Asteroid: the PyTorch-based audio source separation toolkit for researchers

M Pariente, S Cornell, J Cosentino… - arxiv preprint arxiv …, 2020 - arxiv.org

This paper describes Asteroid, the PyTorch-based audio source separation toolkit for
researchers. Inspired by the most successful neural source separation systems, it provides …

Tallenna Viittaa Viittausten määrä 172 Aiheeseen liittyviä artikkeleita Kaikki 9 versiota HTML-versio

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

On loss functions for supervised monaural time-domain speech enhancement

M Kolbæk, ZH Tan, SH Jensen… - IEEE/ACM Transactions …, 2020 - ieeexplore.ieee.org

Many deep learning-based speech enhancement algorithms are designed to minimize the
mean-square error (MSE) in some transform domain between a predicted and a target …

Tallenna Viittaa Viittausten määrä 160 Aiheeseen liittyviä artikkeleita Kaikki 8 versiota

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Improving speaker discrimination of target speech extraction with time-domain speakerbeam

M Delcroix, T Ochiai, K Zmolikova… - ICASSP 2020-2020 …, 2020 - ieeexplore.ieee.org

Target speech extraction, which extracts a single target source in a mixture given clues
about the target speaker, has attracted increasing attention. We have recently proposed …

Tallenna Viittaa Viittausten määrä 135 Aiheeseen liittyviä artikkeleita Kaikki 6 versiota

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Multi-modal multi-channel target speech separation

R Gu, SX Zhang, Y Xu, L Chen… - IEEE Journal of …, 2020 - ieeexplore.ieee.org

Target speech separation refers to extracting a target speaker's voice from an overlapped
audio of simultaneous talkers. Previously the use of visual modality for target speech …

Tallenna Viittaa Viittausten määrä 117 Aiheeseen liittyviä artikkeleita Kaikki 6 versiota

[Free GPT-4]
[DeepSeek]

[PDF] iop.org

[PDF][PDF] The Intel neuromorphic DNS challenge

J Timcheck, SB Shrestha, DBD Rubin… - Neuromorphic …, 2023 - iopscience.iop.org

A critical enabler for progress in neuromorphic computing research is the ability to
transparently evaluate different neuromorphic solutions on important tasks and to compare …

Tallenna Viittaa Viittausten määrä 32 Aiheeseen liittyviä artikkeleita Kaikki 5 versiota

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

A consolidated view of loss functions for supervised deep learning-based speech enhancement

S Braun, I Tashev - 2021 44th International Conference on …, 2021 - ieeexplore.ieee.org

Deep learning-based speech enhancement for real-time applications recently made large
advancements. Due to the lack of a tractable perceptual optimization target, many myths …

Tallenna Viittaa Viittausten määrä 95 Aiheeseen liittyviä artikkeleita Kaikki 5 versiota

Luo ilmoitus

Viittaa

Tarkennettu haku

Tallennettu omaan kirjastoon

A comprehensive study of speech separation: spectrogram vs waveform separation

Deep neural network techniques for monaural speech enhancement and separation: state of the art analysis

Dual-path rnn: efficient long sequence modeling for time-domain single-channel speech separation

Librimix: An open-source dataset for generalizable speech separation

Continuous speech separation: Dataset and analysis

Asteroid: the PyTorch-based audio source separation toolkit for researchers

On loss functions for supervised monaural time-domain speech enhancement

Improving speaker discrimination of target speech extraction with time-domain speakerbeam

Multi-modal multi-channel target speech separation

[PDF][PDF] The Intel neuromorphic DNS challenge

A consolidated view of loss functions for supervised deep learning-based speech enhancement