A review of deep learning techniques for speech processing

A Mehrish, N Majumder, R Bharadwaj, R Mihalcea… - Information …, 2023 - Elsevier

The field of speech processing has undergone a transformative shift with the advent of deep
learning. The use of multiple processing layers has enabled the creation of models capable …

Cited by 240 - All 7 versions

[PDF] arxiv.org

A review of speaker diarization: Recent advances with deep learning

TJ Park, N Kanda, D Dimitriadis, KJ Han… - Computer Speech & …, 2022 - Elsevier

Speaker diarization is a task to label audio or video recordings with classes that correspond
to speaker identity, or in short, a task to identify “who spoke when”. In the early years …

Cited by 426 - All 7 versions

[HTML] aip.org

[HTML] A survey of sound source localization with deep learning methods

PA Grumiaux, S Kitić, L Girin, A Guérin - The Journal of the Acoustical …, 2022 - pubs.aip.org

This article is a survey of deep learning methods for single and multiple sound source
localization, with a focus on sound source localization in indoor environments, where …

Cited by 289 - All 11 versions

[PDF] arxiv.org

CHiME-6 challenge: Tackling multispeaker speech recognition for unsegmented recordings

S Watanabe, M Mandel, J Barker, E Vincent… - ar…

Complex spectral mapping for single- and multi-channel speech enhancement and robust ASR

ZQ Wang, P Wang, DL Wang - IEEE/ACM transactions on …, 2020 - ieeexplore.ieee.org

This study proposes a complex spectral mapping approach for single- and multi-channel
speech enhancement, where deep neural networks (DNNs) are used to predict the real and …

Cited by 222 - All 12 versions

[PDF] arxiv.org

Asteroid: the PyTorch-based audio source separation toolkit for researchers

M Pariente, S Cornell, J Cosentino… - arXiv preprint arXiv …, 2020 - arxiv.org

This paper describes Asteroid, the PyTorch-based audio source separation toolkit for
researchers. Inspired by the most successful neural source separation systems, it provides …

Cited by 172 - All 9 versions
