- Academic Search

[PDF][PDF] Recent advances in end-to-end automatic speech recognition

J Li - APSIPA Transactions on Signal and Information …, 2022 - nowpublishers.com

Recently, the speech community is seeing a significant trend of moving from deep neural
network based hybrid modeling to end-to-end (E2E) modeling for automatic speech …

Speichern Zitieren Zitiert von: 440 Ähnliche Artikel Alle 7 Versionen HTML-Version

[Free GPT-4]

[PDF] ieee.org

Deep spoken keyword spotting: An overview

I López-Espejo, ZH Tan, JHL Hansen, J Jensen - IEEE Access, 2021 - ieeexplore.ieee.org

Spoken keyword spotting (KWS) deals with the identification of keywords in audio streams
and has become a fast-growing technology thanks to the paradigm shift introduced by deep …

Speichern Zitieren Zitiert von: 141 Ähnliche Artikel Alle 7 Versionen

[Free GPT-4]

[PDF] arxiv.org

Deep learning for audio signal processing

H Purwins, B Li, T Virtanen, J Schlüter… - IEEE Journal of …, 2019 - ieeexplore.ieee.org

Given the recent surge in developments of deep learning, this paper provides a review of the
state-of-the-art deep learning techniques for audio signal processing. Speech, music, and …

Speichern Zitieren Zitiert von: 922 Ähnliche Artikel Alle 7 Versionen

[Free GPT-4]

[PDF] arxiv.org

Deep learning for environmentally robust speech recognition: An overview of recent developments

Z Zhang, J Geiger, J Pohjalainen, AED Mousa… - ACM Transactions on …, 2018 - dl.acm.org

Eliminating the negative effect of non-stationary environmental noise is a long-standing
research topic for automatic speech recognition but still remains an important challenge …

Speichern Zitieren Zitiert von: 428 Ähnliche Artikel Alle 10 Versionen

[Free GPT-4]

[PDF] arxiv.org

The pytorch-kaldi speech recognition toolkit

M Ravanelli, T Parcollet… - ICASSP 2019-2019 IEEE …, 2019 - ieeexplore.ieee.org

The availability of open-source software is playing a remarkable role in the popularization of
speech recognition and deep learning. Kaldi, for instance, is nowadays an established …

Speichern Zitieren Zitiert von: 301 Ähnliche Artikel Alle 9 Versionen

Multichannel signal processing with deep neural networks for automatic speech recognition

TN Sainath, RJ Weiss, KW Wilson, B Li… - … on Audio, Speech …, 2017 - ieeexplore.ieee.org

Multichannel automatic speech recognition (ASR) systems commonly separate speech
enhancement, including localization, beamforming, and postfiltering, from acoustic …

Speichern Zitieren Zitiert von: 286 Ähnliche Artikel Alle 5 Versionen

[Free GPT-4]

[PDF] arxiv.org

Far-field automatic speech recognition

R Haeb-Umbach, J Heymann, L Drude… - Proceedings of the …, 2020 - ieeexplore.ieee.org

The machine recognition of speech spoken at a distance from the microphones, known as
far-field automatic speech recognition (ASR), has received a significant increase in attention …

Speichern Zitieren Zitiert von: 121 Ähnliche Artikel Alle 8 Versionen

[Free GPT-4]

[PDF] uni-paderborn.de

Speech processing for digital home assistants: Combining signal processing with deep-learning techniques

R Haeb-Umbach, S Watanabe… - IEEE Signal …, 2019 - ieeexplore.ieee.org

Once a popular theme of futuristic science fiction or far-fetched technology forecasts, digital
home assistants with a spoken language interface have become a ubiquitous commodity …

Speichern Zitieren Zitiert von: 199 Ähnliche Artikel Alle 9 Versionen

[Free GPT-4]

[PDF] arxiv.org

FaSNet: Low-latency adaptive beamforming for multi-microphone audio processing

Y Luo, C Han, N Mesgarani, E Ceolini… - 2019 IEEE automatic …, 2019 - ieeexplore.ieee.org

Beamforming has been extensively investigated for multi-channel audio processing tasks.
Recently, learning-based beamforming methods, sometimes called neural beamformers …

Speichern Zitieren Zitiert von: 166 Ähnliche Artikel Alle 6 Versionen

[Free GPT-4]

[PDF] vut.cz

Single channel target speaker extraction and recognition with speaker beam

M Delcroix, K Zmolikova, K Kinoshita… - … on acoustics, speech …, 2018 - ieeexplore.ieee.org

This paper addresses the problem of single channel speech recognition of a target speaker
in a mixture of speech signals. We propose to exploit auxiliary speaker information provided …

Speichern Zitieren Zitiert von: 227 Ähnliche Artikel Alle 5 Versionen

Zitieren

Erweiterte Suche

In „Meine Bibliothek“ gespeichert

[PDF][PDF] Recent advances in end-to-end automatic speech recognition

Deep spoken keyword spotting: An overview

Deep learning for audio signal processing

Deep learning for environmentally robust speech recognition: An overview of recent developments

The pytorch-kaldi speech recognition toolkit

Multichannel signal processing with deep neural networks for automatic speech recognition

Far-field automatic speech recognition

Speech processing for digital home assistants: Combining signal processing with deep-learning techniques

FaSNet: Low-latency adaptive beamforming for multi-microphone audio processing

Single channel target speaker extraction and recognition with speaker beam