Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
[PDF][PDF] Recent advances in end-to-end automatic speech recognition
J Li - APSIPA Transactions on Signal and Information …, 2022 - nowpublishers.com
Recently, the speech community is seeing a significant trend of moving from deep neural
network based hybrid modeling to end-to-end (E2E) modeling for automatic speech …
network based hybrid modeling to end-to-end (E2E) modeling for automatic speech …
End-to-end speech recognition: A survey
In the last decade of automatic speech recognition (ASR) research, the introduction of deep
learning has brought considerable reductions in word error rate of more than 50% relative …
learning has brought considerable reductions in word error rate of more than 50% relative …
RWTH ASR Systems for LibriSpeech: Hybrid vs Attention--w/o Data Augmentation
We present state-of-the-art automatic speech recognition (ASR) systems employing a
standard hybrid DNN/HMM architecture compared to an attention-based encoder-decoder …
standard hybrid DNN/HMM architecture compared to an attention-based encoder-decoder …
Are multidimensional recurrent layers really necessary for handwritten text recognition?
J Puigcerver - 2017 14th IAPR international conference on …, 2017 - ieeexplore.ieee.org
Current state-of-the-art approaches to offline Handwritten Text Recognition extensively rely
on Multidimensional Long Short-Term Memory networks. However, these architectures …
on Multidimensional Long Short-Term Memory networks. However, these architectures …
Improved training of end-to-end attention models for speech recognition
Sequence-to-sequence attention-based models on subword units allow simple open-
vocabulary end-to-end speech recognition. In this work, we show that such models can …
vocabulary end-to-end speech recognition. In this work, we show that such models can …
Handwriting recognition with large multidimensional long short-term memory recurrent neural networks
Multidimensional long short-term memory recurrent neural networks achieve impressive
results for handwriting recognition. However, with current CPU-based implementations, their …
results for handwriting recognition. However, with current CPU-based implementations, their …
A comprehensive study of deep bidirectional LSTM RNNs for acoustic modeling in speech recognition
Recent experiments show that deep bidirectional long short-term memory (BLSTM) recurrent
neural network acoustic models outperform feedforward neural networks for automatic …
neural network acoustic models outperform feedforward neural networks for automatic …
Generating synthetic audio data for attention-based speech recognition systems
Recent advances in text-to-speech (TTS) led to the development of flexible multi-speaker
end-to-end TTS systems. We extend state-of-the-art attention-based automatic speech …
end-to-end TTS systems. We extend state-of-the-art attention-based automatic speech …
Attention based on-device streaming speech recognition with large speech corpus
In this paper, we present a new on-device automatic speech recognition (ASR) system
based on monotonic chunk-wise attention (MoChA) models trained with large (> 10K hours) …
based on monotonic chunk-wise attention (MoChA) models trained with large (> 10K hours) …
On using specaugment for end-to-end speech translation
This work investigates a simple data augmentation technique, SpecAugment, for end-to-end
speech translation. SpecAugment is a low-cost implementation method applied directly to …
speech translation. SpecAugment is a low-cost implementation method applied directly to …