- Academic Search

R Prabhavalkar, T Hori, TN Sainath… - … on Audio, Speech …, 2023 - ieeexplore.ieee.org

In the last decade of automatic speech recognition (ASR) research, the introduction of deep
learning has brought considerable reductions in word error rate of more than 50% relative …

Spara Citera Citerat av 180 Relaterade artiklar Alla 6 versionerna Bibliotekssökning

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Paraformer: Fast and accurate parallel transformer for non-autoregressive end-to-end speech recognition

Z Gao, S Zhang, I McLoughlin, Z Yan - arxiv preprint arxiv:2206.08317, 2022 - arxiv.org

Transformers have recently dominated the ASR field. Although able to yield good
performance, they involve an autoregressive (AR) decoder to generate tokens one by one …

Spara Citera Citerat av 102 Relaterade artiklar Alla 9 versionerna Se som HTML-version

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

A survey on non-autoregressive generation for neural machine translation and beyond

Y **ao, L Wu, J Guo, J Li, M Zhang… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org

Non-autoregressive (NAR) generation, which is first proposed in neural machine translation
(NMT) to speed up inference, has attracted much attention in both machine learning and …

Spara Citera Citerat av 93 Relaterade artiklar Alla 9 versionerna

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Bectra: Transducer-based end-to-end asr with bert-enhanced encoder

Y Higuchi, T Ogawa, T Kobayashi… - ICASSP 2023-2023 …, 2023 - ieeexplore.ieee.org

We present BERT-CTC-Transducer (BECTRA), a novel end-to-end automatic speech
recognition (E2E-ASR) model formulated by the transducer with a BERT-enhanced encoder …

Spara Citera Citerat av 15 Relaterade artiklar Alla 6 versionerna

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Deliberation of streaming rnn-transducer by non-autoregressive decoding

W Wang, K Hu, TN Sainath - ICASSP 2022-2022 IEEE …, 2022 - ieeexplore.ieee.org

We propose to deliberate the hypothesis alignment of a streaming RNN-T model with the
previously proposed Align-Refine non-autoregressive decoding method and its improved …

Spara Citera Citerat av 20 Relaterade artiklar Alla 4 versionerna

[Free GPT-4]
[DeepSeek]

[PDF] isca-archive.org

[PDF][PDF] Text-Only Domain Adaptation Based on Intermediate CTC.

H Sato, T Komori, T Mishima, Y Kawai, T Mochizuki… - Interspeech, 2022 - isca-archive.org

We propose a domain adaptation method that enables connectionist temporal classification
(CTC)-based end-to-end (E2E) automatic speech recognition (ASR) models to adapt to a …

Spara Citera Citerat av 12 Relaterade artiklar Alla 6 versionerna Se som HTML-version

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Semi-autoregressive streaming asr with label context

S Arora, G Saon, S Watanabe… - ICASSP 2024-2024 …, 2024 - ieeexplore.ieee.org

Non-autoregressive (NAR) modeling has gained significant interest in speech processing
since these models achieve dramatically lower inference time than autoregressive (AR) …

Spara Citera Citerat av 3 Relaterade artiklar Alla 4 versionerna

[Free GPT-4]
[DeepSeek]

[PDF] academia.edu

Non-autoregressive end-to-end automatic speech recognition incorporating downstream natural language processing

M Omachi, Y Fujita, S Watanabe… - ICASSP 2022-2022 …, 2022 - ieeexplore.ieee.org

We propose a fast and accurate end-to-end (E2E) model, which executes automatic speech
recognition (ASR) and downstream natural language processing (NLP) simultaneously. The …

Spara Citera Citerat av 8 Relaterade artiklar Alla 4 versionerna

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Improving Streaming End-to-End ASR on Transformer-Based Causal Models With Encoder States Revision Strategies

Z Li, H Miao, K Deng, G Cheng, S Tian, T Li… - arxiv preprint arxiv …, 2022 - arxiv.org

There is often a trade-off between performance and latency in streaming automatic speech
recognition (ASR). Traditional methods such as look-ahead and chunk-based methods …

Spara Citera Citerat av 6 Relaterade artiklar Alla 4 versionerna Se som HTML-version

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

LV-CTC: Non-autoregressive ASR with CTC and latent variable models

Y Fujita, S Watanabe, X Chang… - 2023 IEEE Automatic …, 2023 - ieeexplore.ieee.org

Non-autoregressive (NAR) models for automatic speech recognition (ASR) aim to achieve
high accuracy and fast inference by simplifying the autoregressive (AR) generation process …

Spara Citera Citerat av 1 Relaterade artiklar Alla 5 versionerna

Skapa alarm

Citera

Avancerad sökning

Har sparats i Mitt bibliotek

Streaming end-to-end ASR based on blockwise non-autoregressive models

End-to-end speech recognition: A survey

Paraformer: Fast and accurate parallel transformer for non-autoregressive end-to-end speech recognition

A survey on non-autoregressive generation for neural machine translation and beyond

Bectra: Transducer-based end-to-end asr with bert-enhanced encoder

Deliberation of streaming rnn-transducer by non-autoregressive decoding

[PDF][PDF] Text-Only Domain Adaptation Based on Intermediate CTC.

Semi-autoregressive streaming asr with label context

Non-autoregressive end-to-end automatic speech recognition incorporating downstream natural language processing

Improving Streaming End-to-End ASR on Transformer-Based Causal Models With Encoder States Revision Strategies

LV-CTC: Non-autoregressive ASR with CTC and latent variable models