- Academic Search

J Li - APSIPA Transactions on Signal and Information …, 2022 - nowpublishers.com

Recently, the speech community is seeing a significant trend of moving from deep neural
network based hybrid modeling to end-to-end (E2E) modeling for automatic speech …

Speichern Zitieren Zitiert von: 438 Ähnliche Artikel Alle 7 Versionen HTML-Version

[Free GPT-4]

[PDF] dtu.dk

Self-supervised speech representation learning: A review

A Mohamed, H Lee, L Borgholt… - IEEE Journal of …, 2022 - ieeexplore.ieee.org

Although supervised deep learning has revolutionized speech and audio processing, it has
necessitated the building of specialist models for individual tasks and application scenarios …

Speichern Zitieren Zitiert von: 401 Ähnliche Artikel Alle 10 Versionen

[Free GPT-4]

[PDF] arxiv.org

Neural machine translation for low-resource languages: A survey

S Ranathunga, ESA Lee, M Prifti Skenduli… - ACM Computing …, 2023 - dl.acm.org

Neural Machine Translation (NMT) has seen tremendous growth in the last ten years since
the early 2000s and has already entered a mature phase. While considered the most widely …

Speichern Zitieren Zitiert von: 276 Ähnliche Artikel Alle 6 Versionen

[Free GPT-4]

[PDF] ieee.org

End-to-end speech recognition: A survey

R Prabhavalkar, T Hori, TN Sainath… - … on Audio, Speech …, 2023 - ieeexplore.ieee.org

In the last decade of automatic speech recognition (ASR) research, the introduction of deep
learning has brought considerable reductions in word error rate of more than 50% relative …

Speichern Zitieren Zitiert von: 174 Ähnliche Artikel Alle 6 Versionen

[Free GPT-4]

[PDF] arxiv.org

Speecht5: Unified-modal encoder-decoder pre-training for spoken language processing

J Ao, R Wang, L Zhou, C Wang, S Ren, Y Wu… - arxiv preprint arxiv …, 2021 - arxiv.org

Motivated by the success of T5 (Text-To-Text Transfer Transformer) in pre-trained natural
language processing models, we propose a unified-modal SpeechT5 framework that …

Speichern Zitieren Zitiert von: 249 Ähnliche Artikel Alle 6 Versionen HTML-Version

[Free GPT-4]

[PDF] arxiv.org

Pushing the limits of semi-supervised learning for automatic speech recognition

Y Zhang, J Qin, DS Park, W Han, CC Chiu… - arxiv preprint arxiv …, 2020 - arxiv.org

We employ a combination of recent developments in semi-supervised learning for automatic
speech recognition to obtain state-of-the-art results on LibriSpeech utilizing the unlabeled …

Speichern Zitieren Zitiert von: 384 Ähnliche Artikel Alle 2 Versionen HTML-Version

[Free GPT-4]

[PDF] arxiv.org

Bigssl: Exploring the frontier of large-scale semi-supervised learning for automatic speech recognition

Y Zhang, DS Park, W Han, J Qin… - IEEE Journal of …, 2022 - ieeexplore.ieee.org

We summarize the results of a host of efforts using giant automatic speech recognition (ASR)
models pre-trained using large, diverse unlabeled datasets containing approximately a …

Speichern Zitieren Zitiert von: 201 Ähnliche Artikel Alle 4 Versionen

[Free GPT-4]

[PDF] arxiv.org

Specaugment: A simple data augmentation method for automatic speech recognition

DS Park, W Chan, Y Zhang, CC Chiu, B Zoph… - arxiv preprint arxiv …, 2019 - arxiv.org

We present SpecAugment, a simple data augmentation method for speech recognition.
SpecAugment is applied directly to the feature inputs of a neural network (ie, filter bank …

Speichern Zitieren Zitiert von: 4410 Ähnliche Artikel Alle 8 Versionen HTML-Version

[Free GPT-4]

[PDF] arxiv.org

Masked language model scoring

J Salazar, D Liang, TQ Nguyen, K Kirchhoff - arxiv preprint arxiv …, 2019 - arxiv.org

Pretrained masked language models (MLMs) require finetuning for most NLP tasks. Instead,
we evaluate MLMs out of the box via their pseudo-log-likelihood scores (PLLs), which are …

Speichern Zitieren Zitiert von: 580 Ähnliche Artikel Alle 7 Versionen HTML-Version

[Free GPT-4]

[PDF] arxiv.org

Hierarchical neural story generation

A Fan, M Lewis, Y Dauphin - arxiv preprint arxiv:1805.04833, 2018 - arxiv.org

We explore story generation: creative systems that can build coherent and fluent passages
of text about a topic. We collect a large dataset of 300K human-written stories paired with …

Speichern Zitieren Zitiert von: 1810 Ähnliche Artikel Alle 6 Versionen HTML-Version

Alert erstellen

Zitieren

Erweiterte Suche

In „Meine Bibliothek“ gespeichert

On using monolingual corpora in neural machine translation

[PDF][PDF] Recent advances in end-to-end automatic speech recognition

Self-supervised speech representation learning: A review

Neural machine translation for low-resource languages: A survey

End-to-end speech recognition: A survey

Speecht5: Unified-modal encoder-decoder pre-training for spoken language processing

Pushing the limits of semi-supervised learning for automatic speech recognition

Bigssl: Exploring the frontier of large-scale semi-supervised learning for automatic speech recognition

Specaugment: A simple data augmentation method for automatic speech recognition

Masked language model scoring

Hierarchical neural story generation