Google Tudós

Spellmapper: A non-autoregressive neural spellchecker for asr customization with candidate retrieval based on n-gram map**s

A Antonova, E Bakhturina, B Ginsburg - arxiv preprint arxiv:2306.02317, 2023 - arxiv.org

Contextual spelling correction models are an alternative to shallow fusion to improve
automatic speech recognition (ASR) quality given user vocabulary. To deal with large user …

Mentés Hivatkozás Idézetek száma: 6 Kapcsolódó cikkek Mind a(z) 6 változat HTML-változat

[Free GPT-4]

[PDF] isca-archive.org

[PDF][PDF] NeMo Forced Aligner and its application to word alignment for subtitle generation

E Rastorgueva, V Lavrukhin, B Ginsburg - Proc. INTERSPEECH, 2023 - isca-archive.org

Abstract We present NeMo Forced Aligner (NFA): an efficient and accurate forced aligner
which is part of the NeMo conversational AI open-source toolkit. NFA can produce token …

Mentés Hivatkozás Idézetek száma: 5 Kapcsolódó cikkek Mind a(z) 2 változat HTML-változat

[Free GPT-4]

[PDF] aclanthology.org

Revisiting automatic speech recognition for tamil and hindi connected number recognition

R Mishra, SRG Boopathy, M Ravikiran… - Proceedings of the …, 2023 - aclanthology.org

Abstract Automatic Speech Recognition and its applications are rising in popularity across
applications with reasonable inference results. Recent state-of-the-art approaches, often …

Mentés Hivatkozás Idézetek száma: 2 Kapcsolódó cikkek Mind a(z) 2 változat HTML-változat

[Free GPT-4]

[PDF] arxiv.org

Building and curating conversational corpora for diversity-aware language science and technology

A Liesenfeld, M Dingemanse - arxiv preprint arxiv:2203.03399, 2022 - arxiv.org

We present an analysis pipeline and best practice guidelines for building and curating
corpora of everyday conversation in diverse languages. Surveying language documentation …

Mentés Hivatkozás Idézetek száma: 7 Kapcsolódó cikkek Mind a(z) 9 változat HTML-változat

Everyday conversations: a comparative study of expert transcriptions and ASR outputs at a lexical level

T Sherstinova, R Kolobov, N Mikhaylovskiy - International Conference on …, 2023 - Springer

The study examines the outcomes of automatic speech recognition (ASR) applied to field
recordings of daily Russian speech. Everyday conversations, captured in real-life …

Mentés Hivatkozás Idézetek száma: 5 Kapcsolódó cikkek Mind a(z) 2 változat

Automatic Time Alignment Generation For End-to-End ASR Using Acoustic Probability Modelling

D Jiang, C Zhang, PC Woodland - 2024 IEEE Spoken …, 2024 - ieeexplore.ieee.org

End-to-end trainable (E2E) automatic speech recognition (ASR) models can achieve low
error rates, but unlike hidden Markov model (HMM)-based systems they cannot naturally …

Mentés Hivatkozás Kapcsolódó cikkek

[Free GPT-4]

[HTML] mdpi.com

[HTML][HTML] Methodology for Obtaining High-Quality Speech Corpora

A Wieczorkowska - Applied Sciences, 2025 - mdpi.com

Speech-based communication between users and machines is a very lively branch of
research that covers speech recognition, synthesis, and, generally, natural language …

Mentés Hivatkozás Kapcsolódó cikkek Tárolt változat

[Free GPT-4]

[PDF] arxiv.org

An analysis of large speech models-based representations for speech emotion recognition

AB Stânea, V Strilețchi, C Strilețchi… - … Conference on Speech …, 2023 - ieeexplore.ieee.org

Large speech models-derived features have recently shown increased performance over
signal-based features across multiple downstream tasks, even when the networks are not …

Mentés Hivatkozás Idézetek száma: 1 Kapcsolódó cikkek Mind a(z) 2 változat

[Free GPT-4]

[PDF] eurasip.org

Validation of Speech Data for Training Automatic Speech Recognition Systems

J Krizaj, JZ Gros, S Dobrisek - 2022 30th European Signal …, 2022 - ieeexplore.ieee.org

Recent automatic speech recognition systems are largely based on deep neural networks
that need large amounts of labelled speech data to train. This can be a problem, especially …

Mentés Hivatkozás Idézetek száma: 1 Kapcsolódó cikkek Mind a(z) 3 változat

Értesítés létrehozása

Hivatkozás

Speciális keresés

Mentve a Saját könyvtárba

A toolbox for construction and analysis of speech datasets

Spellmapper: A non-autoregressive neural spellchecker for asr customization with candidate retrieval based on n-gram map**s

[PDF][PDF] NeMo Forced Aligner and its application to word alignment for subtitle generation

Revisiting automatic speech recognition for tamil and hindi connected number recognition

Building and curating conversational corpora for diversity-aware language science and technology

Everyday conversations: a comparative study of expert transcriptions and ASR outputs at a lexical level

Automatic Time Alignment Generation For End-to-End ASR Using Acoustic Probability Modelling

[HTML][HTML] Methodology for Obtaining High-Quality Speech Corpora

An analysis of large speech models-based representations for speech emotion recognition

Validation of Speech Data for Training Automatic Speech Recognition Systems