- Academic Search

Recent advances in end-to-end automatic speech recognition

J Li - APSIPA Transactions on Signal and Information …, 2022 - nowpublishers.com

Recently, the speech community is seeing a significant trend of moving from deep neural
network based hybrid modeling to end-to-end (E2E) modeling for automatic speech …

Graph neural networks for contextual ASR with the tree-constrained pointer generator

G Sun, C Zhang, PC Woodland - IEEE/ACM Transactions on …, 2024 - ieeexplore.ieee.org

Incorporating biasing words obtained through contextual knowledge is paramount in
automatic speech recognition (ASR) applications. This paper proposes an innovative …

Selective biasing with trie-based contextual adapters for personalised speech recognition using neural transducers

P Harding, S Tong, S Wiesler - 2023 - amazon.science

Neural transducer ASR models achieve state of the art accuracy on many tasks, however
rare word recognition poses a particular challenge as models often fail to recognise words …

Automatic Speech Recognition Design Modeling

K Babu Rao, B Mopuru, M Jawarneh… - Conversational …, 2024 - Wiley Online Library

The term “automatic speech recognition” refers to the procedure by which an auditory signal
of spoken words can be converted into text. Voice recognition is another term that may be …

Incorporating Class-based Language Model for Named Entity Recognition in Factorized Neural Transducer

P Wang, Y Yang, Z Liang, T Tan, S Zhang… - arXiv preprint arXiv …, 2023 - arxiv.org

In spite of the excellent strides made by end-to-end (E2E) models in speech recognition in
recent years, named entity recognition is still challenging but critical for semantic …

Integrating Language Models into Direct Speech Translation: An Inference-Time Solution to Control Gender Inflection

D Fucci, M Gaido, S Papi, M Cettolo, M Negri… - arXiv preprint arXiv …, 2023 - arxiv.org

When translating words referring to the speaker, speech translation (ST) systems should not
resort to default masculine generics nor rely on potentially misleading vocal traits. Rather …

Entropy-Based Dynamic Rescoring with Language Model in E2E ASR Systems

Z Gong, D Saito, N Minematsu - Applied Sciences, 2022 - mdpi.com

Language models (LM) have played crucial roles in automatic speech recognition (ASR),
whether as an essential part of a conventional ASR system composed of an acoustic model …

DOC-RAG: ASR Language Model Personalization with Domain-Distributed Co-occurrence Retrieval Augmentation

P Mathur, Z Liu, K Li, Y Ma, G Karen… - Proceedings of the …, 2024 - aclanthology.org

We propose DOC-RAG (Domain-distributed Co-occurrence Retrieval Augmentation)
for ASR language model personalization, aiming to improve the automatic speech …

Contextual density ratio for language model biasing of sequence to sequence ASR systems

Recent advances in end-to-end automatic speech recognition