- Academic Search

Y Higuchi, K Karube, T Ogawa… - ICASSP 2022-2022 …, 2022 - ieeexplore.ieee.org

In end-to-end automatic speech recognition (ASR), a model is expected to implicitly learn
representations suitable for recognizing a word-level sequence. However, the huge …

Save Cite Cited by 33 Related articles All 7 versions Free GPT-4

[Free GPT-4]

[PDF] github.io

Disordered speech recognition considering low resources and abnormal articulation

Y Lin, J Dang, L Wang, S Li, C Ding - Speech Communication, 2023 - Elsevier

The success of automatic speech recognition (ASR) benefits a great number of healthy
people, but not people with disorders. The speech disordered may truly need support from …

Save Cite Cited by 5 Related articles All 3 versions Free GPT-4

[Free GPT-4]

[PDF] github.io

PhISANet: Phonetically Informed Speech Animation Network

S Medina, SL Taylor, C Stoll, G Edwards… - ICASSP 2024-2024 …, 2024 - ieeexplore.ieee.org

Realistic animation is crucial for immersive and seamless human-avatar interactions as
digital avatars become more prevalent. This work presents PhISANet, an encoder-decoder …

[Free GPT-4]

[PDF] arxiv.org

Phonetic-assisted Multi-Target Units Modeling for Improving Conformer-Transducer ASR system

L Li, D Xu, H Wei, Y Long - ar** realistic facial animations of a person from a speech signal …

[Free GPT-4]

[PDF] hal.science

Reconnaissance automatique de la parole d'enfants apprenant· e· s lecteur· ice· s en salle de classe: modélisation acoustique de phonèmes

L Gelin - 2022 - theses.hal.science

À travers ces travaux de thèse, nous cherchons à perfectionner les transcriptions
phonétiques de lectures orales d'enfants apprenant· e· s lecteur· rice· s réalisées en …

Save Cite Cited by 2 Related articles All 9 versions Free GPT-4 View as HTML

Create alert

Cite

Advanced search

Saved to My library

Char+ CV-CTC: combining graphemes and consonant/vowel units for CTC-based ASR using Multitask...

Hierarchical conditional end-to-end asr with ctc and multi-granular subword units

Disordered speech recognition considering low resources and abnormal articulation

PhISANet: Phonetically Informed Speech Animation Network

Phonetic-assisted Multi-Target Units Modeling for Improving Conformer-Transducer ASR system

Reconnaissance automatique de la parole d'enfants apprenant· e· s lecteur· ice· s en salle de classe: modélisation acoustique de phonèmes