Automatic speech recognition method based on deep learning approaches for Uzbek language

A Mukhamadiyev, I Khujayarov, O Djuraev, J Cho - Sensors, 2022 - mdpi.com
Communication has been an important aspect of human life, civilization, and globalization
for thousands of years. Biometric analysis, education, security, healthcare, and smart cities …

A study of transformer-based end-to-end speech recognition system for Kazakh language

M Orken, O Dina, A Keylan, T Tolganay, O Mohamed - Scientific reports, 2022 - nature.com
Today, the Transformer model, which allows parallelization and also has its own internal
attention, has been widely used in the field of speech recognition. The great advantage of …

Hybrid end-to-end model for Kazakh speech recognition

OZ Mamyrbayev, DO Oralbekova, K Alimhan… - International Journal of …, 2023 - Springer
Modern automatic speech recognition systems based on end-to-end (E2E) models show
good results in terms of the accuracy of language recognition, which have large corpuses for …

Identifying the influence of transfer learning method in developing an end-to-end automatic speech recognition system with a low data level

M Orken, A Keylan, O Dina… - … -European Journal of …, 2022 - papers.ssrn.com
Ensuring the best quality and performance of modern speech technologies, today, is
possible based on the widespread use of machine learning methods. The idea of this project …

Efficient conformer for agglutinative language ASR model using low-rank approximation and balanced softmax

T Guo, N Yolwas, W Slamu - Applied Sciences, 2023 - mdpi.com
Recently, the performance of end-to-end speech recognition has been further improved
based on the proposed Conformer framework, which has also been widely used in the field …

End-to-end neural automatic speech recognition system for low resource languages

S Dhahbi, N Saleem, S Bourouis, M Berrima… - Egyptian Informatics …, 2025 - Elsevier
The rising popularity of end-to-end (E2E) automatic speech recognition (ASR) systems can
be attributed to their ability to learn complex speech patterns directly from raw data …

CAST: Context-association architecture with simulated long-utterance training for mandarin speech recognition

Y Ming, B Lyu, Z Li - Speech Communication, 2023 - Elsevier
End-to-end (E2E) models are widely used because they significantly improve the
performance of automatic speech recognition (ASR). However, based on the limitations of …

Continuous Sign Language Recognition and Its Translation into Intonation-Colored Speech

N Amangeldy, A Ukenova, G Bekmanova… - Sensors, 2023 - mdpi.com
This article is devoted to solving the problem of converting sign language into a consistent
text with intonation markup for subsequent voice synthesis of sign phrases by speech with …

The Development of a Kazakh Speech Recognition Model Using a Convolutional Neural Network with Fixed Character Level Filters

N Kadyrbek, M Mansurova, A Shomanov… - Big Data and Cognitive …, 2023 - mdpi.com
This study is devoted to the transcription of human speech in the Kazakh language in
dynamically changing conditions. It discusses key aspects related to the phonetic structure …

Kazakh Speech Recognition: Wav2vec2.0 vs. Whisper

Z Kozhirbayev - Journal of Advances in Information Technology, 2023 - jait.us
In recent years, the progress made in neural models trained on extensive multilingual text or
speech data has shown great potential for improving the status of underresourced …