Hybrid end-to-end model for Kazakh speech recognition


Mamyrbayev O.Z. Oralbekova D.O. Alimhan K. Nuranbayeva B.M.
July 2023Springer

International Journal of Speech Technology
2023#26Issue 2261 - 270 pp.

Modern automatic speech recognition systems based on end-to-end (E2E) models show good results in terms of the accuracy of language recognition, which have large corpuses for several thousand hours of speech for system training. Such models require a very large amount of training data, which is problematic for low-resource languages like the Kazakh language. However, many studies have shown that the combination of connectionist temporal classification with other E2E models improves the performance of systems even with limited training data. In this regard, the speech corpus of the Kazakh language was assembled, and this corpus was expanded using the augmentation method. Our work presents the implementation of a joint model of CTC and the attention mechanism for recognition of Kazakh speech, which solves the problem of rapid decoding and training of the system. The results demonstrated that the proposed E2E model using language models improved the system performance and showed the best result on our dataset for the Kazakh language. As a result of the experiment, the system achieved competitive results in Kazakh speech recognition.

Attention , Automatic speech recognition , Connectionist temporal classification , End-to-end , Low resource language

Text of the article Перейти на текст статьи

Institute of Information and Computational Technologies CS MES RK, 28 Shevchenko Str., Almaty, Kazakhstan
Satbayev University, Almaty, Kazakhstan
Al-Farabi Kazakh National University, Almaty, Kazakhstan
L.N. Gumilyov Eurasian National University, Satpayev Str., 2, Nur-Sultan, 010008, Kazakhstan
Caspian University, Dostyk 85A, Almaty, Kazakhstan

Institute of Information and Computational Technologies CS MES RK
Satbayev University
Al-Farabi Kazakh National University
L.N. Gumilyov Eurasian National University
Caspian University

10 лет помогаем публиковать статьи Международный издатель

Книга Публикация научной статьи Волощук 2026 Book Publication of a scientific article 2026