Improving OCR Accuracy for Kazakh Handwriting Recognition Using GAN Models
Yeleussinov A. Amirgaliyev Y. Cherikbayeva L.
May 2023MDPI
Applied Sciences (Switzerland)
2023#13Issue 9
This paper aims to increase the accuracy of Kazakh handwriting text recognition (KHTR) using the generative adversarial network (GAN), where a handwriting word image generator and an image quality discriminator are constructed. In order to obtain a high-quality image of handwritten text, the multiple losses are intended to encourage the generator to learn the structural properties of the texts. In this case, the quality discriminator is trained on the basis of the relativistic loss function. Based on the proposed structure, the resulting document images not only preserve texture details but also generate different writer styles, which provides better OCR performance in public databases. With a self-created dataset, images of different types of handwriting styles were obtained, which will be used when training the network. The proposed approach allows for a character error rate (CER) of 11.15% and a word error rate (WER) of 25.65%.
character error rate , generative adversarial network , optical character recognition , word error rate
Text of the article Перейти на текст статьи
Faculty of Information Technology, Department of Computer Science, Al Farabi Kazakh National University, Almaty, 050010, Kazakhstan
Institute of Information and Computational Technologies, Almaty, 050010, Kazakhstan
Faculty of Information Technology
Institute of Information and Computational Technologies
10 лет помогаем публиковать статьи Международный издатель
Книга Публикация научной статьи Волощук 2026 Book Publication of a scientific article 2026