Development of an Intelligent Information Retrieval System Based on Ontology, Linguistic Algorithms and Large Language Models


Mukanova A. Nazyrova A. Zulkhazhav A. Lamasheva Z. Dauletkaliyeva A.
November 2025Multidisciplinary Digital Publishing Institute (MDPI)

Applied Sciences (Switzerland)
2025#15Issue 22

This study proposes a hybrid semantic question-answering (QA) system for the Kazakh language that integrates ontological modeling, linguistic processing, and large language models (LLMs). The approach combines the structured reasoning of ontologies with the contextual understanding of neural language models to improve accuracy and interpretability in low-resource settings. The system includes five core modules—user interface, linguistic analyzer, LLM-based knowledge processor, ontology manager, and knowledge base—interacting through a unified architecture. A corpus of over 50,000 question–answer pairs was created for evaluation. Experimental results demonstrate significant improvements in precision, recall, F1, and mean reciprocal rank (MRR) compared with BERT-QA and ontology-only baselines. Statistical validation confirms the reliability and scalability of the hybrid model. The system is deployed as a functional web platform at The system is deployed as a functional web platform that supports semantic question answering for the Kazakh language.

artificial intelligence , ChatGPT , information systems , intelligent search , linguistic analysis , LLM , ontology

Text of the article Перейти на текст статьи

Higher School of Information Technology and Engineering, Astana International University, 8 Kabanbay Batyr Av., Astana, 010000, Kazakhstan
Faculty of Information Technologies, L.N. Gumilyov Eurasian National University, 2 Satpayev Str., Astana, 010008, Kazakhstan

Higher School of Information Technology and Engineering
Faculty of Information Technologies

10 лет помогаем публиковать статьи Международный издатель

Книга Публикация научной статьи Волощук 2026 Book Publication of a scientific article 2026