KazCausal: Manual Annotation of Causal Constructions in the Kazakh Language
Taberkhan R. Sambetbayeva M.
2025Institute of Electrical and Electronics Engineers Inc.
International Conference on Computer Science and Engineering, UBMK
2025Issue 2025463 - 467 pp.
This article presents KazCausal, a manual annotation scheme developed to identify causal constructions in Kazakh language texts. The research aims to create a formalized corpus that reflects the morphological, syntactic, and semantic aspects of causality expression in the agglutinative structure of the Kazakh language. The scheme supports multiple labels and overlapping relationships, such as direct consequence, motivation, sanction, emotional conditioning, and indefinite causality. The corpus includes 681 sentences, 150 of which are marked as containing causal relationships. Inter-annotator agreement analysis demonstrates a high level of consistency. The results can be used in NLP tasks, including model training and automatic extraction of causal relationships.
agglutinative morphology , causality , corpus annotation , NLP , semantic relations , The Kazakh language
Text of the article Перейти на текст статьи
L.N. Gumilyov Eurasian National University, Departament of Information Systems, Astana, Kazakhstan
L.N. Gumilyov Eurasian National University
10 лет помогаем публиковать статьи Международный издатель
Книга Публикация научной статьи Волощук 2026 Book Publication of a scientific article 2026