A COMPUTATIONAL PIPELINE FOR LEXICAL AND THEMATIC ANALYSIS OF THE CODE OF ADMINISTRATIVE OFFENSES OF THE REPUBLIC OF KAZAKHSTAN


ВЫЧИСЛИТЕЛЬНЫЙ КОНВЕЙЕР ДЛЯ ЛЕКСИЧЕСКОГО И ТЕМАТИЧЕСКОГО АНАЛИЗА КОДЕКСА ОБ АДМИНИСТРАТИВНЫХ ПРАВОНАРУШЕНИЯХ РЕСПУБЛИКИ КАЗАХСТАН
Mukhsimbayev B. Pak A. Kuralbayev A.
2025Kazakh-British Technical University

Herald of the Kazakh British Technical UNiversity
2025#22Issue 4227 - 243 pp.

This study introduces a computational pipeline for the automated linguistic and structural analysis of legal texts, applied to the Code of Administrative Offenses of the Republic of Kazakhstan (CAO RK, K1400000235). The proposed workflow integrates data collection, text preprocessing, tokenization, keyword extraction, semantic clustering, and visualization using natural language processing (NLP) and statistical techniques implemented in Python. The pipeline unites lexical, thematic, and quantitative linguistic analyses into a coherent sequence that enables the identification of frequency distributions, semantic fields, and latent topics across the hierarchical structure of the Code (sections, chapters, and articles). The analysis of the CAO RK corpus revealed several distinctive linguistic patterns: a dominance of sanction and responsibility-related vocabulary (штраф, ответственность, правонарушение), high lexical density in chapters regulating economic and procedural offenses, and concentrated thematic clusters reflecting the normative-punitive orientation of administrative law. Visualization techniques such as frequency histograms, thematic heatmaps, and topic maps illustrate the potential of the pipeline for exploring legislative language quantitatively. Overall, the framework establishes a scalable foundation for comparative legal linguistics, automated legislative monitoring, and the modernization of legal analytics in Kazakhstan.

administrative law , computational legal linguistics , frequency analysis , legal informatics. Introduction , legal text analysis , natural language processing , topic modeling

Text of the article Перейти на текст статьи

Kazakh-British Technical University, Almaty, Kazakhstan

Kazakh-British Technical University

10 лет помогаем публиковать статьи Международный издатель

Книга Публикация научной статьи Волощук 2026 Book Publication of a scientific article 2026