Multidimensional Visualization and AI-Driven Prediction Using Clinical and Biochemical Biomarkers in Premature Cardiovascular Aging


Abzaliyev K., Suleimenova M., Abzaliyeva S., Mansurova M., Shomanov A., Bugibayeva A., Tolemisova A., Kurmanova A., Nassyrova N.
October 2025, Multidisciplinary Digital Publishing Institute (MDPI)

Biomedicines
2025, Volume 13, Issue 10

Background: Cardiovascular diseases (CVDs) remain the primary cause of global mortality, with arterial hypertension, ischemic heart disease (IHD), and cerebrovascular accident (CVA) forming a progressive continuum from early risk factors to severe outcomes. While numerous studies focus on isolated biomarkers, few integrate multidimensional visualization with artificial intelligence to reveal hidden, clinically relevant patterns.
Methods: We conducted a comprehensive analysis of 106 patients using an integrated framework that combined clinical, biochemical, and lifestyle data. Parameters included renal function (glomerular filtration rate, cystatin C), inflammatory markers, lipid profile, enzymatic activity, and behavioral factors. After normalization and imputation, we applied correlation analysis, parallel coordinates visualization, t-distributed stochastic neighbor embedding (t-SNE) with k-means clustering, principal component analysis (PCA), and Random Forest modeling with SHAP (SHapley Additive exPlanations) interpretation. Bootstrap resampling was used to estimate 95% confidence intervals for mean absolute SHAP values, assessing feature stability.
Results: Consistent patterns across outcomes revealed impaired renal function, reduced physical activity, and high hypertension prevalence in IHD and CVA. t-SNE clustering achieved complete separation of a high-risk group (100% CVD-positive) from a predominantly low-risk group (7.8% CVD rate), demonstrating unsupervised validation of biomarker discriminative power. PCA confirmed multidimensional structure, while Random Forest identified renal function, hypertension status, and physical activity as dominant predictors, achieving robust performance (Accuracy 0.818; AUC-ROC 0.854). SHAP analysis identified arterial hypertension, BMI, and physical inactivity as dominant predictors, complemented by renal biomarkers (GFR, cystatin) and NT-proBNP.
Conclusions: This study pioneers the integration of multidimensional visualization and AI-driven analysis for CVD risk profiling, enabling interpretable, data-driven identification of high- and low-risk clusters. Despite the limited single-center cohort (n = 106) and cross-sectional design, the findings highlight the potential of interpretable models for precision prevention and transparent decision support in cardiovascular aging research.
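The bootstrap step mentioned in the Methods can be illustrated with a minimal sketch. The abstract does not state which resampling scheme or interval type the authors used, so the percentile bootstrap over patients shown here is an assumption, as are the function name `bootstrap_mean_abs_ci`, its parameters, and the synthetic input values (which are random numbers, not study data).

```python
import random
import statistics


def bootstrap_mean_abs_ci(values, n_resamples=2000, alpha=0.05, seed=0):
    """Percentile bootstrap CI for the mean absolute value of `values`.

    Hypothetical helper: `values` stands in for one feature's
    per-patient SHAP values.
    """
    rng = random.Random(seed)
    abs_values = [abs(v) for v in values]
    n = len(abs_values)
    means = []
    for _ in range(n_resamples):
        # Resample patients with replacement, keeping the sample size fixed.
        resample = rng.choices(abs_values, k=n)
        means.append(statistics.fmean(resample))
    means.sort()
    # Take the alpha/2 and 1 - alpha/2 quantiles of the bootstrap means.
    lower = means[int((alpha / 2) * n_resamples)]
    upper = means[int((1 - alpha / 2) * n_resamples) - 1]
    return statistics.fmean(abs_values), lower, upper


# Synthetic stand-in for one feature's SHAP values across 106 patients.
demo_rng = random.Random(42)
shap_values = [demo_rng.gauss(0.0, 0.1) for _ in range(106)]
point, lower, upper = bootstrap_mean_abs_ci(shap_values)
print(f"mean |SHAP| = {point:.3f}, 95% CI [{lower:.3f}, {upper:.3f}]")
```

Under these assumptions, a narrow interval that stays well above zero would suggest a feature whose importance is stable across resamples, while a wide interval would suggest that its ranking depends on which patients happen to be in the sample.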

biomarkers, machine learning, parallel coordinates, premature aging, principal component analysis, random forest, t-SNE


Department of Internal Medicine, Faculty of Medicine and Healthcare, Al-Farabi Kazakh National University, Almaty, 050040, Kazakhstan
Department of Big Data and Artificial Intelligence, Faculty of Information Technology, Al-Farabi Kazakh National University, Almaty, 050040, Kazakhstan
Department of Postdoctoral Research, Nazarbayev University, Astana, 010000, Kazakhstan
LLP «International Research Institute for Postgraduate Education», Almaty, 050060, Kazakhstan
Department of Science and International Relations, Non-State Educational Institution «Kazakh-Russian Medical University», Almaty, 050040, Kazakhstan

