pAtbP-EnC: Identifying Anti-Tubercular Peptides Using Multi-Feature Representation and Genetic Algorithm-Based Deep Ensemble Model
Akbar S., Raza A., Shloul T.A., Ahmad A., Saeed A., Ghadi Y.Y., Mamyrbayev O., Tag-Eldin E.
2023, Institute of Electrical and Electronics Engineers Inc.
IEEE Access
2023, vol. 11, pp. 137099-137114
Mycobacterium tuberculosis, a highly dangerous human pathogen, is the causative agent of tuberculosis (TB), which affects nearly 33% of the global population. With the increasing prevalence of multidrug-resistant TB, there is a need for novel and efficacious alternative therapies. Peptide therapies have emerged as a favorable alternative because of their remarkable specificity in targeting cells without affecting healthy cells. However, experimental methods for identifying anti-tubercular peptides (AtbPs) are labor-intensive and costly, and accurate prediction of AtbPs has become challenging due to the large number of peptide samples. In this paper, we propose an ensemble learning model that improves prediction outcomes by addressing the limitations of individual learning models. We numerically encode the peptide training samples using four distinct representation methods: AAindex, Composition/Transition/Distribution, Dipeptide Deviation from Expected Mean, and Enhanced Grouped Amino Acid Composition. The feature vectors extracted by these methods are fused into a compact vector. We evaluate the prediction rates of three different classification models using both the individual and the heterogeneous vectors. Furthermore, we enhance the prediction and training capabilities of the proposed model by using the predicted labels of the individual classifiers to build a deep ensemble model via a genetic algorithm. Evaluated on both training and independent datasets, the proposed ensemble learner achieves accuracies of 97.80%, 95.13%, 93.91%, and 94.17% on the RD training, MD training, RD independent, and MD independent datasets, respectively. Our findings demonstrate that the proposed pAtbP-EnC model outperforms existing predictors, reporting approximately 11% higher training accuracy.
We conclude that the pAtbP-EnC predictor will be a valuable tool for pharmaceutical design and academic research. The datasets and source code are publicly available at https://github.com/Intelligent-models/pAtbP-EnC2023.
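The idea of combining the predicted labels of individual classifiers through a genetic algorithm can be illustrated with a small sketch. This is not the authors' implementation (their code is in the repository linked above). It is a minimal, assumed variant in which a genetic algorithm searches for voting weights over three classifiers. The toy predictions and labels and all function names (`weighted_vote`, `accuracy`, `normalize`, `evolve_weights`) are hypothetical and chosen only for illustration.

```python
import random

def weighted_vote(weights, votes):
    """Return 1 if the weighted share of positive votes is at least 0.5."""
    score = sum(w * v for w, v in zip(weights, votes))
    return 1 if score >= 0.5 else 0

def accuracy(weights, predictions, labels):
    """Fraction of samples whose weighted vote matches the true label."""
    hits = sum(weighted_vote(weights, votes) == y
               for votes, y in zip(predictions, labels))
    return hits / len(labels)

def normalize(weights):
    """Scale non-negative weights so that they sum to 1."""
    total = sum(weights)
    if total == 0:
        return [1.0 / len(weights)] * len(weights)
    return [w / total for w in weights]

def evolve_weights(predictions, labels, pop_size=20, generations=30, seed=0):
    """Search for voting weights with a simple elitist genetic algorithm."""
    rng = random.Random(seed)
    n = len(predictions[0])  # number of base classifiers (must be >= 2)
    # Initial population: uniform weights plus random candidates.
    population = [[1.0 / n] * n]
    population += [normalize([rng.random() for _ in range(n)])
                   for _ in range(pop_size - 1)]
    for _ in range(generations):
        population.sort(key=lambda w: accuracy(w, predictions, labels),
                        reverse=True)
        parents = population[:pop_size // 2]  # elitism: keep the better half
        children = []
        while len(parents) + len(children) < pop_size:
            a, b = rng.sample(parents, 2)
            cut = rng.randrange(1, n)          # one-point crossover
            child = a[:cut] + b[cut:]
            i = rng.randrange(n)               # Gaussian mutation of one weight
            child[i] = max(0.0, child[i] + rng.gauss(0, 0.1))
            children.append(normalize(child))
        population = parents + children
    return max(population, key=lambda w: accuracy(w, predictions, labels))

# Toy data (hypothetical): each tuple holds the 0/1 labels predicted by
# three base classifiers for one peptide; toy_labels holds the true classes.
toy_predictions = [(1, 0, 0), (1, 1, 0), (1, 0, 1), (1, 1, 1),
                   (0, 1, 1), (0, 0, 1), (0, 1, 0), (0, 0, 0)]
toy_labels = [1, 1, 1, 1, 0, 0, 0, 0]

uniform = [1.0 / 3] * 3
best = evolve_weights(toy_predictions, toy_labels)
print("uniform-vote accuracy:", accuracy(uniform, toy_predictions, toy_labels))
print("GA-weighted accuracy: ", accuracy(best, toy_predictions, toy_labels))
print("weights:", [round(w, 3) for w in best])
```

Because the uniform weights are part of the initial population and the better half of each generation is carried over unchanged, the weights returned in this sketch never score below plain majority voting on the data used for the search. In practice the weights would be tuned on held-out predictions rather than on the evaluation set.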
Anti-tubercular peptides, ensemble classification, genetic algorithm, hybrid representation, k-fold cross-validation test
Abdul Wali Khan University Mardan, Department of Computer Science, Mardan, 23200, Pakistan
My University Islamabad, Department of Computer Science, Islamabad, 44000, Pakistan
Liwa College of Technology, Department of General Education, Abu Dhabi, United Arab Emirates
University of Engineering and Technology, Department of Computer Science and IT, Peshawar, 25000, Pakistan
Al Ain University, Department of Computer Science, Abu Dhabi, United Arab Emirates
Institute of Information and Computational Technologies, Almaty, 050010, Kazakhstan
Future University in Egypt, Faculty of Engineering and Technology, New Cairo, 11835, Egypt