Efficient active learning using recursive estimation of error reduction


Mukhamediya A. Sameni R. Zollanvari A.
14 July 2025Elsevier B.V.

Neurocomputing
2025#638

Efficient labeling is an open challenge in the big data era, where the rate of data collection far exceeds human capacity for data labeling required for training machine learning models. Active learning (AL) attempts to find the most valuable instances from a pool of unlabeled data by exploiting a limited annotation budget. In this paper, we focus on the estimated error reduction (EER) approach for AL. Given an initial labeled set, EER aims to identify the candidate unlabeled data points that, if used in training with other labeled data, minimize a weighted average of error estimates. As such, EER is a promising technique for AL because it directly examines the effect of labeling the candidate data points on the classifiers performance. However, the main shortcoming of EER is attributed to its excessive computational cost, as it requires retraining the classifier for each candidate data point and every possible label. In this regard, we propose an efficient EER algorithm using an accurate closed-form error estimator of Regularized Linear Discriminant Analysis (AL-RLDA) for the case of binary classification. At the same time, we introduce a novel objective function for EER, that relies less on the estimates of posterior probabilities. The efficiency of the algorithm is attributed to: (i) using a closed-form error estimator; and (ii) updating the estimator expression recursively for each candidate data point. Our empirical results using real data show that the proposed method can outperform baseline and state-of-the-art AL methods.

Active learning , Estimated error reduction , Regularized Linear Discriminant Analysis

Text of the article Перейти на текст статьи

Department of Electrical and Computer Engineering, School of Engineering and Digital Sciences, Nazarbayev University, Kabanbay batyr 53, Astana, 010000, Kazakhstan
Department of Biomedical Informatics, School of Medicine, Emory University, Atlanta, 30322, GA, United States
Department of Biomedical Engineering, Georgia Institute of Technology, Atlanta, 30332, GA, United States

Department of Electrical and Computer Engineering
Department of Biomedical Informatics
Department of Biomedical Engineering

10 лет помогаем публиковать статьи Международный издатель

Книга Публикация научной статьи Волощук 2026 Book Publication of a scientific article 2026