A Hybrid Machine Learning Approach for High-Accuracy Energy Consumption Prediction Using Indoor Environmental Quality Sensors
Amangeldy B. Tasmurzayev N. Imankulov T. Imanbek B. Wójcik W. Nurakhov Y.
August 2025Multidisciplinary Digital Publishing Institute (MDPI)
Energies
2025#18Issue 15
Accurate forecasting of energy consumption in buildings is essential for achieving energy efficiency and reducing carbon emissions. However, many existing models rely on limited input variables and overlook the complex influence of indoor environmental quality (IEQ). In this study, we assess the performance of hybrid machine learning ensembles for predicting hourly energy demand in a smart office environment using high-frequency IEQ sensor data. Environmental variables including carbon dioxide concentration (CO2), particulate matter (PM2.5), total volatile organic compounds (TVOCs), noise levels, humidity, and temperature were recorded over a four-month period. We evaluated two ensemble configurations combining support vector regression (SVR) with either Random Forest or LightGBM as base learners and Ridge regression as a meta-learner, alongside single-model baselines such as SVR and artificial neural networks (ANN). The SVR combined with Random Forest and Ridge regression demonstrated the highest predictive performance, achieving a mean absolute error (MAE) of 1.20, a mean absolute percentage error (MAPE) of 8.92%, and a coefficient of determination (R2) of 0.82. Feature importance analysis using SHAP values, together with non-parametric statistical testing, identified TVOCs, humidity, and PM2.5 as the most influential predictors of energy use. These findings highlight the value of integrating high-resolution IEQ data into predictive frameworks and demonstrate that such data can significantly improve forecasting accuracy. This effect is attributed to the direct link between these IEQ variables and the activation of energy-intensive systems; fluctuations in humidity drive HVAC energy use for dehumidification, while elevated pollutant levels (TVOCs, PM2.5) trigger increased ventilation to maintain indoor air quality, thus raising the total energy load.
building energy management , energy consumption , hybrid models , indoor environmental quality (IEQ) , machine learning , predictive modeling , smart buildings , support vector regression (SVR)
Text of the article Перейти на текст статьи
LLP “DigitAlem”, Almaty, 050042, Kazakhstan
Faculty of Information Technologies, Al-Farabi Kazakh National University, Almaty, 050040, Kazakhstan
Institute of Electronics and Information Technology, Politechnika Lubelska, Lublin, 20-618, Poland
LLP “DigitAlem”
Faculty of Information Technologies
Institute of Electronics and Information Technology
10 лет помогаем публиковать статьи Международный издатель
Книга Публикация научной статьи Волощук 2026 Book Publication of a scientific article 2026