Transfer learning using the global Caravan dataset for developing a local river streamflow prediction model
Alzhanov A. Nugumanova A. Moreido V.
October 2025Elsevier Ltd
Environmental Modelling and Software
2025#194
Effective water resource and flood risk management depends on reliable streamflow forecasting. However, the accuracy of such forecasts is often limited by sparse monitoring networks and insufficient historical data. To address this issue, we explore the potential of a multi-basin training approach using the global Caravan hydrological dataset to improve local streamflow forecasting. As a case study, we focus on the Uba River basin in East Kazakhstan. The developed models are evaluated against two baselines: GR4J hydrological model and an LSTM model trained exclusively on local data. Results indicate that our approach enhances forecasting accuracy and outperforms the baseline models, with the best model achieving Nash-Sutcliffe efficiency value of 0.8187 compared to 0.72 of GR4J and 0.7602 of LSTM trained exclusively on local data. These findings indicate that multi-basin training with global datasets can enhance local streamflow forecasting in data-scarce regions.
Caravan dataset , ERA5-Land , Flood forecasting , LSTM , Streamflow forecasting , Transfer learning
Text of the article Перейти на текст статьи
Big Data and Blockchain Technologies Research Innovation Center, Astana IT University, Astana, Kazakhstan
Water Problems Institute of Russian Academy of Sciences, Moscow, 119333, Russian Federation
Big Data and Blockchain Technologies Research Innovation Center
Water Problems Institute of Russian Academy of Sciences
10 лет помогаем публиковать статьи Международный издатель
Книга Публикация научной статьи Волощук 2026 Book Publication of a scientific article 2026