Transfer learning using the global Caravan dataset for developing a local river streamflow prediction model


Alzhanov A. Nugumanova A. Moreido V.
October 2025Elsevier Ltd

Environmental Modelling and Software
2025#194

Effective water resource and flood risk management depends on reliable streamflow forecasting. However, the accuracy of such forecasts is often limited by sparse monitoring networks and insufficient historical data. To address this issue, we explore the potential of a multi-basin training approach using the global Caravan hydrological dataset to improve local streamflow forecasting. As a case study, we focus on the Uba River basin in East Kazakhstan. The developed models are evaluated against two baselines: GR4J hydrological model and an LSTM model trained exclusively on local data. Results indicate that our approach enhances forecasting accuracy and outperforms the baseline models, with the best model achieving Nash-Sutcliffe efficiency value of 0.8187 compared to 0.72 of GR4J and 0.7602 of LSTM trained exclusively on local data. These findings indicate that multi-basin training with global datasets can enhance local streamflow forecasting in data-scarce regions.

Caravan dataset , ERA5-Land , Flood forecasting , LSTM , Streamflow forecasting , Transfer learning

Text of the article Перейти на текст статьи

Big Data and Blockchain Technologies Research Innovation Center, Astana IT University, Astana, Kazakhstan
Water Problems Institute of Russian Academy of Sciences, Moscow, 119333, Russian Federation

Big Data and Blockchain Technologies Research Innovation Center
Water Problems Institute of Russian Academy of Sciences

10 лет помогаем публиковать статьи Международный издатель

Книга Публикация научной статьи Волощук 2026 Book Publication of a scientific article 2026