Abstract
Forecasting water stages is of significance to river and reservoir management. However, conventional models sometimes fail to perform accurately, as water levels are characterized by high nonstationarity. To provide an improved estimation of water stages, this study develops a new prediction framework by coupling stand-alone machine learning models with ensemble algorithms. As base learners, the optimal regression tree (RT) and extreme learning machine (ELM) are integrated into four ensemble strategies, i.e., bagging (BA), boosting (BO), random forest (RF) and random subspace (RS), leading to eight ensemble models. They are then assessed using daily water-stage records at two hydrological stations on the Yangtze River. Their performance is evaluated by statistical criteria: coefficient of determination (CD), Nash–Sutcliffe efficiency (NSE), root mean square error (RMSE) and mean absolute error (MAE). The RT and the ELM generate satisfactory predictions with deficiency in capturing extreme values. The ensemble models generally enhance the prediction efficiency, with their mean CD and NSE augment by up to 6.9% and 7.0%, and mean RMSE and MAE reduction by up to 47.9% and 47.0%. The BO-based models, namely BO-RT and BO-ELM, result in the highest accuracy, with a mean absolute relative error (ARE) of 1.0% and 1.4%. Ensemble learning gains even in multi-step-ahead forecasts, which satisfactorily extends the lead time up to 14 days. This study illustrates the capability of ensemble learning for improved water-level forecasts, which provides reference for modeling related issues such as sediment load and rainfall-runoff.
Similar content being viewed by others
References
King CW, Webber ME (2008) Water intensity of transportation. Environ Sci Technol 42:7866–7872
Coops H, Beklioglu M, Crisman TL (2003) The role of water-level fluctuations in shallow lake ecosystems–workshop conclusions. Hydrobiologia 506:23–27
Schuurmans J, Hof A, Dijkstra S, Bosgra O, Brouwer R (1999) Simple water level controller for irrigation and drainage canals. J Irrig Drain Eng 125:189–195
Cohen Y, Radomski P (1993) Water level regulations and fisheries in Rainy Lake and the Namakan Reservoir. Can J Fish Aquat Sci 50:1934–1945
Alsdorf DE, Melack JM, Dunne T, Mertes LA, Hess LL, Smith LC (2000) Interferometric radar measurements of water level changes on the Amazon flood plain. Nature 404:174–177
Li S, He D (2008) Water level response to hydropower development in the upper Mekong River. AMBIO 37:170–7
Zhou T, Jiang Z, Liu X, Tan K (2020) Research on the long-term and short-term forecasts of navigable river’s water-level fluctuation based on the adaptive multilayer perceptron. J Hydrol 591:125285
Aksoy H, Unal N, Eris E, Yuce M (2013) Stochastic modeling of Lake Van water level time series with jumps and multiple trends. Hydrol Earth Syst Sci 17:2297–2303
Kebede S, Travi Y, Alemayehu T, Marc V (2006) Water balance of Lake Tana and its sensitivity to fluctuations in rainfall, Blue Nile basin, Ethiopia. J Hydrol 316:233–247
Fischer P, Öhl U (2005) Effects of water-level fluctuations on the littoral benthic fish community in lakes: a mesocosm experiment. Behav Ecol 16:741–746
Phan T-T-H, Nguyen XH (2020) Combining statistical machine learning models with ARIMA for water level forecasting: the case of the Red river. Adv Water Resour 142:103656
Sanikhani H, Kisi O, Maroufpoor E, Yaseen ZM (2019) Temperature-based modeling of reference evapotranspiration using several artificial intelligence models: application of different modeling scenarios. Theor Appl Climatol 135:449–462
Zhu S, Heddam S, Nyarko EK, Hadzima-Nyarko M, Piccolroaz S, Wu S (2019) Modeling daily water temperature for rivers: comparison between adaptive neuro-fuzzy inference systems and artificial neural networks models. Environ Sci Pollut Res 26:402–420
Yaseen ZM, Sulaiman SO, Deo RC, Chau K-W (2019) An enhanced extreme learning machine model for river flow forecasting: State-of-the-art, practical applications in water resource engineering area and future research direction. J Hydrol 569:387–408
Hadi SJ, Tombul M (2018) Monthly streamflow forecasting using continuous wavelet and multi-gene genetic programming combination. J Hydrol 561:674–687
Altunkaynak A (2007) Forecasting surface water level fluctuations of Lake Van by artificial neural networks. Water Resour Manag 21:399–408
Buyukyildiz M, Tezel G, Yilmaz V (2014) Estimation of the change in lake water level by artificial intelligence methods. Water Resour Manag 28:4747–4763
Hrnjica B, Bonacci O (2019) Lake level prediction using feed forward and recurrent neural networks. Water Resour Manag 33:2471–2484
Le X-H, Ho HV, Lee G, Jung S (2019) Application of long short-term memory (LSTM) neural network for flood forecasting. Water 11:1387
Güldal V, Tongal H (2010) Comparison of recurrent neural network, adaptive neuro-fuzzy inference system and stochastic models in Eğirdir Lake level forecasting. Water Resour Manag 24:105–128
Kisi O, Shiri J, Nikoofar B (2012) Forecasting daily lake levels using artificial intelligence approaches. Comput Geosci 41:169–180
Khan MS, Coulibaly P (2006) Application of support vector machine in lake water level prediction. J Hydrol Eng 11:199–205
Pasupa K, Jungjareantrat S (2016) Water levels forecast in Thailand: A case study of Chao Phraya River. 2016 14th International Conference on Control, Automation, Robotics and Vision (ICARCV): IEEE. 1–6
Yu Z, Lei G, Jiang Z, Liu F (2017) ARIMA modelling and forecasting of water level in the middle reach of the Yangtze River. 2017 4th International Conference on Transportation Information and Safety (ICTIS): IEEE. 172–7
Galavi H, Mirzaei M, Shul LT, Valizadeh N (2013) Klang River–level forecasting using ARIMA and ANFIS models. J Am Water Works Ass 105:E496–E506
Seo Y, Choi E, Yeo W (2017) Reservoir water level forecasting using machine learning models. J Korean Soc Agricult Eng 59:97–110
Yang J-H, Cheng C-H, Chan C-P (2017) A time-series water level forecasting model based on imputation and variable selection method. Comput Intell Neurosci. https://doi.org/10.1155/2017/8734214
Ghorbani MA, Khatibi R, Aytek A, Makarynskyy O, Shiri J (2010) Sea water level forecasting using genetic programming and comparing the performance with artificial neural networks. Comput Geosci 36:620–627
Karimi S, Shiri J, Kisi O, Makarynskyy O (2012) Forecasting water level fluctuations of Urmieh Lake using gene expression programming and adaptive neuro-fuzzy inference system. Int J Ocean Clim Syst 3:109–125
Zhang GP (2003) Time series forecasting using a hybrid ARIMA and neural network model. Neurocomputing 50:159–175
Seo Y, Kim S, Kisi O, Singh VP (2015) Daily water level forecasting using wavelet decomposition and artificial intelligence techniques. J Hydrol 520:224–243
Zhong C, Jiang Z, Chu X, Guo T, Wen Q (2019) Water level forecasting using a hybrid algorithm of artificial neural networks and local Kalman filtering. Proc Inst Mech Eng Part M J Eng Marit Environ 233:174–185
Xu G, Cheng Y, Liu F, Ping P, Sun J (2019) A water level prediction model based on ARIMA-RNN. 2019 IEEE Fifth International Conference on Big Data Computing Service and Applications (BigDataService): IEEE. 221–6
Ghorbani MA, Deo RC, Karimi V, Yaseen ZM, Terzi O (2018) Implementation of a hybrid MLP-FFA model for water level prediction of Lake Egirdir, Turkey. Stoch Environ Res Risk Assess 32:1683–1697
Kisi O, Shiri J, Karimi S, Shamshirband S, Motamedi S, Petković D et al (2015) A survey of water level fluctuation predicting in Urmia Lake using support vector machine with firefly algorithm. Appl Math Comput 270:731–743
Zounemat-Kermani M, Batelaan O, Fadaee M, Hinkelmann R (2021) Ensemble machine learning paradigms in hydrology: a review. J Hydrol 598:126266
Kim D, Yu H, Lee H, Beighley E, Durand M, Alsdorf DE et al (2019) Ensemble learning regression for estimating river discharges using satellite altimetry data: Central Congo River as a Test-bed. Remote Sens Environ 221:741–755
Naghibi SA, Dolatkordestani M, Rezaei A, Amouzegari P, Heravi MT, Kalantar B et al (2019) Application of rotation forest with decision trees as base classifier and a novel ensemble model in spatial modeling of groundwater potential. Environ Monit Assess 191:1–20
Liu S, Xu J, Zhao J, Xie X, Zhang W (2014) Efficiency enhancement of a process-based rainfall–runoff model using a new modified AdaBoost. RT Tech Appl Soft Comput 23:521–529
Elkiran G, Nourani V, Abba S (2019) Multi-step ahead modelling of river water quality parameters using ensemble artificial intelligence-based approach. J Hydrol 577:123962
Salam R, Islam ARMT (2020) Potential of RT, Bagging and RS ensemble learning algorithms for reference evapotranspiration prediction using climatic data-limited humid region in Bangladesh. J Hydrol 590:125241
Zhao Y, Li Y, Zhang L, Wang Q (2016) Groundwater level prediction of landslide based on classification and regression tree. Geod Geodyn 7:348–355
Shiri J, Shamshirband S, Kisi O, Karimi S, Bateni SM, Nezhad SHH et al (2016) Prediction of water-level in the Urmia Lake using the extreme learning machine approach. Water Resour Manag 30:5217–5229
Deo RC, Şahin M (2016) An extreme learning machine model for the simulation of monthly mean streamflow water level in eastern Queensland. Environ Monit Assess 188:90
Brieman L, Friedman JH, Olshen RA, Stone CJ (1984) Classification and regression trees. CRC Press, New York
Gomes CMA, Jelihovschi E (2020) Presenting the regression tree method and its application in a large-scale educational dataset. Int J Res Method Edu 43:201–221
Huang G-B, Zhu Q-Y, Siew C-K (2006) Extreme learning machine: theory and applications. Neurocomputing 70:489–501
Atiquzzaman M, Kandasamy J (2016) Prediction of hydrological time-series using extreme learning machine. J Hydroinf 18:345–353
Rezaie-Balf M, Kisi O (2018) New formulation for forecasting streamflow: evolutionary polynomial regression vs. extreme learning machine. Hydrol Res 49:939–953
Breiman L (1996) Bagging predictors. Mach Learn 24:123–140
Rokach L (2010) Ensemble-based classifiers. Artif Intell Rev 33:1–39
Alfaro E, Gamez M, Garcia N (2013) Adabag: an R package for classification with boosting and bagging. J Stat Softw 54:1–35
Breiman L (2001) Random forests. Mach Learn 45:5–32
Dong X, Yu Z, Cao W, Shi Y, Ma Q (2020) A survey on ensemble learning. Front Comput Sci 14:241–258
Khosravi K, Cooper JR, Daggupati P, Pham BT, Bui DT (2020) Bedload transport rate prediction: application of novel hybrid data mining techniques. J Hydrol 585:124774
Mosavi A, Golshan M, Janizadeh S, Choubin B, Melesse AM, Dineva AA (2020) Ensemble models of GLM, FDA, MARS, and RF for flood and erosion susceptibility mapping: a priority assessment of sub-basins. Geocarto Int 37:2541–60
Ho TK (1998) The random subspace method for constructing decision forests. IEEE Trans Pattern Anal Mach Intell 20:832–844
Nhu V-H, Khosravi K, Cooper JR, Karimi M, Kisi O, Pham BT et al (2020) Monthly suspended sediment load prediction using artificial intelligence: testing of a new random subspace method. Hydrol Sci J 65:2116–2127
Chen W, Zhao X, Tsangaratos P, Shahabi H, Ilia I, Xue W et al (2020) Evaluating the usage of tree-based ensemble methods in groundwater spring potential mapping. J Hydrol 583:124602
Tiwari MK, Chatterjee C (2011) A new wavelet–bootstrap–ANN hybrid model for daily discharge forecasting. J Hydroinf 13:500–519
Moriasi DN, Arnold JG, Van Liew MW, Bingner RL, Harmel RD, Veith TL (2007) Model evaluation guidelines for systematic quantification of accuracy in watershed simulations. Trans ASABE 50:885–900
Ayele GT, Teshale EZ, Yu B, Rutherfurd ID, Jeong J (2017) Streamflow and sediment yield prediction for watershed prioritization in the Upper Blue Nile River Basin. Ethiopia Water 9:782
Xie Q, Yang J, Lundström TS (2021) Sediment and morphological changes along Yangtze River’s 500 km between Datong and Xuliujing before and after Three Gorges Dam commissioning. Sci Rep 11:1–17
Li S, Yang J, Ansell A (2021) Discharge prediction for rectangular sharp-crested weirs by machine learning techniques. Flow Meas Instrum 79:101931
Roushangar K, Ghasempour R (2019) Evaluation of the impact of channel geometry and rough elements arrangement in hydraulic jump energy dissipation via SVM. J Hydroinf 21:92–103
Mehr AD (2018) An improved gene expression programming model for streamflow forecasting in intermittent streams. J Hydrol 563:669–678
Rahmani-Rezaeieh A, Mohammadi M, Mehr AD (2020) Ensemble gene expression programming: a new approach for evolution of parsimonious streamflow forecasting model. Theor Appl Climatol 139:549–564
Sattar AM, Ertuğrul ÖF, Gharabaghi B, McBean EA, Cao J (2019) Extreme learning machine model for water network management. Neural Comput Appl 31:157–169
Matlab. Matlab R2021a (2021) Natick. The Mathworks, Inc, Massachusetts
Valipour M, Banihabib ME, Behbahani SMR (2013) Comparison of the ARMA, ARIMA, and the autoregressive artificial neural network models in forecasting the monthly inflow of Dez dam reservoir. J Hydrol 476:433–441
Heo KY, Ha KJ, Yun KS, Lee SS, Kim HJ, Wang B (2014) Methods for uncertainty assessment of climate models and model predictions over East Asia. Int J Climatol 34:377–390
Cheng H, Tan P-N, Gao J, Scripps J (2006) Multistep-ahead time series prediction. Pacific-Asia Conference on Knowledge Discovery and Data Mining: Springer. 765-74
Barzegar R, Moghaddam AA, Adamowski J, Ozga-Zielinski B (2018) Multi-step water quality forecasting using a boosting ensemble multi-wavelet extreme learning machine model. Stoch Environ Res Risk Assess 32:799–813
Tyralis H, Papacharalampous G, Langousis A (2021) Super ensemble learning for daily streamflow forecasting: large-scale demonstration and comparison with multiple machine learning algorithms. Neural Comput Appl 33:3053–3068
Zounemat-Kermani M, Stephan D, Barjenbruch M, Hinkelmann R (2020) Ensemble data mining modeling in corrosion of concrete sewer: a comparative study of network-based (MLPNN & RBFNN) and tree-based (RF, CHAID, & CART) models. Adv Eng Inf 43:101030
Kim S, Alizamir M, Zounemat-Kermani M, Kisi O, Singh VP (2020) Assessing the biochemical oxygen demand using neural networks and ensemble tree approaches in South Korea. J Environ Manag 270:110834
Pandey M, Jamei M, Karbasi M, Ahmadianfar I, Chu X (2021) Prediction of maximum scour depth near spur dikes in uniform bed sediment using stacked generalization ensemble tree-based frameworks. J Irrig Drain Eng 147:04021050
Pan M, Zhou H, Cao J, Liu Y, Hao J, Li S et al (2020) Water level prediction model based on GRU and CNN. IEEE Access 8:60090–60100
Liu D, Jiang W, Mu L, Wang S (2020) Streamflow prediction using deep learning neural network: case study of Yangtze River. IEEE Access 8:90069–90086
Xu W, Jiang Y, Zhang X, Li Y, Zhang R, Fu G (2020) Using long short-term memory networks for river flow prediction. Hydrol Res 51:1358–1376
Acknowledgements
The study is funded by the Swedish Hydropower Center (SVC). It is part of research projects “Two-phase flow modeling: evaluations and simulations for safer spillway discharge” and “Quality and trust of numerical modeling of water-air flows for safe spillway discharge”, with James Yang and Anders Ansell as project leaders. SVC has been established by the Swedish Energy Agency, Energiforsk and Svenska Kraftnät, together with Royal Institute of Technology (KTH), Luleå University of Technology (LTU), Uppsala University (UU) and Chalmers University of Technology (CTH). Participating companies and industry associations include AFRY, Andritz Hydro, Boliden, Fortum Generation, Holmen Energi, Jämtkraft, Karlstads Energi, LKAB, Mälarenergi, Norconsult, Rainpower, Skellefteå Kraft, Sollefteåforsens, Statkraft Sverige, Sweco Energuide, Sweco Infrastructure, Tekniska verken i Linköping, Uniper, Vattenfall R&D, Vattenfall Vattenkraft, Voith Hydro, WSP Sverige and Zinkgruvan. The assistance from Holger Ecke of Vattenfall R&D and Qiancheng Xie of LTU is acknowledged.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Li, S., Yang, J. Improved river water-stage forecasts by ensemble learning. Engineering with Computers 39, 3293–3311 (2023). https://doi.org/10.1007/s00366-022-01751-1
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00366-022-01751-1