Enhancing the prediction accuracy of data-driven models for monthly streamflow in Urmia Lake Basin based upon the autoregressive conditionally heteroskedastic time-series model

Nasrin Fathollahzadeh Attar, Quoc Bao Pham, Sajad Fani Nowbandegani, Mohammad Rezaie-Balf, Chow Ming Fai, Ali Najah Ahmed, Saeed Pipelzadeh, Tran Duc Dung, Pham Thi Thao Nhi, Dao Nguyen Khoi, Ahmed El-Shafie

Research output: Contribution to journalArticle

1 Citation (Scopus)

Abstract

Hydrological modeling is one of the important subjects in managing water resources and the processes of predicting stochastic behavior. Developing Data-Driven Models (DDMs) to apply to hydrological modeling is a very complex issue because of the stochastic nature of the observed data, like seasonality, periodicities, anomalies, and lack of data. As streamflow is one of the most important components in the hydrological cycle, modeling and estimating streamflow is a crucial aspect. In this study, two models, namely, Optimally Pruned Extreme Learning Machine (OPELM) and Chi-Square Automatic Interaction Detector (CHAID) methods were used to model the deterministic parts of monthly streamflow equations, while Autoregressive Conditional Heteroskedasticity (ARCH) was used in modeling the stochastic parts of monthly streamflow equations. The state of art and innovation of this study is the integration of these models in order to create new hybrid models, ARCH-OPELM and ARCH-CHAID, and increasing the accuracy of models. The study draws on the monthly streamflow data of two different river stations, located in north-western Iran, including Dizaj and Tapik, which are on Nazluchai and Baranduzchai, gathered over 31 years from 1986 to 2016. To ascertain the conclusive accuracy, five evaluation metrics including Correlation Coefficient (R), Root Mean Square Error (RMSE), Nash-Sutcliffe Efficiency (NSE), Mean Absolute Error (MAE), the ratio of RMSE to the Standard Deviation (RSD), scatter plots, time-series plots, and Taylor diagrams were used. Standalone CHAID models have better results than OPELM methods considering sole models. In the case of hybrid models, ARCH-CHAID models in the validation stage performed better than ARCH-OPELM for Dizaj station (R = 0.96, RMSE = 1.289 m3/s, NSE = 0.92, MAE = 0.719 m3/s and RSD = 0.301) and for Tapik station (R = 0.94, RMSE = 2.662 m3/s, NSE = 0.86, MAE = 1.467 m3/s and RSD = 0.419). The results remarkably reveal that ARCH-CHAID models in both stations outperformed all other models. Finally, it is worth mentioning that the new hybrid "ARCH-DDM" models outperformed standalone models in predicting monthly streamflow.

Original languageEnglish
Article number571
JournalApplied Sciences (Switzerland)
Volume10
Issue number2
DOIs
Publication statusPublished - 01 Jan 2020

All Science Journal Classification (ASJC) codes

  • Materials Science(all)
  • Instrumentation
  • Engineering(all)
  • Process Chemistry and Technology
  • Computer Science Applications
  • Fluid Flow and Transfer Processes

Fingerprint Dive into the research topics of 'Enhancing the prediction accuracy of data-driven models for monthly streamflow in Urmia Lake Basin based upon the autoregressive conditionally heteroskedastic time-series model'. Together they form a unique fingerprint.

  • Cite this

    Attar, N. F., Pham, Q. B., Nowbandegani, S. F., Rezaie-Balf, M., Fai, C. M., Ahmed, A. N., Pipelzadeh, S., Dung, T. D., Nhi, P. T. T., Khoi, D. N., & El-Shafie, A. (2020). Enhancing the prediction accuracy of data-driven models for monthly streamflow in Urmia Lake Basin based upon the autoregressive conditionally heteroskedastic time-series model. Applied Sciences (Switzerland), 10(2), [571]. https://doi.org/10.3390/app10020571