Non-linear input variable selection approach integrated with non-tuned data intelligence model for streamflow pattern simulation

DOI: 10.1109/ACCESS.2019.2943515
Keywords: Engineering and Technology, Civil Engineering, Geotechnical Engineering, Correlated variables, Non-linear XGB approach, Extreme learning machine, Streamflow, Soil Mechanics
Publication year: 2019
Relevant Sustainable Development Goals (SDGs):
SDG 6 Clean water and sanitation


Streamflow modeling is an essential component of water resources planning and management. Water resources engineers face numerous challenges in streamflow prediction. These challenges arise from the complex processes associated with several natural variables, which exhibit non-stationarity, non-linearity, and randomness. In this study, a new model is proposed to predict long-term streamflow. Lags spanning several years are extracted using Extreme Gradient Boosting (XGB), and the selected input variables are then fed into the predictive model, an Extreme Learning Machine (ELM). The proposed model is compared with the stand-alone schema, in which the optimum lags of the variables are supplied to the XGB and ELM models. Hydrological variables including rainfall, temperature, and evapotranspiration are used to build the model and predict the streamflow at the Goksu-Himmeti basin in Turkey. The results showed that the XGB model performed very well and can be used to predict the streamflow pattern. The results also show that the accuracy of streamflow prediction with XGB could be improved when a large number of lags was used. However, XGB is a tree-based technique, which can raise issues such as overfitting. In the proposed XGBELM schema, XGB selects the correlated inputs and ranks them by importance; the selected inputs are then supplied to the ELM model for prediction. The XGBELM model outperformed the stand-alone schema of both the XGB and ELM models, as well as the high-lagged schema of XGB. Notably, the XGBELM model was found to improve prediction ability with a minimum number of variables.
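The two-stage schema described in the abstract (rank lagged inputs, then train an ELM on the top-ranked ones) can be illustrated with a minimal sketch. This is not the authors' implementation. The helper names (make_lagged, select_top_k, ELM), the synthetic series, the hidden-layer size, and the number of lags kept are all hypothetical. To keep the sketch dependent on NumPy only, absolute correlation is used as a placeholder importance score; in the paper's schema the scores would come from the XGB model instead.

```python
import numpy as np

def make_lagged(series, n_lags):
    """Return (X, y) where column k-1 of X holds the series lagged by k steps."""
    series = np.asarray(series, dtype=float)
    n = len(series)
    X = np.column_stack([series[n_lags - k : n - k] for k in range(1, n_lags + 1)])
    y = series[n_lags:]
    return X, y

def select_top_k(importances, k):
    """Indices of the k largest importance scores, most important first."""
    return np.argsort(np.asarray(importances))[::-1][:k]

class ELM:
    """Single-hidden-layer ELM: random fixed hidden weights, least-squares output."""
    def __init__(self, n_hidden=20, seed=0):
        self.n_hidden = n_hidden
        self.rng = np.random.default_rng(seed)

    def _hidden(self, X):
        return np.tanh(X @ self.W + self.b)

    def fit(self, X, y):
        # Hidden weights are drawn once and never tuned.
        self.W = self.rng.normal(size=(X.shape[1], self.n_hidden))
        self.b = self.rng.normal(size=self.n_hidden)
        # Output weights via the Moore-Penrose pseudoinverse.
        self.beta = np.linalg.pinv(self._hidden(X)) @ y
        return self

    def predict(self, X):
        return self._hidden(X) @ self.beta

# Synthetic monthly-like series (assumption: stands in for real streamflow data).
rng = np.random.default_rng(42)
t = np.arange(240)
flow = np.sin(2 * np.pi * t / 12) + 0.1 * rng.normal(size=t.size)

X, y = make_lagged(flow, n_lags=24)

# Placeholder ranking: |correlation| per lag. The paper ranks with XGB instead.
scores = np.array([abs(np.corrcoef(X[:, j], y)[0, 1]) for j in range(X.shape[1])])
keep = select_top_k(scores, k=5)

split = int(0.8 * len(y))
model = ELM(n_hidden=20, seed=0).fit(X[:split, keep], y[:split])
pred = model.predict(X[split:, keep])
rmse = float(np.sqrt(np.mean((pred - y[split:]) ** 2)))
print("selected lags:", (keep + 1).tolist(), "test RMSE:", rmse)
```

The ELM here is "non-tuned" in the sense that only the output weights are solved for; the hidden layer stays at its random initial values.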


Sinan Jasim Hadi

Department of the Real Estate Development and Management, Ankara University, Ankara, Turkey

S.I. Abba

Department of Physical Planning Development, Maitama Sule University Kano, Nigeria

Saad Sh. Sammen

Department of Civil Engineering, College of Engineering, University of Diyala, Diyala Governorate, Iraq

Sinan Q. Salih

Institute of Research and Development, Duy Tan University, Da Nang, Vietnam

Nadhir Al-Ansari

Luleå tekniska universitet; Geoteknologi

Zaher Mundher Yaseen

Sustainable Developments in Civil Engineering Research Group, Faculty of Civil Engineering, Ton Duc Thang University, Ho Chi Minh City, Vietnam
