Predictive Modeling Approach for Surface Water Quality: Development and Comparison of Machine Learning Models

Water pollution is an increasing global issue that societies are facing and is threating human health, ecosystem functions and agriculture production. The distinguished features of artificial intelligence (AI) based modeling can deliver a deep insight pertaining to rising water quality concerns. The...

Full description

Bibliographic Details
Main Authors: Muhammad Izhar Shah, Wesam Salah Alaloul, Abdulaziz Alqahtani, Ali Aldrees, Muhammad Ali Musarat, Muhammad Faisal Javed
Format: Article
Language:English
Published: MDPI AG 2021-07-01
Series:Sustainability
Subjects:
Online Access:https://www.mdpi.com/2071-1050/13/14/7515
id doaj-44669bd7a60f4a76b5efdc2a274a230e
record_format Article
spelling doaj-44669bd7a60f4a76b5efdc2a274a230e2021-07-23T14:06:48ZengMDPI AGSustainability2071-10502021-07-01137515751510.3390/su13147515Predictive Modeling Approach for Surface Water Quality: Development and Comparison of Machine Learning ModelsMuhammad Izhar Shah0Wesam Salah Alaloul1Abdulaziz Alqahtani2Ali Aldrees3Muhammad Ali Musarat4Muhammad Faisal Javed5Department of Civil Engineering, COMSATS University Islamabad, Abbottabad Campus, Abbottabad 22060, PakistanDepartment of Civil and Environmental Engineering, Universiti Teknologi PETRONAS, Bander Seri Iskandar 32610, MalaysiaDepartment of Civil Engineering, College of Engineering in Al-Kharj, Prince Sattam Bin Abdulaziz University, Al-Kharj 16273, Saudi ArabiaDepartment of Civil Engineering, College of Engineering in Al-Kharj, Prince Sattam Bin Abdulaziz University, Al-Kharj 16273, Saudi ArabiaDepartment of Civil and Environmental Engineering, Universiti Teknologi PETRONAS, Bander Seri Iskandar 32610, MalaysiaDepartment of Civil Engineering, COMSATS University Islamabad, Abbottabad Campus, Abbottabad 22060, PakistanWater pollution is an increasing global issue that societies are facing and is threating human health, ecosystem functions and agriculture production. The distinguished features of artificial intelligence (AI) based modeling can deliver a deep insight pertaining to rising water quality concerns. The current study investigates the predictive performance of gene expression programming (GEP), artificial neural network (ANN) and linear regression model (LRM) for modeling monthly total dissolved solids (TDS) and specific conductivity (EC) in the upper Indus River at two outlet stations. In total, 30 years of historical water quality data, comprising 360 TDS and EC monthly records, were used for models training and testing. Based on a significant correlation, the TDS and EC modeling were correlated with seven input parameters. Results were evaluated using various performance measure indicators, error assessment and external criteria. The simulated outcome of the models indicated a strong association with actual data where the correlation coefficient above 0.9 was observed for both TDS and EC. Both the GEP and ANN models remained the reliable techniques in predicting TDS and EC. The formulated GEP mathematical equations depict its novelty as compared to ANN and LRM. The results of sensitivity analysis indicated the increasing trend of input variables affecting TDS as HCO<sub>3</sub><sup>−</sup> (22.33%) > Cl<sup>−</sup> (21.66%) > Mg<sup>2+</sup> (16.98%) > Na<sup>+</sup> (14.55%) > Ca<sup>2+</sup> (12.92%) > SO<sub>4</sub><sup>2−</sup> (11.55%) > pH (0%), while, in the case of EC, it followed the trend as HCO<sub>3</sub><sup>−</sup> (42.36%) > SO<sub>4</sub><sup>2−</sup>(25.63%) > Ca<sup>2+</sup> (13.59%) > Cl<sup>−</sup> (12.8%) > Na<sup>+</sup> (5.01%) > pH (0.61%) > Mg<sup>2+</sup> (0%). The parametric analysis revealed that models have incorporated the effect of all the input parameters in the modeling process. The external assessment criteria confirmed the generalized outcome and robustness of the proposed approaches. Conclusively, the outcomes of this study demonstrated that the formulation of AI based models are cost effective and helpful for river water quality assessment, management and policy making.https://www.mdpi.com/2071-1050/13/14/7515river water qualitysustainable environmentsoft computingregression analysistotal dissolved solidsspecific conductivity
collection DOAJ
language English
format Article
sources DOAJ
author Muhammad Izhar Shah
Wesam Salah Alaloul
Abdulaziz Alqahtani
Ali Aldrees
Muhammad Ali Musarat
Muhammad Faisal Javed
spellingShingle Muhammad Izhar Shah
Wesam Salah Alaloul
Abdulaziz Alqahtani
Ali Aldrees
Muhammad Ali Musarat
Muhammad Faisal Javed
Predictive Modeling Approach for Surface Water Quality: Development and Comparison of Machine Learning Models
Sustainability
river water quality
sustainable environment
soft computing
regression analysis
total dissolved solids
specific conductivity
author_facet Muhammad Izhar Shah
Wesam Salah Alaloul
Abdulaziz Alqahtani
Ali Aldrees
Muhammad Ali Musarat
Muhammad Faisal Javed
author_sort Muhammad Izhar Shah
title Predictive Modeling Approach for Surface Water Quality: Development and Comparison of Machine Learning Models
title_short Predictive Modeling Approach for Surface Water Quality: Development and Comparison of Machine Learning Models
title_full Predictive Modeling Approach for Surface Water Quality: Development and Comparison of Machine Learning Models
title_fullStr Predictive Modeling Approach for Surface Water Quality: Development and Comparison of Machine Learning Models
title_full_unstemmed Predictive Modeling Approach for Surface Water Quality: Development and Comparison of Machine Learning Models
title_sort predictive modeling approach for surface water quality: development and comparison of machine learning models
publisher MDPI AG
series Sustainability
issn 2071-1050
publishDate 2021-07-01
description Water pollution is an increasing global issue that societies are facing and is threating human health, ecosystem functions and agriculture production. The distinguished features of artificial intelligence (AI) based modeling can deliver a deep insight pertaining to rising water quality concerns. The current study investigates the predictive performance of gene expression programming (GEP), artificial neural network (ANN) and linear regression model (LRM) for modeling monthly total dissolved solids (TDS) and specific conductivity (EC) in the upper Indus River at two outlet stations. In total, 30 years of historical water quality data, comprising 360 TDS and EC monthly records, were used for models training and testing. Based on a significant correlation, the TDS and EC modeling were correlated with seven input parameters. Results were evaluated using various performance measure indicators, error assessment and external criteria. The simulated outcome of the models indicated a strong association with actual data where the correlation coefficient above 0.9 was observed for both TDS and EC. Both the GEP and ANN models remained the reliable techniques in predicting TDS and EC. The formulated GEP mathematical equations depict its novelty as compared to ANN and LRM. The results of sensitivity analysis indicated the increasing trend of input variables affecting TDS as HCO<sub>3</sub><sup>−</sup> (22.33%) > Cl<sup>−</sup> (21.66%) > Mg<sup>2+</sup> (16.98%) > Na<sup>+</sup> (14.55%) > Ca<sup>2+</sup> (12.92%) > SO<sub>4</sub><sup>2−</sup> (11.55%) > pH (0%), while, in the case of EC, it followed the trend as HCO<sub>3</sub><sup>−</sup> (42.36%) > SO<sub>4</sub><sup>2−</sup>(25.63%) > Ca<sup>2+</sup> (13.59%) > Cl<sup>−</sup> (12.8%) > Na<sup>+</sup> (5.01%) > pH (0.61%) > Mg<sup>2+</sup> (0%). The parametric analysis revealed that models have incorporated the effect of all the input parameters in the modeling process. The external assessment criteria confirmed the generalized outcome and robustness of the proposed approaches. Conclusively, the outcomes of this study demonstrated that the formulation of AI based models are cost effective and helpful for river water quality assessment, management and policy making.
topic river water quality
sustainable environment
soft computing
regression analysis
total dissolved solids
specific conductivity
url https://www.mdpi.com/2071-1050/13/14/7515
work_keys_str_mv AT muhammadizharshah predictivemodelingapproachforsurfacewaterqualitydevelopmentandcomparisonofmachinelearningmodels
AT wesamsalahalaloul predictivemodelingapproachforsurfacewaterqualitydevelopmentandcomparisonofmachinelearningmodels
AT abdulazizalqahtani predictivemodelingapproachforsurfacewaterqualitydevelopmentandcomparisonofmachinelearningmodels
AT alialdrees predictivemodelingapproachforsurfacewaterqualitydevelopmentandcomparisonofmachinelearningmodels
AT muhammadalimusarat predictivemodelingapproachforsurfacewaterqualitydevelopmentandcomparisonofmachinelearningmodels
AT muhammadfaisaljaved predictivemodelingapproachforsurfacewaterqualitydevelopmentandcomparisonofmachinelearningmodels
_version_ 1721285852791832576