Predictive Modeling Approach for Surface Water Quality: Development and Comparison of Machine Learning Models
Water pollution is an increasing global issue that societies are facing and is threating human health, ecosystem functions and agriculture production. The distinguished features of artificial intelligence (AI) based modeling can deliver a deep insight pertaining to rising water quality concerns. The...
Main Authors: | , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2021-07-01
|
Series: | Sustainability |
Subjects: | |
Online Access: | https://www.mdpi.com/2071-1050/13/14/7515 |
id |
doaj-44669bd7a60f4a76b5efdc2a274a230e |
---|---|
record_format |
Article |
spelling |
doaj-44669bd7a60f4a76b5efdc2a274a230e2021-07-23T14:06:48ZengMDPI AGSustainability2071-10502021-07-01137515751510.3390/su13147515Predictive Modeling Approach for Surface Water Quality: Development and Comparison of Machine Learning ModelsMuhammad Izhar Shah0Wesam Salah Alaloul1Abdulaziz Alqahtani2Ali Aldrees3Muhammad Ali Musarat4Muhammad Faisal Javed5Department of Civil Engineering, COMSATS University Islamabad, Abbottabad Campus, Abbottabad 22060, PakistanDepartment of Civil and Environmental Engineering, Universiti Teknologi PETRONAS, Bander Seri Iskandar 32610, MalaysiaDepartment of Civil Engineering, College of Engineering in Al-Kharj, Prince Sattam Bin Abdulaziz University, Al-Kharj 16273, Saudi ArabiaDepartment of Civil Engineering, College of Engineering in Al-Kharj, Prince Sattam Bin Abdulaziz University, Al-Kharj 16273, Saudi ArabiaDepartment of Civil and Environmental Engineering, Universiti Teknologi PETRONAS, Bander Seri Iskandar 32610, MalaysiaDepartment of Civil Engineering, COMSATS University Islamabad, Abbottabad Campus, Abbottabad 22060, PakistanWater pollution is an increasing global issue that societies are facing and is threating human health, ecosystem functions and agriculture production. The distinguished features of artificial intelligence (AI) based modeling can deliver a deep insight pertaining to rising water quality concerns. The current study investigates the predictive performance of gene expression programming (GEP), artificial neural network (ANN) and linear regression model (LRM) for modeling monthly total dissolved solids (TDS) and specific conductivity (EC) in the upper Indus River at two outlet stations. In total, 30 years of historical water quality data, comprising 360 TDS and EC monthly records, were used for models training and testing. Based on a significant correlation, the TDS and EC modeling were correlated with seven input parameters. Results were evaluated using various performance measure indicators, error assessment and external criteria. The simulated outcome of the models indicated a strong association with actual data where the correlation coefficient above 0.9 was observed for both TDS and EC. Both the GEP and ANN models remained the reliable techniques in predicting TDS and EC. The formulated GEP mathematical equations depict its novelty as compared to ANN and LRM. The results of sensitivity analysis indicated the increasing trend of input variables affecting TDS as HCO<sub>3</sub><sup>−</sup> (22.33%) > Cl<sup>−</sup> (21.66%) > Mg<sup>2+</sup> (16.98%) > Na<sup>+</sup> (14.55%) > Ca<sup>2+</sup> (12.92%) > SO<sub>4</sub><sup>2−</sup> (11.55%) > pH (0%), while, in the case of EC, it followed the trend as HCO<sub>3</sub><sup>−</sup> (42.36%) > SO<sub>4</sub><sup>2−</sup>(25.63%) > Ca<sup>2+</sup> (13.59%) > Cl<sup>−</sup> (12.8%) > Na<sup>+</sup> (5.01%) > pH (0.61%) > Mg<sup>2+</sup> (0%). The parametric analysis revealed that models have incorporated the effect of all the input parameters in the modeling process. The external assessment criteria confirmed the generalized outcome and robustness of the proposed approaches. Conclusively, the outcomes of this study demonstrated that the formulation of AI based models are cost effective and helpful for river water quality assessment, management and policy making.https://www.mdpi.com/2071-1050/13/14/7515river water qualitysustainable environmentsoft computingregression analysistotal dissolved solidsspecific conductivity |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Muhammad Izhar Shah Wesam Salah Alaloul Abdulaziz Alqahtani Ali Aldrees Muhammad Ali Musarat Muhammad Faisal Javed |
spellingShingle |
Muhammad Izhar Shah Wesam Salah Alaloul Abdulaziz Alqahtani Ali Aldrees Muhammad Ali Musarat Muhammad Faisal Javed Predictive Modeling Approach for Surface Water Quality: Development and Comparison of Machine Learning Models Sustainability river water quality sustainable environment soft computing regression analysis total dissolved solids specific conductivity |
author_facet |
Muhammad Izhar Shah Wesam Salah Alaloul Abdulaziz Alqahtani Ali Aldrees Muhammad Ali Musarat Muhammad Faisal Javed |
author_sort |
Muhammad Izhar Shah |
title |
Predictive Modeling Approach for Surface Water Quality: Development and Comparison of Machine Learning Models |
title_short |
Predictive Modeling Approach for Surface Water Quality: Development and Comparison of Machine Learning Models |
title_full |
Predictive Modeling Approach for Surface Water Quality: Development and Comparison of Machine Learning Models |
title_fullStr |
Predictive Modeling Approach for Surface Water Quality: Development and Comparison of Machine Learning Models |
title_full_unstemmed |
Predictive Modeling Approach for Surface Water Quality: Development and Comparison of Machine Learning Models |
title_sort |
predictive modeling approach for surface water quality: development and comparison of machine learning models |
publisher |
MDPI AG |
series |
Sustainability |
issn |
2071-1050 |
publishDate |
2021-07-01 |
description |
Water pollution is an increasing global issue that societies are facing and is threating human health, ecosystem functions and agriculture production. The distinguished features of artificial intelligence (AI) based modeling can deliver a deep insight pertaining to rising water quality concerns. The current study investigates the predictive performance of gene expression programming (GEP), artificial neural network (ANN) and linear regression model (LRM) for modeling monthly total dissolved solids (TDS) and specific conductivity (EC) in the upper Indus River at two outlet stations. In total, 30 years of historical water quality data, comprising 360 TDS and EC monthly records, were used for models training and testing. Based on a significant correlation, the TDS and EC modeling were correlated with seven input parameters. Results were evaluated using various performance measure indicators, error assessment and external criteria. The simulated outcome of the models indicated a strong association with actual data where the correlation coefficient above 0.9 was observed for both TDS and EC. Both the GEP and ANN models remained the reliable techniques in predicting TDS and EC. The formulated GEP mathematical equations depict its novelty as compared to ANN and LRM. The results of sensitivity analysis indicated the increasing trend of input variables affecting TDS as HCO<sub>3</sub><sup>−</sup> (22.33%) > Cl<sup>−</sup> (21.66%) > Mg<sup>2+</sup> (16.98%) > Na<sup>+</sup> (14.55%) > Ca<sup>2+</sup> (12.92%) > SO<sub>4</sub><sup>2−</sup> (11.55%) > pH (0%), while, in the case of EC, it followed the trend as HCO<sub>3</sub><sup>−</sup> (42.36%) > SO<sub>4</sub><sup>2−</sup>(25.63%) > Ca<sup>2+</sup> (13.59%) > Cl<sup>−</sup> (12.8%) > Na<sup>+</sup> (5.01%) > pH (0.61%) > Mg<sup>2+</sup> (0%). The parametric analysis revealed that models have incorporated the effect of all the input parameters in the modeling process. The external assessment criteria confirmed the generalized outcome and robustness of the proposed approaches. Conclusively, the outcomes of this study demonstrated that the formulation of AI based models are cost effective and helpful for river water quality assessment, management and policy making. |
topic |
river water quality sustainable environment soft computing regression analysis total dissolved solids specific conductivity |
url |
https://www.mdpi.com/2071-1050/13/14/7515 |
work_keys_str_mv |
AT muhammadizharshah predictivemodelingapproachforsurfacewaterqualitydevelopmentandcomparisonofmachinelearningmodels AT wesamsalahalaloul predictivemodelingapproachforsurfacewaterqualitydevelopmentandcomparisonofmachinelearningmodels AT abdulazizalqahtani predictivemodelingapproachforsurfacewaterqualitydevelopmentandcomparisonofmachinelearningmodels AT alialdrees predictivemodelingapproachforsurfacewaterqualitydevelopmentandcomparisonofmachinelearningmodels AT muhammadalimusarat predictivemodelingapproachforsurfacewaterqualitydevelopmentandcomparisonofmachinelearningmodels AT muhammadfaisaljaved predictivemodelingapproachforsurfacewaterqualitydevelopmentandcomparisonofmachinelearningmodels |
_version_ |
1721285852791832576 |