Canonical Variable Selection for Ecological Modeling of Fecal Indicators
More than 270,000 km of rivers and streams are impaired due to fecal pathogens, creating an economic and public health burden. Fecal indicator organisms such as Escherichia coli are used to determine if surface waters are pathogen impaired, but they fail to identify human health risks, provide sourc...
Main Authors: | , , , |
---|---|
Published: |
Digital Commons @ East Tennessee State University
2018
|
Subjects: | |
Online Access: | https://dc.etsu.edu/etsu-works/5479 https://doi.org/10.2134/jeq2017.12.0474 |
id |
ndltd-ETSU-oai-dc.etsu.edu-etsu-works-6691 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-ETSU-oai-dc.etsu.edu-etsu-works-66912019-11-01T03:35:20Z Canonical Variable Selection for Ecological Modeling of Fecal Indicators Gilfillan, Dennis Hall, Kimberlee Joyner, Timothy Andrew Scheuerman, Phillip R. More than 270,000 km of rivers and streams are impaired due to fecal pathogens, creating an economic and public health burden. Fecal indicator organisms such as Escherichia coli are used to determine if surface waters are pathogen impaired, but they fail to identify human health risks, provide source information, or have unique fate and transport processes. Statistical and machine learning models can be used to overcome some of these weaknesses, including identifying ecological mechanisms influencing fecal pollution. In this study, canonical correlation analysis (CCorA) was performed to select parameters for the machine learning model, Maxent, to identify how chemical and microbial parameters can predict E. coli impairment and F+-somatic bacteriophage detections. Models were validated using a bootstrapping cross-validation. Three suites of models were developed; initial models using all parameters, models using parameters identified in CCorA, and optimized models after further sensitivity analysis. Canonical correlation analysis reduced the number of parameters needed to achieve the same degree of accuracy in the initial E. coli model (84.7%), and sensitivity analysis improved accuracy to 86.1%. Bacteriophage model accuracies were 79.2, 70.8, and 69.4% for the initial, CCorA, and optimized models, respectively; this suggests complex ecological interactions of bacteriophages are not captured by CCorA. Results indicate distinct ecological drivers of impairment depending on the fecal indicator organism used. Escherichia coli impairment is driven by increased hardness and microbial activity, whereas bacteriophage detection is inhibited by high levels of coliforms in sediment. Both indicators were influenced by organic pollution and phosphorus limitation. 2018-09-20T07:00:00Z text https://dc.etsu.edu/etsu-works/5479 https://doi.org/10.2134/jeq2017.12.0474 ETSU Faculty Works Digital Commons @ East Tennessee State University fecal indicators ecological modeling canonical variable selection Environmental Health Environmental Microbiology and Microbial Ecology Environmental Public Health |
collection |
NDLTD |
sources |
NDLTD |
topic |
fecal indicators ecological modeling canonical variable selection Environmental Health Environmental Microbiology and Microbial Ecology Environmental Public Health |
spellingShingle |
fecal indicators ecological modeling canonical variable selection Environmental Health Environmental Microbiology and Microbial Ecology Environmental Public Health Gilfillan, Dennis Hall, Kimberlee Joyner, Timothy Andrew Scheuerman, Phillip R. Canonical Variable Selection for Ecological Modeling of Fecal Indicators |
description |
More than 270,000 km of rivers and streams are impaired due to fecal pathogens, creating an economic and public health burden. Fecal indicator organisms such as Escherichia coli are used to determine if surface waters are pathogen impaired, but they fail to identify human health risks, provide source information, or have unique fate and transport processes. Statistical and machine learning models can be used to overcome some of these weaknesses, including identifying ecological mechanisms influencing fecal pollution. In this study, canonical correlation analysis (CCorA) was performed to select parameters for the machine learning model, Maxent, to identify how chemical and microbial parameters can predict E. coli impairment and F+-somatic bacteriophage detections. Models were validated using a bootstrapping cross-validation. Three suites of models were developed; initial models using all parameters, models using parameters identified in CCorA, and optimized models after further sensitivity analysis. Canonical correlation analysis reduced the number of parameters needed to achieve the same degree of accuracy in the initial E. coli model (84.7%), and sensitivity analysis improved accuracy to 86.1%. Bacteriophage model accuracies were 79.2, 70.8, and 69.4% for the initial, CCorA, and optimized models, respectively; this suggests complex ecological interactions of bacteriophages are not captured by CCorA. Results indicate distinct ecological drivers of impairment depending on the fecal indicator organism used. Escherichia coli impairment is driven by increased hardness and microbial activity, whereas bacteriophage detection is inhibited by high levels of coliforms in sediment. Both indicators were influenced by organic pollution and phosphorus limitation. |
author |
Gilfillan, Dennis Hall, Kimberlee Joyner, Timothy Andrew Scheuerman, Phillip R. |
author_facet |
Gilfillan, Dennis Hall, Kimberlee Joyner, Timothy Andrew Scheuerman, Phillip R. |
author_sort |
Gilfillan, Dennis |
title |
Canonical Variable Selection for Ecological Modeling of Fecal Indicators |
title_short |
Canonical Variable Selection for Ecological Modeling of Fecal Indicators |
title_full |
Canonical Variable Selection for Ecological Modeling of Fecal Indicators |
title_fullStr |
Canonical Variable Selection for Ecological Modeling of Fecal Indicators |
title_full_unstemmed |
Canonical Variable Selection for Ecological Modeling of Fecal Indicators |
title_sort |
canonical variable selection for ecological modeling of fecal indicators |
publisher |
Digital Commons @ East Tennessee State University |
publishDate |
2018 |
url |
https://dc.etsu.edu/etsu-works/5479 https://doi.org/10.2134/jeq2017.12.0474 |
work_keys_str_mv |
AT gilfillandennis canonicalvariableselectionforecologicalmodelingoffecalindicators AT hallkimberlee canonicalvariableselectionforecologicalmodelingoffecalindicators AT joynertimothyandrew canonicalvariableselectionforecologicalmodelingoffecalindicators AT scheuermanphillipr canonicalvariableselectionforecologicalmodelingoffecalindicators |
_version_ |
1719284755204145152 |