Canonical Variable Selection for Ecological Modeling of Fecal Indicators

More than 270,000 km of rivers and streams are impaired due to fecal pathogens, creating an economic and public health burden. Fecal indicator organisms such as Escherichia coli are used to determine if surface waters are pathogen impaired, but they fail to identify human health risks, provide sourc...

Full description

Bibliographic Details
Main Authors: Gilfillan, Dennis, Hall, Kimberlee, Joyner, Timothy Andrew, Scheuerman, Phillip R.
Published: Digital Commons @ East Tennessee State University 2018
Subjects:
Online Access:https://dc.etsu.edu/etsu-works/5479
https://doi.org/10.2134/jeq2017.12.0474
id ndltd-ETSU-oai-dc.etsu.edu-etsu-works-6691
record_format oai_dc
spelling ndltd-ETSU-oai-dc.etsu.edu-etsu-works-66912019-11-01T03:35:20Z Canonical Variable Selection for Ecological Modeling of Fecal Indicators Gilfillan, Dennis Hall, Kimberlee Joyner, Timothy Andrew Scheuerman, Phillip R. More than 270,000 km of rivers and streams are impaired due to fecal pathogens, creating an economic and public health burden. Fecal indicator organisms such as Escherichia coli are used to determine if surface waters are pathogen impaired, but they fail to identify human health risks, provide source information, or have unique fate and transport processes. Statistical and machine learning models can be used to overcome some of these weaknesses, including identifying ecological mechanisms influencing fecal pollution. In this study, canonical correlation analysis (CCorA) was performed to select parameters for the machine learning model, Maxent, to identify how chemical and microbial parameters can predict E. coli impairment and F+-somatic bacteriophage detections. Models were validated using a bootstrapping cross-validation. Three suites of models were developed; initial models using all parameters, models using parameters identified in CCorA, and optimized models after further sensitivity analysis. Canonical correlation analysis reduced the number of parameters needed to achieve the same degree of accuracy in the initial E. coli model (84.7%), and sensitivity analysis improved accuracy to 86.1%. Bacteriophage model accuracies were 79.2, 70.8, and 69.4% for the initial, CCorA, and optimized models, respectively; this suggests complex ecological interactions of bacteriophages are not captured by CCorA. Results indicate distinct ecological drivers of impairment depending on the fecal indicator organism used. Escherichia coli impairment is driven by increased hardness and microbial activity, whereas bacteriophage detection is inhibited by high levels of coliforms in sediment. Both indicators were influenced by organic pollution and phosphorus limitation. 2018-09-20T07:00:00Z text https://dc.etsu.edu/etsu-works/5479 https://doi.org/10.2134/jeq2017.12.0474 ETSU Faculty Works Digital Commons @ East Tennessee State University fecal indicators ecological modeling canonical variable selection Environmental Health Environmental Microbiology and Microbial Ecology Environmental Public Health
collection NDLTD
sources NDLTD
topic fecal indicators
ecological modeling
canonical variable selection
Environmental Health
Environmental Microbiology and Microbial Ecology
Environmental Public Health
spellingShingle fecal indicators
ecological modeling
canonical variable selection
Environmental Health
Environmental Microbiology and Microbial Ecology
Environmental Public Health
Gilfillan, Dennis
Hall, Kimberlee
Joyner, Timothy Andrew
Scheuerman, Phillip R.
Canonical Variable Selection for Ecological Modeling of Fecal Indicators
description More than 270,000 km of rivers and streams are impaired due to fecal pathogens, creating an economic and public health burden. Fecal indicator organisms such as Escherichia coli are used to determine if surface waters are pathogen impaired, but they fail to identify human health risks, provide source information, or have unique fate and transport processes. Statistical and machine learning models can be used to overcome some of these weaknesses, including identifying ecological mechanisms influencing fecal pollution. In this study, canonical correlation analysis (CCorA) was performed to select parameters for the machine learning model, Maxent, to identify how chemical and microbial parameters can predict E. coli impairment and F+-somatic bacteriophage detections. Models were validated using a bootstrapping cross-validation. Three suites of models were developed; initial models using all parameters, models using parameters identified in CCorA, and optimized models after further sensitivity analysis. Canonical correlation analysis reduced the number of parameters needed to achieve the same degree of accuracy in the initial E. coli model (84.7%), and sensitivity analysis improved accuracy to 86.1%. Bacteriophage model accuracies were 79.2, 70.8, and 69.4% for the initial, CCorA, and optimized models, respectively; this suggests complex ecological interactions of bacteriophages are not captured by CCorA. Results indicate distinct ecological drivers of impairment depending on the fecal indicator organism used. Escherichia coli impairment is driven by increased hardness and microbial activity, whereas bacteriophage detection is inhibited by high levels of coliforms in sediment. Both indicators were influenced by organic pollution and phosphorus limitation.
author Gilfillan, Dennis
Hall, Kimberlee
Joyner, Timothy Andrew
Scheuerman, Phillip R.
author_facet Gilfillan, Dennis
Hall, Kimberlee
Joyner, Timothy Andrew
Scheuerman, Phillip R.
author_sort Gilfillan, Dennis
title Canonical Variable Selection for Ecological Modeling of Fecal Indicators
title_short Canonical Variable Selection for Ecological Modeling of Fecal Indicators
title_full Canonical Variable Selection for Ecological Modeling of Fecal Indicators
title_fullStr Canonical Variable Selection for Ecological Modeling of Fecal Indicators
title_full_unstemmed Canonical Variable Selection for Ecological Modeling of Fecal Indicators
title_sort canonical variable selection for ecological modeling of fecal indicators
publisher Digital Commons @ East Tennessee State University
publishDate 2018
url https://dc.etsu.edu/etsu-works/5479
https://doi.org/10.2134/jeq2017.12.0474
work_keys_str_mv AT gilfillandennis canonicalvariableselectionforecologicalmodelingoffecalindicators
AT hallkimberlee canonicalvariableselectionforecologicalmodelingoffecalindicators
AT joynertimothyandrew canonicalvariableselectionforecologicalmodelingoffecalindicators
AT scheuermanphillipr canonicalvariableselectionforecologicalmodelingoffecalindicators
_version_ 1719284755204145152