The National Eutrophication Survey: lake characteristics and historical nutrient concentrations

Historical ecological surveys serve as a baseline and provide context for contemporary research, yet many of these records are not preserved in a way that ensures their long-term usability. The National Eutrophication Survey (NES) database is currently only available as scans of the original rep...

Full description

Bibliographic Details
Main Authors: J. Stachelek, C. Ford, D. Kincaid, K. King, H. Miller, R. Nagelkirk
Format: Article
Language:English
Published: Copernicus Publications 2018-01-01
Series:Earth System Science Data
Online Access:https://www.earth-syst-sci-data.net/10/81/2018/essd-10-81-2018.pdf
id doaj-dd4aa8d10bcb473cad24b78dd361546f
record_format Article
spelling doaj-dd4aa8d10bcb473cad24b78dd361546f2020-11-24T23:26:44ZengCopernicus PublicationsEarth System Science Data1866-35081866-35162018-01-0110818610.5194/essd-10-81-2018The National Eutrophication Survey: lake characteristics and historical nutrient concentrationsJ. Stachelek0C. Ford1D. Kincaid2D. Kincaid3K. King4H. Miller5R. Nagelkirk6Department of Fisheries and Wildlife, Michigan State University, East Lansing, MI, USADepartment of Earth and Environmental Sciences, Michigan State University, East Lansing, MI, USADepartment of Integrative Biology, Michigan State University, East Lansing, MI, USAW.K. Kellogg Biological Station, Michigan State University, Hickory Corners, MI, USADepartment of Fisheries and Wildlife, Michigan State University, East Lansing, MI, USADepartment of Microbiology and Molecular Genetics, Michigan State University, East Lansing, MI, USADepartment of Geography, Environment, and Spatial Sciences, Michigan State University, East Lansing, MI, USAHistorical ecological surveys serve as a baseline and provide context for contemporary research, yet many of these records are not preserved in a way that ensures their long-term usability. The National Eutrophication Survey (NES) database is currently only available as scans of the original reports (PDF files) with no embedded character information. This limits its searchability, machine readability, and the ability of current and future scientists to systematically evaluate its contents. The NES data were collected by the US Environmental Protection Agency between 1972 and 1975 as part of an effort to investigate eutrophication in freshwater lakes and reservoirs. Although several studies have manually transcribed small portions of the database in support of specific studies, there have been no systematic attempts to transcribe and preserve the database in its entirety. Here we use a combination of automated optical character recognition and manual quality assurance procedures to make these data available for analysis. The performance of the optical character recognition protocol was found to be linked to variation in the quality (clarity) of the original documents. For each of the four archival scanned reports, our quality assurance protocol found an error rate between 5.9 and 17 %. The goal of our approach was to strike a balance between efficiency and data quality by combining entry of data by hand with digital transcription technologies. The finished database contains information on the physical characteristics, hydrology, and water quality of about 800 lakes in the contiguous US (Stachelek et al.(2017), <a href="https://doi.org/10.5063/F1639MVD" target="_blank">https://doi.org/10.5063/F1639MVD</a>). Ultimately, this database could be combined with more recent studies to generate meta-analyses of water quality trends and spatial variation across the continental US.https://www.earth-syst-sci-data.net/10/81/2018/essd-10-81-2018.pdf
collection DOAJ
language English
format Article
sources DOAJ
author J. Stachelek
C. Ford
D. Kincaid
D. Kincaid
K. King
H. Miller
R. Nagelkirk
spellingShingle J. Stachelek
C. Ford
D. Kincaid
D. Kincaid
K. King
H. Miller
R. Nagelkirk
The National Eutrophication Survey: lake characteristics and historical nutrient concentrations
Earth System Science Data
author_facet J. Stachelek
C. Ford
D. Kincaid
D. Kincaid
K. King
H. Miller
R. Nagelkirk
author_sort J. Stachelek
title The National Eutrophication Survey: lake characteristics and historical nutrient concentrations
title_short The National Eutrophication Survey: lake characteristics and historical nutrient concentrations
title_full The National Eutrophication Survey: lake characteristics and historical nutrient concentrations
title_fullStr The National Eutrophication Survey: lake characteristics and historical nutrient concentrations
title_full_unstemmed The National Eutrophication Survey: lake characteristics and historical nutrient concentrations
title_sort national eutrophication survey: lake characteristics and historical nutrient concentrations
publisher Copernicus Publications
series Earth System Science Data
issn 1866-3508
1866-3516
publishDate 2018-01-01
description Historical ecological surveys serve as a baseline and provide context for contemporary research, yet many of these records are not preserved in a way that ensures their long-term usability. The National Eutrophication Survey (NES) database is currently only available as scans of the original reports (PDF files) with no embedded character information. This limits its searchability, machine readability, and the ability of current and future scientists to systematically evaluate its contents. The NES data were collected by the US Environmental Protection Agency between 1972 and 1975 as part of an effort to investigate eutrophication in freshwater lakes and reservoirs. Although several studies have manually transcribed small portions of the database in support of specific studies, there have been no systematic attempts to transcribe and preserve the database in its entirety. Here we use a combination of automated optical character recognition and manual quality assurance procedures to make these data available for analysis. The performance of the optical character recognition protocol was found to be linked to variation in the quality (clarity) of the original documents. For each of the four archival scanned reports, our quality assurance protocol found an error rate between 5.9 and 17 %. The goal of our approach was to strike a balance between efficiency and data quality by combining entry of data by hand with digital transcription technologies. The finished database contains information on the physical characteristics, hydrology, and water quality of about 800 lakes in the contiguous US (Stachelek et al.(2017), <a href="https://doi.org/10.5063/F1639MVD" target="_blank">https://doi.org/10.5063/F1639MVD</a>). Ultimately, this database could be combined with more recent studies to generate meta-analyses of water quality trends and spatial variation across the continental US.
url https://www.earth-syst-sci-data.net/10/81/2018/essd-10-81-2018.pdf
work_keys_str_mv AT jstachelek thenationaleutrophicationsurveylakecharacteristicsandhistoricalnutrientconcentrations
AT cford thenationaleutrophicationsurveylakecharacteristicsandhistoricalnutrientconcentrations
AT dkincaid thenationaleutrophicationsurveylakecharacteristicsandhistoricalnutrientconcentrations
AT dkincaid thenationaleutrophicationsurveylakecharacteristicsandhistoricalnutrientconcentrations
AT kking thenationaleutrophicationsurveylakecharacteristicsandhistoricalnutrientconcentrations
AT hmiller thenationaleutrophicationsurveylakecharacteristicsandhistoricalnutrientconcentrations
AT rnagelkirk thenationaleutrophicationsurveylakecharacteristicsandhistoricalnutrientconcentrations
AT jstachelek nationaleutrophicationsurveylakecharacteristicsandhistoricalnutrientconcentrations
AT cford nationaleutrophicationsurveylakecharacteristicsandhistoricalnutrientconcentrations
AT dkincaid nationaleutrophicationsurveylakecharacteristicsandhistoricalnutrientconcentrations
AT dkincaid nationaleutrophicationsurveylakecharacteristicsandhistoricalnutrientconcentrations
AT kking nationaleutrophicationsurveylakecharacteristicsandhistoricalnutrientconcentrations
AT hmiller nationaleutrophicationsurveylakecharacteristicsandhistoricalnutrientconcentrations
AT rnagelkirk nationaleutrophicationsurveylakecharacteristicsandhistoricalnutrientconcentrations
_version_ 1725553813964718080