Model-based control of observer bias for the analysis of presence-only data in ecology.

Presence-only data, where information is available concerning species presence but not species absence, are subject to bias due to observers being more likely to visit and record sightings at some locations than others (hereafter "observer bias"). In this paper, we describe and evaluate a...

Full description

Bibliographic Details
Main Authors: David I Warton, Ian W Renner, Daniel Ramp
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2013-01-01
Series:PLoS ONE
Online Access:http://europepmc.org/articles/PMC3832482?pdf=render
id doaj-aad622a4f4864ee5851c8f432ba3794b
record_format Article
spelling doaj-aad622a4f4864ee5851c8f432ba3794b2020-11-25T02:31:31ZengPublic Library of Science (PLoS)PLoS ONE1932-62032013-01-01811e7916810.1371/journal.pone.0079168Model-based control of observer bias for the analysis of presence-only data in ecology.David I WartonIan W RennerDaniel RampPresence-only data, where information is available concerning species presence but not species absence, are subject to bias due to observers being more likely to visit and record sightings at some locations than others (hereafter "observer bias"). In this paper, we describe and evaluate a model-based approach to accounting for observer bias directly--by modelling presence locations as a function of known observer bias variables (such as accessibility variables) in addition to environmental variables, then conditioning on a common level of bias to make predictions of species occurrence free of such observer bias. We implement this idea using point process models with a LASSO penalty, a new presence-only method related to maximum entropy modelling, that implicitly addresses the "pseudo-absence problem" of where to locate pseudo-absences (and how many). The proposed method of bias-correction is evaluated using systematically collected presence/absence data for 62 plant species endemic to the Blue Mountains near Sydney, Australia. It is shown that modelling and controlling for observer bias significantly improves the accuracy of predictions made using presence-only data, and usually improves predictions as compared to pseudo-absence or "inventory" methods of bias correction based on absences from non-target species. Future research will consider the potential for improving the proposed bias-correction approach by estimating the observer bias simultaneously across multiple species.http://europepmc.org/articles/PMC3832482?pdf=render
collection DOAJ
language English
format Article
sources DOAJ
author David I Warton
Ian W Renner
Daniel Ramp
spellingShingle David I Warton
Ian W Renner
Daniel Ramp
Model-based control of observer bias for the analysis of presence-only data in ecology.
PLoS ONE
author_facet David I Warton
Ian W Renner
Daniel Ramp
author_sort David I Warton
title Model-based control of observer bias for the analysis of presence-only data in ecology.
title_short Model-based control of observer bias for the analysis of presence-only data in ecology.
title_full Model-based control of observer bias for the analysis of presence-only data in ecology.
title_fullStr Model-based control of observer bias for the analysis of presence-only data in ecology.
title_full_unstemmed Model-based control of observer bias for the analysis of presence-only data in ecology.
title_sort model-based control of observer bias for the analysis of presence-only data in ecology.
publisher Public Library of Science (PLoS)
series PLoS ONE
issn 1932-6203
publishDate 2013-01-01
description Presence-only data, where information is available concerning species presence but not species absence, are subject to bias due to observers being more likely to visit and record sightings at some locations than others (hereafter "observer bias"). In this paper, we describe and evaluate a model-based approach to accounting for observer bias directly--by modelling presence locations as a function of known observer bias variables (such as accessibility variables) in addition to environmental variables, then conditioning on a common level of bias to make predictions of species occurrence free of such observer bias. We implement this idea using point process models with a LASSO penalty, a new presence-only method related to maximum entropy modelling, that implicitly addresses the "pseudo-absence problem" of where to locate pseudo-absences (and how many). The proposed method of bias-correction is evaluated using systematically collected presence/absence data for 62 plant species endemic to the Blue Mountains near Sydney, Australia. It is shown that modelling and controlling for observer bias significantly improves the accuracy of predictions made using presence-only data, and usually improves predictions as compared to pseudo-absence or "inventory" methods of bias correction based on absences from non-target species. Future research will consider the potential for improving the proposed bias-correction approach by estimating the observer bias simultaneously across multiple species.
url http://europepmc.org/articles/PMC3832482?pdf=render
work_keys_str_mv AT davidiwarton modelbasedcontrolofobserverbiasfortheanalysisofpresenceonlydatainecology
AT ianwrenner modelbasedcontrolofobserverbiasfortheanalysisofpresenceonlydatainecology
AT danielramp modelbasedcontrolofobserverbiasfortheanalysisofpresenceonlydatainecology
_version_ 1724823969380433920