Methods for estimating population density in data-limited areas: evaluating regression and tree-based models in Peru.
Obtaining accurate small area estimates of population is essential for policy and health planning but is often difficult in countries with limited data. In lieu of available population data, small area estimate models draw information from previous time periods or from similar areas. This study focu...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Public Library of Science (PLoS)
2014-01-01
|
Series: | PLoS ONE |
Online Access: | http://europepmc.org/articles/PMC4081515?pdf=render |
id |
doaj-3ab27644afa9457ab11677fc2d91ca35 |
---|---|
record_format |
Article |
spelling |
doaj-3ab27644afa9457ab11677fc2d91ca352020-11-25T02:04:18ZengPublic Library of Science (PLoS)PLoS ONE1932-62032014-01-0197e10003710.1371/journal.pone.0100037Methods for estimating population density in data-limited areas: evaluating regression and tree-based models in Peru.Weston AndersonSeth GuikemaBen ZaitchikWilliam PanObtaining accurate small area estimates of population is essential for policy and health planning but is often difficult in countries with limited data. In lieu of available population data, small area estimate models draw information from previous time periods or from similar areas. This study focuses on model-based methods for estimating population when no direct samples are available in the area of interest. To explore the efficacy of tree-based models for estimating population density, we compare six different model structures including Random Forest and Bayesian Additive Regression Trees. Results demonstrate that without information from prior time periods, non-parametric tree-based models produced more accurate predictions than did conventional regression methods. Improving estimates of population density in non-sampled areas is important for regions with incomplete census data and has implications for economic, health and development policies.http://europepmc.org/articles/PMC4081515?pdf=render |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Weston Anderson Seth Guikema Ben Zaitchik William Pan |
spellingShingle |
Weston Anderson Seth Guikema Ben Zaitchik William Pan Methods for estimating population density in data-limited areas: evaluating regression and tree-based models in Peru. PLoS ONE |
author_facet |
Weston Anderson Seth Guikema Ben Zaitchik William Pan |
author_sort |
Weston Anderson |
title |
Methods for estimating population density in data-limited areas: evaluating regression and tree-based models in Peru. |
title_short |
Methods for estimating population density in data-limited areas: evaluating regression and tree-based models in Peru. |
title_full |
Methods for estimating population density in data-limited areas: evaluating regression and tree-based models in Peru. |
title_fullStr |
Methods for estimating population density in data-limited areas: evaluating regression and tree-based models in Peru. |
title_full_unstemmed |
Methods for estimating population density in data-limited areas: evaluating regression and tree-based models in Peru. |
title_sort |
methods for estimating population density in data-limited areas: evaluating regression and tree-based models in peru. |
publisher |
Public Library of Science (PLoS) |
series |
PLoS ONE |
issn |
1932-6203 |
publishDate |
2014-01-01 |
description |
Obtaining accurate small area estimates of population is essential for policy and health planning but is often difficult in countries with limited data. In lieu of available population data, small area estimate models draw information from previous time periods or from similar areas. This study focuses on model-based methods for estimating population when no direct samples are available in the area of interest. To explore the efficacy of tree-based models for estimating population density, we compare six different model structures including Random Forest and Bayesian Additive Regression Trees. Results demonstrate that without information from prior time periods, non-parametric tree-based models produced more accurate predictions than did conventional regression methods. Improving estimates of population density in non-sampled areas is important for regions with incomplete census data and has implications for economic, health and development policies. |
url |
http://europepmc.org/articles/PMC4081515?pdf=render |
work_keys_str_mv |
AT westonanderson methodsforestimatingpopulationdensityindatalimitedareasevaluatingregressionandtreebasedmodelsinperu AT sethguikema methodsforestimatingpopulationdensityindatalimitedareasevaluatingregressionandtreebasedmodelsinperu AT benzaitchik methodsforestimatingpopulationdensityindatalimitedareasevaluatingregressionandtreebasedmodelsinperu AT williampan methodsforestimatingpopulationdensityindatalimitedareasevaluatingregressionandtreebasedmodelsinperu |
_version_ |
1724943179641257984 |