Comparison of tests for spatial heterogeneity on data with global clustering patterns and outliers

<p>Abstract</p> <p>Background</p> <p>The ability to evaluate geographic heterogeneity of cancer incidence and mortality is important in cancer surveillance. Many statistical methods for evaluating global clustering and local cluster patterns are developed and have been...

Full description

Bibliographic Details
Main Authors: Hachey Mark, Luo Jun, Huang Lan, Jackson Monica C, Feuer Eric
Format: Article
Language:English
Published: BMC 2009-10-01
Series:International Journal of Health Geographics
Online Access:http://www.ij-healthgeographics.com/content/8/1/55
id doaj-1bbf049c769d473f98ff0acfc967dafe
record_format Article
spelling doaj-1bbf049c769d473f98ff0acfc967dafe2020-11-25T01:29:47ZengBMCInternational Journal of Health Geographics1476-072X2009-10-01815510.1186/1476-072X-8-55Comparison of tests for spatial heterogeneity on data with global clustering patterns and outliersHachey MarkLuo JunHuang LanJackson Monica CFeuer Eric<p>Abstract</p> <p>Background</p> <p>The ability to evaluate geographic heterogeneity of cancer incidence and mortality is important in cancer surveillance. Many statistical methods for evaluating global clustering and local cluster patterns are developed and have been examined by many simulation studies. However, the performance of these methods on two extreme cases (global clustering evaluation and local anomaly (outlier) detection) has not been thoroughly investigated.</p> <p>Methods</p> <p>We compare methods for global clustering evaluation including Tango's Index, Moran's <it>I</it>, and Oden's <it>I</it>*<sub><it>pop</it></sub>; and cluster detection methods such as local Moran's <it>I </it>and SaTScan elliptic version on simulated count data that mimic global clustering patterns and outliers for cancer cases in the continental United States. We examine the power and precision of the selected methods in the purely spatial analysis. We illustrate Tango's MEET and SaTScan elliptic version on a 1987-2004 HIV and a 1950-1969 lung cancer mortality data in the United States.</p> <p>Results</p> <p>For simulated data with outlier patterns, Tango's MEET, Moran's <it>I </it>and <it>I</it>*<sub><it>pop </it></sub>had powers less than 0.2, and SaTScan had powers around 0.97. For simulated data with global clustering patterns, Tango's MEET and <it>I</it>*<sub><it>pop </it></sub>(with 50% of total population as the maximum search window) had powers close to 1. SaTScan had powers around 0.7-0.8 and Moran's <it>I </it>has powers around 0.2-0.3. In the real data example, Tango's MEET indicated the existence of global clustering patterns in both the HIV and lung cancer mortality data. SaTScan found a large cluster for HIV mortality rates, which is consistent with the finding from Tango's MEET. SaTScan also found clusters and outliers in the lung cancer mortality data.</p> <p>Conclusion</p> <p>SaTScan elliptic version is more efficient for outlier detection compared with the other methods evaluated in this article. Tango's MEET and Oden's <it>I</it>*<sub><it>pop </it></sub>perform best in global clustering scenarios among the selected methods. The use of SaTScan for data with global clustering patterns should be used with caution since SatScan may reveal an incorrect spatial pattern even though it has enough power to reject a null hypothesis of homogeneous relative risk. Tango's method should be used for global clustering evaluation instead of SaTScan.</p> http://www.ij-healthgeographics.com/content/8/1/55
collection DOAJ
language English
format Article
sources DOAJ
author Hachey Mark
Luo Jun
Huang Lan
Jackson Monica C
Feuer Eric
spellingShingle Hachey Mark
Luo Jun
Huang Lan
Jackson Monica C
Feuer Eric
Comparison of tests for spatial heterogeneity on data with global clustering patterns and outliers
International Journal of Health Geographics
author_facet Hachey Mark
Luo Jun
Huang Lan
Jackson Monica C
Feuer Eric
author_sort Hachey Mark
title Comparison of tests for spatial heterogeneity on data with global clustering patterns and outliers
title_short Comparison of tests for spatial heterogeneity on data with global clustering patterns and outliers
title_full Comparison of tests for spatial heterogeneity on data with global clustering patterns and outliers
title_fullStr Comparison of tests for spatial heterogeneity on data with global clustering patterns and outliers
title_full_unstemmed Comparison of tests for spatial heterogeneity on data with global clustering patterns and outliers
title_sort comparison of tests for spatial heterogeneity on data with global clustering patterns and outliers
publisher BMC
series International Journal of Health Geographics
issn 1476-072X
publishDate 2009-10-01
description <p>Abstract</p> <p>Background</p> <p>The ability to evaluate geographic heterogeneity of cancer incidence and mortality is important in cancer surveillance. Many statistical methods for evaluating global clustering and local cluster patterns are developed and have been examined by many simulation studies. However, the performance of these methods on two extreme cases (global clustering evaluation and local anomaly (outlier) detection) has not been thoroughly investigated.</p> <p>Methods</p> <p>We compare methods for global clustering evaluation including Tango's Index, Moran's <it>I</it>, and Oden's <it>I</it>*<sub><it>pop</it></sub>; and cluster detection methods such as local Moran's <it>I </it>and SaTScan elliptic version on simulated count data that mimic global clustering patterns and outliers for cancer cases in the continental United States. We examine the power and precision of the selected methods in the purely spatial analysis. We illustrate Tango's MEET and SaTScan elliptic version on a 1987-2004 HIV and a 1950-1969 lung cancer mortality data in the United States.</p> <p>Results</p> <p>For simulated data with outlier patterns, Tango's MEET, Moran's <it>I </it>and <it>I</it>*<sub><it>pop </it></sub>had powers less than 0.2, and SaTScan had powers around 0.97. For simulated data with global clustering patterns, Tango's MEET and <it>I</it>*<sub><it>pop </it></sub>(with 50% of total population as the maximum search window) had powers close to 1. SaTScan had powers around 0.7-0.8 and Moran's <it>I </it>has powers around 0.2-0.3. In the real data example, Tango's MEET indicated the existence of global clustering patterns in both the HIV and lung cancer mortality data. SaTScan found a large cluster for HIV mortality rates, which is consistent with the finding from Tango's MEET. SaTScan also found clusters and outliers in the lung cancer mortality data.</p> <p>Conclusion</p> <p>SaTScan elliptic version is more efficient for outlier detection compared with the other methods evaluated in this article. Tango's MEET and Oden's <it>I</it>*<sub><it>pop </it></sub>perform best in global clustering scenarios among the selected methods. The use of SaTScan for data with global clustering patterns should be used with caution since SatScan may reveal an incorrect spatial pattern even though it has enough power to reject a null hypothesis of homogeneous relative risk. Tango's method should be used for global clustering evaluation instead of SaTScan.</p>
url http://www.ij-healthgeographics.com/content/8/1/55
work_keys_str_mv AT hacheymark comparisonoftestsforspatialheterogeneityondatawithglobalclusteringpatternsandoutliers
AT luojun comparisonoftestsforspatialheterogeneityondatawithglobalclusteringpatternsandoutliers
AT huanglan comparisonoftestsforspatialheterogeneityondatawithglobalclusteringpatternsandoutliers
AT jacksonmonicac comparisonoftestsforspatialheterogeneityondatawithglobalclusteringpatternsandoutliers
AT feuereric comparisonoftestsforspatialheterogeneityondatawithglobalclusteringpatternsandoutliers
_version_ 1725094739100827648