1,290 research outputs found
Recommended from our members
Power evaluation of disease clustering tests
BACKGROUND: Many different test statistics have been proposed to test for spatial clustering. Some of these statistics have been widely used in various applications. In this paper, we use an existing collection of 1,220,000 simulated benchmark data, generated under 51 different clustering models, to compare the statistical power of several disease clustering tests. These tests are Besag-Newell's R, Cuzick-Edwards' k-Nearest Neighbors (k-NN), the spatial scan statistic, Tango's Maximized Excess Events Test (MEET), Swartz' entropy test, Whittemore's test, Moran's I and a modification of Moran's I. RESULTS: Except for Moran's I and Whittemore's test, all other tests have good power for detecting some kind of clustering. The spatial scan statistic is good at detecting localized clusters. Tango's MEET is good at detecting global clustering. With appropriate choice of parameter, Besag-Newell's R and Cuzick-Edwards' k-NN also perform well. CONCLUSION: The power varies greatly for different test statistics and alternative clustering models. Consideration of the power is important before we decide which test statistic to use
Dead Bird Clusters as an Early Warning System for West Nile Virus Activity
An early warning system for West Nile virus (WNV) outbreaks could provide a basis for targeted public education and surveillance activities as well as more timely larval and adult mosquito control. We adapted the spatial scan statistic for prospective detection of infectious disease outbreaks, applied the results to data on dead birds reported from New York City in 2000, and reviewed its utility in providing an early warning of WNV activity in 2001. Prospective geographic cluster analysis of dead bird reports may provide early warning of increasing viral activity in birds and mosquitoes, allowing jurisdictions to triage limited mosquito-collection and laboratory resources and more effectively prevent human disease caused by the virus. This adaptation of the scan statistic could also be useful in other infectious disease surveillance systems, including that for bioterrorism
Recommended from our members
Tango's maximized excess events test with different weights
BACKGROUND: Tango's maximized excess events test (MEET) has been shown to have very good statistical power in detecting global disease clustering. A nice feature of this test is that it considers a range of spatial scale parameters, adjusting for the multiple testing. This means that it has good power to detect a wide range of clustering processes. The test depends on the functional form of a weight function, and it is unknown how sensitive the test is to the choice of this weight function and what function provides optimal power for different clustering processes. In this study, we evaluate the performance of the test for a wide range of weight functions. RESULTS: The power varies greatly with different choice of weight. Tango's original choice for the weight function works very well. There are also other weight functions that provide good power. CONCLUSION: We recommend the use of Tango's MEET to test global disease clustering, either with the original weight or one of the alternate weights that have good power
Maximum linkage space-time permutation scan statistics for disease outbreak detection
Background: In disease surveillance, the prospective space-time permutation scan statistic is commonly used for the early detection of disease outbreaks. The scanning window that defines potential clusters of diseases is cylindrical in shape, which does not allow incorporating into the cluster shape potential factors that can contribute to the spread of the disease, such as information about roads, landscape, among others. Furthermore, the cylinder scanning window assumes that the spatial extent of the cluster does not change in time. Alternatively, a dynamic space-time cluster may indicate the potential spread of the disease through time. For instance, the cluster may decrease over time indicating that the spread of the disease is vanishing. Methods: This paper proposes two irregularly shaped space-time permutation scan statistics. The cluster geometry is dynamically created using a graph structure. The graph can be created to include nearest-neighbor structures, geographical adjacency information or any relevant prior information regarding the contagious behavior of the event under surveillance. Results: The new methods are illustrated using influenza cases in three New England states, and compared with the cylindrical version. A simulation study is provided to investigate some properties of the proposed arbitrary cluster detection techniques. Conclusion: We have successfully developed two new space-time permutation scan statistics methods with irregular shapes and improved computational performance. The results demonstrate the potential of these methods to quickly detect disease outbreaks with irregular geometries. Future work aims at performing intensive simulation studies to evaluate the proposed methods using different scenarios, number of cases, and graph structures
Recommended from our members
Influence of Spatial Resolution on Space-Time Disease Cluster Detection
Background: Utilizing highly precise spatial resolutions within disease outbreak detection, such as the patients’ address, is most desirable as this provides the actual residential location of the infected individual(s). However, this level of precision is not always readily available or only available for purchase, and when utilized, increases the risk of exposing protected health information. Aggregating data to less precise scales (e.g., ZIP code or county centroids) may mitigate this risk but at the expense of potentially masking smaller isolated high risk areas. Methods: To experimentally examine the effect of spatial data resolution on space-time cluster detection, we extracted administrative medical claims data for 122500 viral lung episodes occurring during 2007–2010 in Tennessee. We generated 10000 spatial datasets with varying cluster location, size and intensity at the address-level. To represent spatial data aggregation (i.e., reduced resolution), we then created 10000 corresponding datasets both at the ZIP code and county level for a total of 30000 datasets. Using the space-time permutation scan statistic and the SaTScan™ cluster software, we evaluated statistical power, sensitivity and positive predictive values of outbreak detection when using exact address locations compared to ZIP code and county level aggregations. Results: The power to detect disease outbreaks did not largely diminish when using spatially aggregated data compared to more precise address information. However, aggregations negatively impacted the ability to more accurately determine the exact spatial location of the outbreak, particularly in smaller clusters (<800 km2). Conclusions: Spatial aggregations do not necessitate a loss of power or sensitivity; rather, the relationship is more complex and involves simultaneously considering relative risk within the cluster and cluster size. The likelihood of spatially over-estimating outbreaks by including geographical areas outside the actual disease cluster increases with aggregated data
Under-five mortality: spatial-temporal clusters in Ifakara HDSS in South-eastern Tanzania.
BACKGROUND\ud
\ud
Childhood mortality remains an important subject, particularly in sub-Saharan Africa where levels are still unacceptably high. To achieve the set Millennium Development Goals 4, calls for comprehensive application of the proven cost-effective interventions. Understanding spatial clustering of childhood mortality can provide a guide in targeting the interventions in a more strategic approach to the population where mortality is highest and the interventions are most likely to make an impact.\ud
\ud
METHODS\ud
\ud
Annual child mortality rates were calculated for each village, using person-years observed as the denominator. Kulldorff's spatial scan statistic was used for the identification and testing of childhood mortality clusters. All under-five deaths that occurred within a 10-year period from 1997 to 2006 were included in the analysis. Villages were used as units of clusters; all 25 health and demographic surveillance sites (HDSS) villages in the Ifakara health and demographic surveillance area were included.\ud
\ud
RESULTS\ud
\ud
Of the 10 years of analysis, statistically significant spatial clustering was identified in only 2 years (1998 and 2001). In 1998, the statistically significant cluster (p < 0.01) was composed of nine villages. A total of 106 childhood deaths were observed against an expected 77.3. The other statistically significant cluster (p < 0.05) identified in 2001 was composed of only one village. In this cluster, 36 childhood deaths were observed compared to 20.3 expected. Purely temporal analysis indicated that the year 2003 was a significant cluster (p < 0.05). Total deaths were 393 and expected were 335.8. Spatial-temporal analysis showed that nine villages were identified as statistically significant clusters (p < 0.05) for the period covering January 1997-December 1998. Total observed deaths in this cluster were 205 while 150.7 were expected.\ud
\ud
CONCLUSION\ud
\ud
There is evidence of spatial clustering in childhood mortality within the Ifakara HDSS. Further investigations are needed to explore the source of clustering and identify strategies of reaching the cluster population with the existing effective interventions. However, that should happen alongside delivery of interventions to the broader population
A scan statistic for continuous data based on the normal probability model
Temporal, spatial and space-time scan statistics are commonly used to detect and evaluate the statistical significance of temporal and/or geographical disease clusters, without any prior assumptions on the location, time period or size of those clusters. Scan statistics are mostly used for count data, such as disease incidence or mortality. Sometimes there is an interest in looking for clusters with respect to a continuous variable, such as lead levels in children or low birth weight. For such continuous data, we present a scan statistic where the likelihood is calculated using the the normal probability model. It may also be used for other distributions, while still maintaining the correct alpha level. In an application of the new method, we look for geographical clusters of low birth weight in New York City
- …