76 research outputs found

    Geovisual analytics to enhance spatial scan statistic interpretation: an analysis of U.S. cervical cancer mortality

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Kulldorff's spatial scan statistic and its software implementation – SaTScan – are widely used for detecting and evaluating geographic clusters. However, two issues make using the method and interpreting its results non-trivial: (1) the method lacks cartographic support for understanding the clusters in geographic context and (2) results from the method are sensitive to parameter choices related to cluster scaling (abbreviated as scaling parameters), but the system provides no direct support for making these choices. We employ both established and novel geovisual analytics methods to address these issues and to enhance the interpretation of SaTScan results. We demonstrate our geovisual analytics approach in a case study analysis of cervical cancer mortality in the U.S.</p> <p>Results</p> <p>We address the first issue by providing an interactive visual interface to support the interpretation of SaTScan results. Our research to address the second issue prompted a broader discussion about the sensitivity of SaTScan results to parameter choices. Sensitivity has two components: (1) the method can identify clusters that, while being statistically significant, have heterogeneous contents comprised of both high-risk and low-risk locations and (2) the method can identify clusters that are unstable in location and size as the spatial scan scaling parameter is varied. To investigate cluster result stability, we conducted multiple SaTScan runs with systematically selected parameters. The results, when scanning a large spatial dataset (e.g., U.S. data aggregated by county), demonstrate that no single spatial scan scaling value is known to be optimal to identify clusters that exist at different scales; instead, multiple scans that vary the parameters are necessary. We introduce a novel method of measuring and visualizing reliability that facilitates identification of homogeneous clusters that are stable across analysis scales. Finally, we propose a logical approach to proceed through the analysis of SaTScan results.</p> <p>Conclusion</p> <p>The geovisual analytics approach described in this manuscript facilitates the interpretation of spatial cluster detection methods by providing cartographic representation of SaTScan results and by providing visualization methods and tools that support selection of SaTScan parameters. Our methods distinguish between heterogeneous and homogeneous clusters and assess the stability of clusters across analytic scales.</p> <p>Method</p> <p>We analyzed the cervical cancer mortality data for the United States aggregated by county between 2000 and 2004. We ran SaTScan on the dataset fifty times with different parameter choices. Our geovisual analytics approach couples SaTScan with our visual analytic platform, allowing users to interactively explore and compare SaTScan results produced by different parameter choices. The Standardized Mortality Ratio and reliability scores are visualized for all the counties to identify stable, homogeneous clusters. We evaluated our analysis result by comparing it to that produced by other independent techniques including the Empirical Bayes Smoothing and Kafadar spatial smoother methods. The geovisual analytics approach introduced here is developed and implemented in our Java-based Visual Inquiry Toolkit.</p

    LLMs Understand Glass-Box Models, Discover Surprises, and Suggest Repairs

    Full text link
    We show that large language models (LLMs) are remarkably good at working with interpretable models that decompose complex outcomes into univariate graph-represented components. By adopting a hierarchical approach to reasoning, LLMs can provide comprehensive model-level summaries without ever requiring the entire model to fit in context. This approach enables LLMs to apply their extensive background knowledge to automate common tasks in data science such as detecting anomalies that contradict prior knowledge, describing potential reasons for the anomalies, and suggesting repairs that would remove the anomalies. We use multiple examples in healthcare to demonstrate the utility of these new capabilities of LLMs, with particular emphasis on Generalized Additive Models (GAMs). Finally, we present the package TalkToEBM\texttt{TalkToEBM} as an open-source LLM-GAM interface

    Diabetes status and being up-to-date on colorectal cancer screening, 2012 Behavioral Risk Factor Surveillance System

    Get PDF
    INTRODUCTION: Although screening rates for colorectal cancer are increasing, 22 million Americans are not up-to-date with recommendations. People with diabetes are an important and rapidly growing group at increased risk for colorectal cancer. Screening status and predictors of being up-to-date on screening are largely unknown in this population. METHODS: This study used logistic regression modeling and data from the 2012 Behavioral Risk Factor Surveillance System to examine the association between diabetes and colorectal cancer screening predictors with being up-to-date on colorectal cancer screening according to criteria of the US Preventive Services Task Force for adults aged 50 or older. State prevalence rates of up-to-date colorectal cancer screening were also calculated and mapped. RESULTS: The prevalence of being up-to-date with colorectal cancer screening for all respondents aged 50 or older was 65.6%; for respondents with diabetes, the rate was 69.2%. Respondents with diabetes were 22% more likely to be up-to-date on colorectal cancer screening than those without diabetes. Among those with diabetes, having a routine checkup within the previous year significantly increased the odds of being up-to-date on colorectal cancer screening (odds ratio, 1.90). Other factors such as age, income, education, race/ethnicity, insurance status, and history of cancer were also associated with up-to-date status. CONCLUSION: Regardless of diabetes status, people who had a routine checkup within the past year were more likely to be up-to-date than people who had not. Among people with diabetes, the duration between routine checkups may be of greater importance than the frequency of diabetes-related doctor visits. Continued efforts should be made to ensure that routine care visits occur regularly to address the preventive health needs of patients with and patients without diabetes

    A National Survey of State Comprehensive Cancer Control Managers: Implications of Geographic Information Systems

    No full text
    The growth of geographic information systems (GIS) for comprehensive cancer control (CCC) planning activities has been documented. We examined concerns about use and derived principles for practice. A national survey of US CCC program managers (n = 49) was conducted. Results include statements and frequency of barriers to use GIS mapping for CCC. Uses of GIS for CCC activities have benefits, but must be considered within organizational frameworks designed to safeguard confidentiality of health information and community relationships. Education to guide understanding of and input into the decisions linked to GIS mapping can limit possible harms while advancing CCC aims
    • …
    corecore