6 research outputs found

    Spatio-temporal clustering in application

    Get PDF
    The importance of machine learning methods in the data analysis of both academic research and industry applications has advanced rapidly in recent years. This thesis will investigate how a method of unsupervised machine learning known as clustering can be employed to analyse spatial and spatio-temporal data from different fields of application. Spatio-temporal data present a particular challenge. In spatial contexts, the notion of dependency among geographically close elements needs to be considered when analysing the geographic distance as well as other spatial components. The temporal dimension of the data makes traditional dissimilarity metrics unsuitable due to the sequential ordering of data points. For this reason, this thesis will present ways of overcoming the shortcomings in existing methodologies when applied to these data types. By doing so, it will contribute to the literature on clustering through innovative extensions, adaptations, and considerations. The flexibility of clustering will be demonstrated in three different application contexts in health, finance, and marketing. As such, this thesis will also contribute to the academic literature in these areas and offer valuable insights into applicable machine learning methodology for practitioners

    Filaments of crime: Informing policing via thresholded ridge estimation

    Full text link
    Objectives: We introduce a new method for reducing crime in hot spots and across cities through ridge estimation. In doing so, our goal is to explore the application of density ridges to hot spots and patrol optimization, and to contribute to the policing literature in police patrolling and crime reduction strategies. Methods: We make use of the subspace-constrained mean shift algorithm, a recently introduced approach for ridge estimation further developed in cosmology, which we modify and extend for geospatial datasets and hot spot analysis. Our experiments extract density ridges of Part I crime incidents from the City of Chicago during the year 2018 and early 2019 to demonstrate the application to current data. Results: Our results demonstrate nonlinear mode-following ridges in agreement with broader kernel density estimates. Using early 2019 incidents with predictive ridges extracted from 2018 data, we create multi-run confidence intervals and show that our patrol templates cover around 94% of incidents for 0.1-mile envelopes around ridges, quickly rising to near-complete coverage. We also develop and provide researchers, as well as practitioners, with a user-friendly and open-source software for fast geospatial density ridge estimation. Conclusions: We show that ridges following crime report densities can be used to enhance patrolling capabilities. Our empirical tests show the stability of ridges based on past data, offering an accessible way of identifying routes within hot spots instead of patrolling epicenters. We suggest further research into the application and efficacy of density ridges for patrolling.Comment: 17 pages, 3 figure
    corecore