1,020 research outputs found

    Adaptive main-memory indexing for high-performance point-polygon joins

    Get PDF
    Connected mobility applications rely heavily on geospatial joins that associate point data, such as locations of Uber cars, to static polygonal regions, such as city neighborhoods. These joins typically involve expensive geometric computations, which makes it hard to provide an interactive user experience. In this paper, we propose an adaptive polygon index that leverages true hit fltering to avoid expensive geometric computations in most cases. In particular, our approach closely approximates polygons by combining quadtrees with true hit filtering, and stores these approximations in a query-effcient radix tree. Based on this index, we introduce two geospatial join algorithms: an approximate one that guarantees a user-defined precision, and an exact one that adapts to the expected point distribution. In summary, our technique outperforms existing CPU-based joins by up to two orders of magnitude and is competitive with state-of-the-art GPU implementations

    Building Complex and Site Categorization Using Similarity to a Prototypical Site

    Get PDF
    This project presents an assessment tool for classifying building complexes using sitebased relationships as calculated from ArcGIS 9.2 using model builder and Python scripting. Anthropogenic features extracted from imagery often form the foundation of spatial databases. These data are in turn used to inform situational awareness for relief, law enforcement, and military agencies among many others. Buildings and the complexes they form are critical features within the landscape. The categorization of complexes requires an understanding of the relationships of the buildings within the site. In this study, building complexes in California were assessed for similarity to a prototypical California high school defined with a training set of known high schools and compared to a set of uncategorized sites. Eighty-eight percent of the high schools were correctly classified as being highly similar to the control data set

    Fast Mapping onto Census Blocks

    Full text link
    Pandemic measures such as social distancing and contact tracing can be enhanced by rapidly integrating dynamic location data and demographic data. Projecting billions of longitude and latitude locations onto hundreds of thousands of highly irregular demographic census block polygons is computationally challenging in both research and deployment contexts. This paper describes two approaches labeled "simple" and "fast". The simple approach can be implemented in any scripting language (Matlab/Octave, Python, Julia, R) and is easily integrated and customized to a variety of research goals. This simple approach uses a novel combination of hierarchy, sparse bounding boxes, polygon crossing-number, vectorization, and parallel processing to achieve 100,000,000+ projections per second on 100 servers. The simple approach is compact, does not increase data storage requirements, and is applicable to any country or region. The fast approach exploits the thread, vector, and memory optimizations that are possible using a low-level language (C++) and achieves similar performance on a single server. This paper details these approaches with the goal of enabling the broader community to quickly integrate location and demographic data.Comment: 8 pages, 7 figures, 55 references; accepted to IEEE HPEC 202

    An Interactive Framework for Raster Data Spatial Joins

    Get PDF
    ABSTRACT Many Geographic Information Systems (GIS) handle large geospatial datasets stored in raster representation. Spatial joins over raster data are important queries in GIS for data analysis and decision support. However, evaluating spatial joins can be very time intensive due to the size of these datasets. In this paper we propose a new interactive framework that allows users to get approximate answers in near instantaneous time, thus allowing for truly interactive data exploration. Our method utilizes two proposed statistical approaches: probabilistic join and sampling based join. Our probabilistic join method provides speedup of two orders of magnitude with no correctness guarantee, while our sampling based method provides an order of magnitude improvement over the full quad-tree join and also provides running confidence intervals. We propose a framework that combines the two approaches to allow end users to tradeoff speed versus bounded accuracy. The two approaches are evaluated empirically with real and synthetic datasets

    Advanced data structures for the interpretation of image and cartographic data in geo-based information systems

    Get PDF
    A growing need to usse geographic information systems (GIS) to improve the flexibility and overall performance of very large, heterogeneous data bases was examined. The Vaster structure and the Topological Grid structure were compared to test whether such hybrid structures represent an improvement in performance. The use of artificial intelligence in a geographic/earth sciences data base context is being explored. The architecture of the Knowledge Based GIS (KBGIS) has a dual object/spatial data base and a three tier hierarchial search subsystem. Quadtree Spatial Spectra (QTSS) are derived, based on the quadtree data structure, to generate and represent spatial distribution information for large volumes of spatial data

    Using Raster Sketches for Digital Image Retrieval

    Get PDF
    This research addresses the problem of content-based image retrieval using queries on image-object shape, completely in the raster domain. It focuses on the particularities of image databases encountered in typical topographic applications and presents the development of an environment for visual information management that enables such queries. The query consists of a user-provided raster sketch of the shape of an imaged object. The objective of the search is to retrieve images that contain an object sufficiently similar to the one specified in the query. The new contribution of this work combines the design of a comprehensive digital image database on-line query access strategy through the development of a feature library, image library and metadata library and the necessary matching tools. The matching algorithm is inspired by least-squares matching (lsm), and represents an extension of lsm to function with a variety of raster representations. The image retrieval strategy makes use of a hierarchical organization of linked feature (image-object) shapes within the feature library. The query results are ranked according to statistical scores and the user can subsequently narrow or broaden his/her search according to the previously obtained results and the purpose of the search

    Weiterentwicklung analytischer Datenbanksysteme

    Get PDF
    This thesis contributes to the state of the art in analytical database systems. First, we identify and explore extensions to better support analytics on event streams. Second, we propose a novel polygon index to enable efficient geospatial data processing in main memory. Third, we contribute a new deep learning approach to cardinality estimation, which is the core problem in cost-based query optimization.Diese Arbeit trägt zum aktuellen Forschungsstand von analytischen Datenbanksystemen bei. Wir identifizieren und explorieren Erweiterungen um Analysen auf Eventströmen besser zu unterstützen. Wir stellen eine neue Indexstruktur für Polygone vor, die eine effiziente Verarbeitung von Geodaten im Hauptspeicher ermöglicht. Zudem präsentieren wir einen neuen Ansatz für Kardinalitätsschätzungen mittels maschinellen Lernens

    Using Raster Sketches for Digital Image Retrieval

    Get PDF
    This research addresses the problem of content-based image retrieval using queries on image-object shape, completely in the raster domain. It focuses on the particularities of image databases encountered in typical topographic applications and presents the development of an environment for visual information management that enables such queries. The query consists of a user-provided raster sketch of the shape of an imaged object. The objective of the search is to retrieve images that contain an object sufficiently similar to the one specified in the query. The new contribution of this work combines the design of a comprehensive digital image database on-line query access strategy through the development of a feature library, image library and metadata library and the necessary matching tools. The matching algorithm is inspired by least-squares matching (lsm), and represents an extension of lsm to function with a variety of raster representations. The image retrieval strategy makes use of a hierarchical organization of linked feature (image-object) shapes within the feature library. The query results are ranked according to statistical scores and the user can subsequently narrow or broaden his/her search according to the previously obtained results and the purpose of the search

    0018/2009 - Indexação em Bancos de Dados Espaciais

    Get PDF
    Um banco de dados espacial é um banco de dados que deve ter a capacidade de lidar com informações relacionadas ao espaço. Para tanto, tipos de dados específicos para este contexto são essenciais na modelagem dos objetos espaciais e na sua implementação. Além disso, operações e consultas no banco espacial devem ser eficientes e capazes de interagir com esses tipos específicos. Aumentar esta eficiência é um dos incentivos para a criação de índices em banco de dados. No caso dos bancos de dados espaciais, os índices devem considerar características próprias do contexto, o que torna a criação de índices tão ou mais complexa que a mesma tarefa em bancos relacionais. O desempenho dos bancos espaciais é influenciado negativamente pelo aumento do uso e do volume de dados, o que aumenta a importância dos índices. Métodos usados para indexação espacial se utilizam de árvores de busca, onde os ramos representam divisões do espaço para a construção de dos índices. Outra abordagem utilizada é a aproximação (ou assinatura) do resultado para permitir que operações usando o aspecto espacial dos objetos, sejam realizados com maior velocidade. Este trabalho apresenta um levantamento da literatura sobre o tema de indexação em bancos de dados espaciais
    corecore