32 research outputs found

    GIScience Driven R&D: Interdisciplinary GIST Group at Oak Ridge National Laboratory

    Get PDF
    Oak Ridge National Laboratory (ORNL) is the largest DOE multi-research facility in the US and is located in Oak Ridge, TN. One of the signature strengths of ORNL is Computational Science and Engineering and the Geographic Information Science and Technology (GIST) group contributes to that strength as part of the Computer Sciences and Engineering Division (CSED) within the Computer Sciences Directorate. The GIST group is at the forefront of High Resolution Population and Social Dynamics research and development resulting in innovative products such as LandScan Global (population distribution at 30 arc seconds) and now LandScan HD (population distribution at 3 arc seconds). Other research capabilities within the group include Critical Infrastructure Modeling, Energy Assurance, High Performance Geocomputation and Visualization, Emergency Preparedness and Response, Earth Science Informatics, and Climate Change Impacts. The GIST group is an interdisciplinary group ranging of approximately 50 researchers (staff and students) and over the summer, the number of students increases anywhere from 15 to 25. As for Purdue graduates within the group, there are three staff and two interns at this time and Purdue students regularly participate in our summer internships programs

    Deep Learning for Spatiotemporal Big Data: A Vision on Opportunities and Challenges

    Full text link
    With advancements in GPS, remote sensing, and computational simulation, an enormous volume of spatiotemporal data is being collected at an increasing speed from various application domains, spanning Earth sciences, agriculture, smart cities, and public safety. Such emerging geospatial and spatiotemporal big data, coupled with recent advances in deep learning technologies, foster new opportunities to solve problems that have not been possible before. For instance, remote sensing researchers can potentially train a foundation model using Earth imagery big data for numerous land cover and land use modeling tasks. Coastal modelers can train AI surrogates to speed up numerical simulations. However, the distinctive characteristics of spatiotemporal big data pose new challenges for deep learning technologies. This vision paper introduces various types of spatiotemporal big data, discusses new research opportunities in the realm of deep learning applied to spatiotemporal big data, lists the unique challenges, and identifies several future research needs

    A Decision Tree Based on Spatial Relationships for Predicting Hotspots in Peatlands

    Get PDF
    Predicting hotspot occurrence as an indicator of forest and land fires is essential in developing an early warning system for fire prevention.  This work applied a spatial decision tree algorithm on spatial data of forest fires. The algorithm is the improvement of the conventional decision tree algorithm in which the distance and topological relationships are included to grow up spatial decision trees. Spatial data consist of a target layer and ten explanatory layers representing physical, weather, socio-economic and peatland characteristics in the study area Rokan Hilir District, Indonesia. Target objects are hotspots of 2008 and non-hotspot points.  The result is a pruned spatial decision tree with 122 leaves and the accuracy of 71.66%.  The spatial tree has produces higher accuracy than the non-spatial trees that were created using the ID3 and C4.5 algorithm. The ID3 decision tree has accuracy of 49.02% while the accuracy of C4.5 decision tree reaches 65.24%

    Towards a model for the multidimensional analysis of field data

    Get PDF
    International audienceIntegration of spatial data into multidimensional models leads to the concept of Spatial OLAP (SOLAP). Usually, SOLAP models exploit discrete spatial data. Few works integrate continuous field data into dimensions and measures. In this paper, we provide a multidimensional model that supports measures and dimension as continuous field data, independently of their implementation

    SMoT+: Extending the SMoT Algorithm for Discovering Stops in Nested Sites

    Get PDF
    Several methods have been proposed to analyse trajectory data. However, a few of these methods consider trajectory relations with relevant features of the geographic space. One of the best-known methods that take into account the geographical regions crossed by a trajectory is the SMoT algorithm. Nevertheless, SMoT considers only disjoint geographic regions that a trajectory may traverse, while many regions of interest are contained in other regions. In this article, we extend the SMoT algorithm for discovering stops in nested regions. The proposed algorithm, called SMoT+, takes advantage of information about the hierarchy of nested regions to efficiently discover the stops in regions at different levels of this hierarchy. Experiments with real data show that SMoT+ detects stops in nested regions, which are not detected by the original SMoT algorithm, with minor growth of processing time

    Language-Based Access to Large Sensor Repositories

    Get PDF
    Sensor data have broadened their scope recently, ranging now from the simple time series measurements to, e.g., hyperspectral satellite image maps timeseries. In addition to observed data, simulation data increasingly have to be merged, for example 4-D ocean and atmospheric data. The majority of these data fall into the category of multi-dimensional rasters. However, when it comes to flexible retrieval, including sensor data search, aggregation, analysis, fusion, etc., standard query language support in the past has not kept up with the service level of, e.g., metadata retrieval. To close this gap, the Open GeoSpatial Consortium (OGC) has issued the Web Coverage Processing Service (WCPS) Standard in December 2008. WCPS defines a request language for multi-dimensional raster data, suitable for specifying navigation, download, and analysis of sensor, image, and statistics data. This contribution emphasises sensor data modeling and the perspectives for an integrated, cross-dimensional sensor data retrieval. Further, the WCPS reference implementation is briefly discussed

    Spatial Big Data Analytics: Classification Techniques for Earth Observation Imagery

    Get PDF
    University of Minnesota Ph.D. dissertation. August 2016. Major: Computer Science. Advisor: Shashi Shekhar. 1 computer file (PDF); xi, 120 pages.Spatial Big Data (SBD), e.g., earth observation imagery, GPS trajectories, temporally detailed road networks, etc., refers to geo-referenced data whose volume, velocity, and variety exceed the capability of current spatial computing platforms. SBD has the potential to transform our society. Vehicle GPS trajectories together with engine measurement data provide a new way to recommend environmentally friendly routes. Satellite and airborne earth observation imagery plays a crucial role in hurricane tracking, crop yield prediction, and global water management. The potential value of earth observation data is so significant that the White House recently declared that full utilization of this data is one of the nation's highest priorities. However, SBD poses significant challenges to current big data analytics. In addition to its huge dataset size (NASA collects petabytes of earth images every year), SBD exhibits four unique properties related to the nature of spatial data that must be accounted for in any data analysis. First, SBD exhibits spatial autocorrelation effects. In other words, we cannot assume that nearby samples are statistically independent. Current analytics techniques that ignore spatial autocorrelation often perform poorly such as low prediction accuracy and salt-and-pepper noise (i.e., pixels predicted as different from neighbors by mistake). Second, spatial interactions are not isotropic and vary across directions. Third, spatial dependency exists in multiple spatial scales. Finally, spatial big data exhibits heterogeneity, i.e., identical feature values may correspond to distinct class labels in different regions. Thus, learned predictive models may perform poorly in many local regions. My thesis investigates novel SBD analytics techniques to address some of these challenges. To date, I have been mostly focusing on the challenges of spatial autocorrelation and anisotropy via developing novel spatial classification models such as spatial decision trees for raster SBD (e.g., earth observation imagery). To scale up the proposed models, I developed efficient learning algorithms via computational pruning. The proposed techniques have been applied to real world remote sensing imagery for wetland mapping. I also had developed spatial ensemble learning framework to address the challenge of spatial heterogeneity, particularly the class ambiguity issues in geographical classification, i.e., samples with the same feature values belong to different classes in different spatial zones. Evaluations on three real world remote sensing datasets confirmed that proposed spatial ensemble learning outperforms current approaches such as bagging, boosting, and mixture of experts when class ambiguity exists
    corecore