11,312 research outputs found

    Geocoded data structures and their applications to Earth science investigations

    Get PDF
    A geocoded data structure is a means for digitally representing a geographically referenced map or image. The characteristics of representative cellular, linked, and hybrid geocoded data structures are reviewed. The data processing requirements of Earth science projects at the Goddard Space Flight Center and the basic tools of geographic data processing are described. Specific ways that new geocoded data structures can be used to adapt these tools to scientists' needs are presented. These include: expanding analysis and modeling capabilities; simplifying the merging of data sets from diverse sources; and saving computer storage space

    Data Mining and Machine Learning in Astronomy

    Full text link
    We review the current state of data mining and machine learning in astronomy. 'Data Mining' can have a somewhat mixed connotation from the point of view of a researcher in this field. If used correctly, it can be a powerful approach, holding the potential to fully exploit the exponentially increasing amount of available data, promising great scientific advance. However, if misused, it can be little more than the black-box application of complex computing algorithms that may give little physical insight, and provide questionable results. Here, we give an overview of the entire data mining process, from data collection through to the interpretation of results. We cover common machine learning algorithms, such as artificial neural networks and support vector machines, applications from a broad range of astronomy, emphasizing those where data mining techniques directly resulted in improved science, and important current and future directions, including probability density functions, parallel algorithms, petascale computing, and the time domain. We conclude that, so long as one carefully selects an appropriate algorithm, and is guided by the astronomical problem at hand, data mining can be very much the powerful tool, and not the questionable black box.Comment: Published in IJMPD. 61 pages, uses ws-ijmpd.cls. Several extra figures, some minor additions to the tex

    Computer-aided processing of LANDSAT MSS data for classification of forestlands

    Get PDF
    There are no author-identified significant results in this report

    Monitoring the defoliation of hardwood forests in Pennsylvania using LANDSAT

    Get PDF
    An automated system for conducting annual gypsy moth defoliation surveys using LANDSAT MSS data and digital processing techniques is described. A two-step preprocessing procedure was developed that uses multitemporal data sets representing forest canopy conditions before and after defoliation to create a digital image in which all nonforest cover types are eliminated or masked out of a LANDSAT image that exhibits insect defoliation. A temporal window for defoliation assessment was identified and a statewide data base was established. A data management system to interface image analysis software with the statewide data base was developed and a cost benefit analysis of this operational system was conducted

    Data Mining Applications in Big Data

    Get PDF
    Data mining is a process of extracting hidden, unknown, but potentially useful information from massive data. Big Data has great impacts on scientific discoveries and value creation. This paper introduces methods in data mining and technologies in Big Data. Challenges of data mining and data mining with big data are discussed. Some technology progress of data mining and data mining with big data are also presented

    Application of an Artificial Neural Network for the CPT-based Soil Stratigraphy Classification

    Get PDF
    Subsurface soil profiling is an essential step in a site investigation. The traditional methods for in situ investigations, such as SPT borings and sampling, have been progressively replaced by CPT soundings since they are fast, repeatable, economical and provide continuous parameters of the mechanical behaviour of the soils. However, the derived CPT-based stratigraphy profiles might present noisy thin layers, and its soil type description might not reflect a textural-based classification (i.e. Universal Soil Classification System, USCS). Thus, this paper presents a straightforward artificial neural network (ANN) algorithm, to classify CPT soundings according to the USCS. Data for training the model have been retrieved from SPT-CPT pairs collected after the 2011 Christchurch earthquake in New Zealand. The application of the ANN to case studies show how the method is a cost-effective and time-efficient approach, but more input parameters and data are needed for increasing its performance

    Ecology, biodiversity and mining: Science and solving the challenges

    Get PDF
    Mining has an impact on the ecology and biodiversity of an area, and this impact is required to be mitigated under regulatory requirements. The mitigation is achieved by applying measures, deeply rooted in scientific knowledge, of the functioning of the impacted (and rehabilitated) ecosystems and the biota supported by these ecosystems. This paper focusses on selected issues of the pre-mining ecology and biodiversity surveys and the scientific basis of regulatory requirements. Using Western Australian experience and drawing on comparisons from other countries, this paper addresses two challenges. The first is the classification of vegetation of mining tenements to inform on the variability of vegetation using a series of data-analytical experiments. The paper reveals flaws in the current practise aimed at recognition and description of plant communities in vegetation surveys. Concrete steps serving and enhancing the clarity and plausibility of regulatory tools (guidance manuals) are identified. It also advises on what we should do if we find discrepancies between the regulatory expectations and the level they reflect current scientific knowledge. The second is the comparative and spatial aspects of the identification of communities of conservation interest. The paper further identifies the missing vital information of the vegetation mapping procedures and briefly analyses the need for tools assisting assessment of conservation value of the vegetation classifications, including the building of a centralised vegetation database, formulation of vegetation and habitat classification systems, and construction of scientifically sound and ecologically informative vegetation maps at various scales of complexity
    corecore