47 research outputs found

    Smart Environmental Data Infrastructures: Bridging the Gap between Earth Sciences and Citizens

    Get PDF
    The monitoring and forecasting of environmental conditions is a task to which much effort and resources are devoted by the scientific community and relevant authorities. Representative examples arise in meteorology, oceanography, and environmental engineering. As a consequence, high volumes of data are generated, which include data generated by earth observation systems and different kinds of models. Specific data models, formats, vocabularies and data access infrastructures have been developed and are currently being used by the scientific community. Due to this, discovering, accessing and analyzing environmental datasets requires very specific skills, which is an important barrier for their reuse in many other application domains. This paper reviews earth science data representation and access standards and technologies, and identifies the main challenges to overcome in order to enable their integration in semantic open data infrastructures. This would allow non-scientific information technology practitioners to devise new end-user solutions for citizen problems in new application domainsThis research was co-funded by (i) the TRAFAIR project (2017-EU-IA-0167), co-financed by the Connecting Europe Facility of the European Union, (ii) the RADAR-ON-RAIA project (0461_RADAR_ON_RAIA_1_E) co-financed by the European Regional Development Fund (ERDF) through the Iterreg V-A Spain-Portugal program (POCTEP) 2014-2020, and (iii) the Consellería de Educación, Universidade e Formación Profesional of the regional government of Galicia (Spain), through the support for research groups with growth potential (ED431B 2018/28)S

    Subsurface Characterization by Means of Geovisual Analytics

    Get PDF
    This Thesis is concerned with one of the major problems in subsurface characterizations emerging from ever-increasing loads of data in the last decades: What kind of technologies suit well for extracting novel, valid and useful knowledge from persistent data repositories for the characterization of subsurface regions and how can such technologies be implemented in an integrated, community-open software platform? In order to address those questions, an interactive, open-source software platform for geoscientific knowledge discovery has been developed, which enables domain experts to generate, optimize and validate prognostic models of the subsurface domain. Such a free tool has been missing in the geoscientific community so far. The extensible software platform GeoReVi (Geological Reservoir Virtualization) implements selected aspects of geovisual analytics with special attention being paid to an implementation of the knowledge discovery in databases process. With GeoReVi the human expert can model and visualize static and dynamic systems in the subsurface in a feedback cycle. The created models can be analyzed and parameterized by means of modern approaches from geostatistics and data mining. Hence, knowledge that is useful to both the assessment of subsurface potentials and to support decision-making during the utilization process of the subsurface regions can be extracted and exchanged in a formalized manner. The modular software application is composed of both integrated and centralized databases, a graphical user interface and a business logic. In order to fulfill the needs of low computing time in accordance with high computational complexity of spatial problems, the software system makes intense use of parallelism and asynchronous programming. The competitiveness of industry branches, which are aimed at utilizing the subsurface in unknown regions, such as the geothermal energy production or carbon capture and storage, are especially dependent on the quality of spatial forecasts for relevant rock and fluid properties. Thus, the focus of this work has been laid upon the implementation of algorithms, which enhance the predictability of properties in space under consideration of uncertainty. The software system was therefore evaluated in ample real-world scenarios by solving problems from scientific, educational and industrial projects. The implemented software system shows an excellent suitability to generically address spatial problems such as interpolation or stochastic simulation under consideration of numerical uncertainty. In this context, GeoReVi served as a tool for discovering new knowledge with special regard to investigating the heterogeneity of rock media on multiple scales of investigation. Among others, it could be demonstrated that the three-dimensional scalar fields of different petrophysical and geochemical properties in sandstone media may diverge significantly at small-scales. In fact, if the small-scale variability is not considered in field-scale projects, in which the sampling density is usually low, statistical correlations and thus empirical relationships might be feigned. Furthermore, it could be demonstrated that the simple kriging variance, which is used to simulate the natural variability in sequential simulations, systematically underestimates the intrinsic variability of the investigated sandstone media. If the small-scale variability can be determined by high-resolution sampling, it can be used to enhance conditional simulations at the scale of depositional environments

    A new approach of top-down induction of decision trees for knowledge discovery

    Get PDF
    Top-down induction of decision trees is the most popular technique for classification in the field of data mining and knowledge discovery. Quinlan developed the basic induction algorithm of decision trees, ID3 (1984), and extended to C4.5 (1993). There is a lot of research work for dealing with a single attribute decision-making node (so-called the first-order decision) of decision trees. Murphy and Pazzani (1991) addressed about multiple-attribute conditions at decision-making nodes. They show that higher order decision-making generates smaller decision trees and better accuracy. However, there always exist NP-complete combinations of multiple-attribute decision-makings.;We develop a new algorithm of second-order decision-tree inductions (SODI) for nominal attributes. The induction rules of first-order decision trees are combined by \u27AND\u27 logic only, but those of SODI consist of \u27AND\u27, \u27OR\u27, and \u27OTHERWISE\u27 logics. It generates more accurate results and smaller decision trees than any first-order decision tree inductions.;Quinlan used information gains via VC-dimension (Vapnik-Chevonenkis; Vapnik, 1995) for clustering the experimental values for each numerical attribute. However, many researchers have discovered the weakness of the use of VC-dim analysis. Bennett (1997) sophistically applies support vector machines (SVM) to decision tree induction. We suggest a heuristic algorithm (SVMM; SVM for Multi-category) that combines a TDIDT scheme with SVM. In this thesis it will be also addressed how to solve multiclass classification problems.;Our final goal for this thesis is IDSS (Induction of Decision Trees using SODI and SVMM). We will address how to combine SODI and SVMM for the construction of top-down induction of decision trees in order to minimize the generalized penalty cost

    An algorithmic framework for visualising and exploring multidimensional data

    Get PDF
    To help understand multidimensional data, information visualisation techniques are often applied to take advantage of human visual perception in exposing latent structure. A popular means of presenting such data is via two-dimensional scatterplots where the inter-point proximities reflect some notion of similarity between the entities represented. This can result in potentially interesting structure becoming almost immediately apparent. Traditional algorithms for carrying out this dimension reduction tend to have different strengths and weaknesses in terms of run times and layout quality. However, it has been found that the combination of algorithms can produce hybrid variants that exhibit significantly lower run times while maintaining accurate depictions of high-dimensional structure. The author's initial contribution in the creation of such algorithms led to the design and implementation of a software system (HIVE) for the development and investigation of new hybrid variants and the subsequent analysis of the data they transform. This development was motivated by the fact that there are potentially many hybrid algorithmic combinations to explore and therefore an environment that is conductive to their development, analysis and use is beneficial not only in exploring the data they transform but also in exploring the growing number of visualisation tools that these algorithms beget. This thesis descries three areas of the author's contribution to the field of information visualisation. Firstly, work on hybrid algorithms for dimension reduction is presented and their analysis shows their effectiveness. Secondly, the development of a framework for the creation of tailored hybrid algorithms is illustrated. Thirdly, a system embodying the framework, providing an environment conductive to the development, evaluation and use of the algorithms is described. Case studies are provided to demonstrate how the author and others have used and found value in the system across areas as diverse as environmental science, social science and investigative psychology, where multidimensional data are in abundance

    Big Data Computing for Geospatial Applications

    Get PDF
    The convergence of big data and geospatial computing has brought forth challenges and opportunities to Geographic Information Science with regard to geospatial data management, processing, analysis, modeling, and visualization. This book highlights recent advancements in integrating new computing approaches, spatial methods, and data management strategies to tackle geospatial big data challenges and meanwhile demonstrates opportunities for using big data for geospatial applications. Crucial to the advancements highlighted in this book is the integration of computational thinking and spatial thinking and the transformation of abstract ideas and models to concrete data structures and algorithms

    Geospatial Information Research: State of the Art, Case Studies and Future Perspectives

    Get PDF
    Geospatial information science (GI science) is concerned with the development and application of geodetic and information science methods for modeling, acquiring, sharing, managing, exploring, analyzing, synthesizing, visualizing, and evaluating data on spatio-temporal phenomena related to the Earth. As an interdisciplinary scientific discipline, it focuses on developing and adapting information technologies to understand processes on the Earth and human-place interactions, to detect and predict trends and patterns in the observed data, and to support decision making. The authors – members of DGK, the Geoinformatics division, as part of the Committee on Geodesy of the Bavarian Academy of Sciences and Humanities, representing geodetic research and university teaching in Germany – have prepared this paper as a means to point out future research questions and directions in geospatial information science. For the different facets of geospatial information science, the state of art is presented and underlined with mostly own case studies. The paper thus illustrates which contributions the German GI community makes and which research perspectives arise in geospatial information science. The paper further demonstrates that GI science, with its expertise in data acquisition and interpretation, information modeling and management, integration, decision support, visualization, and dissemination, can help solve many of the grand challenges facing society today and in the future

    Cognitive Foundations for Visual Analytics

    Full text link
    corecore