1,591 research outputs found
Requirements for Information Extraction for Knowledge Management
Knowledge Management (KM) systems inherently suffer from the knowledge acquisition bottleneck - the difficulty of modeling and formalizing knowledge relevant for specific domains. A potential solution to this problem is Information Extraction (IE) technology. However, IE was originally developed for database population and there is a mismatch between what is required to successfully perform KM and what current IE technology provides. In this paper we begin to address this issue by outlining requirements for IE based KM
An introduction to Graph Data Management
A graph database is a database where the data structures for the schema
and/or instances are modeled as a (labeled)(directed) graph or generalizations
of it, and where querying is expressed by graph-oriented operations and type
constructors. In this article we present the basic notions of graph databases,
give an historical overview of its main development, and study the main current
systems that implement them
Integrating Semantic Web and Web Mining into Semantic Web Mining
We know semantic web makes information more readable and meaningful to people bymaking it more understandable to machines and web mining is the application of data miningtechniques to discover patterns from the Web.We are aware websites on the Internet are increasing, daily, in size and complexity, whichmakes rather difficult that specific information, can be easily found.In this paper will be described how we can integrate semantic web and web mining intosemantic web mining
Profiling relational data: a survey
Profiling data to determine metadata about a given dataset is an important and frequent activity of any IT professional and researcher and is necessary for various use-cases. It encompasses a vast array of methods to examine datasets and produce metadata. Among the simpler results are statistics, such as the number of null values and distinct values in a column, its data type, or the most frequent patterns of its data values. Metadata that are more difficult to compute involve multiple columns, namely correlations, unique column combinations, functional dependencies, and inclusion dependencies. Further techniques detect conditional properties of the dataset at hand. This survey provides a classification of data profiling tasks and comprehensively reviews the state of the art for each class. In addition, we review data profiling tools and systems from research and industry. We conclude with an outlook on the future of data profiling beyond traditional profiling tasks and beyond relational databases
EU-Raw Materials Intelligence Capacity Platform (EU-RMCP) – Technical system specification
EU-Raw Materials Intelligence Capacity Platform (or EU-RMICP) integrates metadata on data sources related to primary and secondary mineral resources and brings the end users an expertise on the methods and tools used in mineral intelligence. The system is capable of bringing relevant user ‘answers’ of the type 'how to proceed for …' on almost any question related to mineral resources, on the whole supply chain, from prospecting to recycling, taking into account the environmental, political and social dimensions.
EU-RMICP is based on an ontology of the domain of mineral resources (coupled with more generic cross-functional ontologies, relative to commodities, time and space), which represents the domain of the questions of the users (experts and non-experts). The user navigates in the ontology by using a Dynamic Graph of Decision (DDG), which allows him/her to discover the solutions which he/she is looking for without having to formulate any question. The system is coupled with a 'RDF Triple Store' (a database storing the ontologies), factSheets, doc-Sheets and flowSheets (i.e., specific formatted forms) related to methods and documentation, scenarios and metadata.JRC.B.6-Digital Econom
Web and Semantic Web Query Languages
A number of techniques have been developed to facilitate
powerful data retrieval on the Web and Semantic Web. Three categories
of Web query languages can be distinguished, according to the format
of the data they can retrieve: XML, RDF and Topic Maps. This article
introduces the spectrum of languages falling into these categories
and summarises their salient aspects. The languages are introduced using
common sample data and query types. Key aspects of the query
languages considered are stressed in a conclusion
Grammar-Based Geodesics in Semantic Networks
A geodesic is the shortest path between two vertices in a connected network.
The geodesic is the kernel of various network metrics including radius,
diameter, eccentricity, closeness, and betweenness. These metrics are the
foundation of much network research and thus, have been studied extensively in
the domain of single-relational networks (both in their directed and undirected
forms). However, geodesics for single-relational networks do not translate
directly to multi-relational, or semantic networks, where vertices are
connected to one another by any number of edge labels. Here, a more
sophisticated method for calculating a geodesic is necessary. This article
presents a technique for calculating geodesics in semantic networks with a
focus on semantic networks represented according to the Resource Description
Framework (RDF). In this framework, a discrete "walker" utilizes an abstract
path description called a grammar to determine which paths to include in its
geodesic calculation. The grammar-based model forms a general framework for
studying geodesic metrics in semantic networks.Comment: First draft written in 200
- …