12,605 research outputs found
TiFi: Taxonomy Induction for Fictional Domains [Extended version]
Taxonomies are important building blocks of structured knowledge bases, and their construction from text sources and Wikipedia has received much attention. In this paper we focus on the construction of taxonomies for fictional domains, using noisy category systems from fan wikis or text extraction as input. Such fictional domains are archetypes of entity universes that are poorly covered by Wikipedia, such as also enterprise-specific knowledge bases or highly specialized verticals. Our fiction-targeted approach, called TiFi, consists of three phases: (i) category cleaning, by identifying candidate categories that truly represent classes in the domain of interest, (ii) edge cleaning, by selecting subcategory relationships that correspond to class subsumption, and (iii) top-level construction, by mapping classes onto a subset of high-level WordNet categories. A comprehensive evaluation shows that TiFi is able to construct taxonomies for a diverse range of fictional domains such as Lord of the Rings, The Simpsons or Greek Mythology with very high precision and that it outperforms state-of-the-art baselines for taxonomy induction by a substantial margin
Identification of Design Principles
This report identifies those design principles for a (possibly new) query and transformation
language for the Web supporting inference that are considered essential. Based upon these
design principles an initial strawman is selected. Scenarios for querying the Semantic Web
illustrate the design principles and their reflection in the initial strawman, i.e., a first draft of
the query language to be designed and implemented by the REWERSE working group I4
Ontology-based Why-Question Analysis Using Lexico-Syntactic Patterns
This research focuses on developing a method to analyze why-questions. Some previous researches on the why-question analysis usually used the morphological and the syntactical approach without considering the expected answer types. Moreover, they rarely involved domain ontology to capture the semantic or conceptualization of the content. Consequently, some semantic mismatches occurred and then resulting not appropriate answers. The proposed method considers the expected answer types and involves domain ontology. It adapts the simple, the bag-of-words like model, by using semantic entities (i.e., concepts/entities and relations) instead of words to represent a query. The proposed method expands the question by adding the additional semantic entities got by executing the constructed SPARQL query of the why-question over the domain ontology. The major contribution of this research is in developing an ontology-based why-question analysis method by considering the expected answer types. Some experiments have been conducted to evaluate each phase of the proposed method. The results show good performance for all performance measures used (i.e., precision, recall, undergeneration, and overgeneration). Furthermore, comparison against two baseline methods, the keyword-based ones (i.e., the term-based and the phrase-based method), shows that the proposed method obtained better performance results in terms of MRR and P@10 values
The semantic Web : theories, languages, and applications
La popularité croissante du Web permet la diffusion d’une quantité phénoménale d’information de toutes sortes et l’accès à une panoplie de services en ligne en raison du développement effréné de ses contenus et de leur consultation quotidienne à faible coût. Malheureusement, cette explosion du Web cause un problème de surabondance de données pas toujours crédibles et souvent inutilisables; les réponses obtenues des engins de recherche peuvent être inadéquates ou imprécises et les services en ligne sont exclusifs ou incompatibles entre eux. Dans le but de pallier à ces inconvénients, le consortium W3C a proposé une solution globale, connue sous le nom de Web sémantique, qui améliore les structures de représentation des données de façon à rendre les contenus signifiants et à permettre l’inférence de nouvelles connaissances par des programmes. Ce mémoire explore les théories sous-jacentes au Web sémantique ainsi que les technologies qui lui sont propres. D’une part, les concepts de logique descriptive et de structure ontologique sont présentés et des liens sont établis entre eux. D’autre part, une hiérarchie de langages incluant, entre autres, les langages XML, RDF, DAML+OIL et OWL est introduite ainsi qu’une étude comparative de plusieurs moteurs d’inférence basés sur ces langages. Enfin, ce mémoire présente un exemple complet qui permet d’illustrer les principaux concepts du Web sémantique et d’évaluer la faisabilité de la mise en oeuvre d’une application par rapport à l’état actuel des technologies
Survey over Existing Query and Transformation Languages
A widely acknowledged obstacle for realizing the vision of the Semantic Web is the inability
of many current Semantic Web approaches to cope with data available in such diverging
representation formalisms as XML, RDF, or Topic Maps. A common query language is the first
step to allow transparent access to data in any of these formats. To further the understanding
of the requirements and approaches proposed for query languages in the conventional as well
as the Semantic Web, this report surveys a large number of query languages for accessing
XML, RDF, or Topic Maps. This is the first systematic survey to consider query languages from
all these areas. From the detailed survey of these query languages, a common classification
scheme is derived that is useful for understanding and differentiating languages within and
among all three areas
A Review of Accessing Big Data with Significant Ontologies
Ontology Based Data Access (OBDA) is a recently proposed approach which is able to provide a conceptual view on relational data sources. It addresses the problem of the direct access to big data through providing end-users with an ontology that goes between users and sources in which the ontology is connected to the data via mappings. We introduced the languages used to represent the ontologies and the mapping assertions technique that derived the query answering from sources. Query answering is divided into two steps: (i) Ontology rewriting, in which the query is rewritten with respect to the ontology into new query; (ii) mapping rewriting the query that obtained from previous step reformulating it over the data sources using mapping assertions. In this survey, we aim to study the earlier works done by other researchers in the fields of ontology, mapping and query answering over data sources
Web and Semantic Web Query Languages
A number of techniques have been developed to facilitate
powerful data retrieval on the Web and Semantic Web. Three categories
of Web query languages can be distinguished, according to the format
of the data they can retrieve: XML, RDF and Topic Maps. This article
introduces the spectrum of languages falling into these categories
and summarises their salient aspects. The languages are introduced using
common sample data and query types. Key aspects of the query
languages considered are stressed in a conclusion
- …