6 research outputs found

    THE SEMANTIC WEB AND TURNS OF SOCIAL SCIENCE

    Get PDF
    This research is a theoretical investigation of the semantic Web in light of the turns of social science. Social science has experienced several turns to have better understandings of social realities. If it is consistent with the turns, then the semantic Web is expected to have better representation of social realities than the conventional information systems. There are two major languages in the semantic Web: RDF and OWL. It is revealed that describing concepts by RDF conforms to the idea of phenomenology. Since RDF and OWL are based on the open-world assumption, the semantic Web is considered to be consistent with the linguistic turn. With the high consistency with social science, the semantic Web can be a good candidate of IS research topics. IS research is expected to contribute to the development of the semantic web, and the development of methodology in particula

    Génération automatique d'alignements complexes d'ontologies

    Get PDF
    Le web de données liées (LOD) est composé de nombreux entrepôts de données. Ces données sont décrites par différents vocabulaires (ou ontologies). Chaque ontologie a une terminologie et une modélisation propre ce qui les rend hétérogènes. Pour lier et rendre les données du web de données liées interopérables, les alignements d'ontologies établissent des correspondances entre les entités desdites ontologies. Il existe de nombreux systèmes d'alignement qui génèrent des correspondances simples, i.e., ils lient une entité à une autre entité. Toutefois, pour surmonter l'hétérogénéité des ontologies, des correspondances plus expressives sont parfois nécessaires. Trouver ce genre de correspondances est un travail fastidieux qu'il convient d'automatiser. Dans le cadre de cette thèse, une approche d'alignement complexe basée sur des besoins utilisateurs et des instances communes est proposée. Le domaine des alignements complexes est relativement récent et peu de travaux adressent la problématique de leur évaluation. Pour pallier ce manque, un système d'évaluation automatique basé sur de la comparaison d'instances est proposé. Ce système est complété par un jeu de données artificiel sur le domaine des conférences.The Linked Open Data (LOD) cloud is composed of data repositories. The data in the repositories are described by vocabularies also called ontologies. Each ontology has its own terminology and model. This leads to heterogeneity between them. To make the ontologies and the data they describe interoperable, ontology alignments establish correspondences, or links between their entities. There are many ontology matching systems which generate simple alignments, i.e., they link an entity to another. However, to overcome the ontology heterogeneity, more expressive correspondences are sometimes needed. Finding this kind of correspondence is a fastidious task that can be automated. In this thesis, an automatic complex matching approach based on a user's knowledge needs and common instances is proposed. The complex alignment field is still growing and little work address the evaluation of such alignments. To palliate this lack, we propose an automatic complex alignment evaluation system. This system is based on instances. A famous alignment evaluation dataset has been extended for this evaluation

    Web-scale web table to knowledge base matching

    Full text link
    Millions of relational HTML tables are found on the World Wide Web. In contrast to unstructured text, relational web tables provide a compact representation of entities described by attributes. The data within these tables covers a broad topical range. Web table data is used for question answering, augmentation of search results, and knowledge base completion. Until a few years ago, only search engines companies like Google and Microsoft owned large web crawls from which web tables are extracted. Thus, researches outside the companies have not been able to work with web tables. In this thesis, the first publicly available web table corpus containing millions of web tables is introduced. The corpus enables interested researchers to experiment with web tables. A profile of the corpus is created to give insights to the characteristics and topics. Further, the potential of web tables for augmenting cross-domain knowledge bases is investigated. For the use case of knowledge base augmentation, it is necessary to understand the web table content. For this reason, web tables are matched to a knowledge base. The matching comprises three matching tasks: instance, property, and class matching. Existing web table to knowledge base matching systems either focus on a subset of these matching tasks or are evaluated using gold standards which also only cover a subset of the challenges that arise when matching web tables to knowledge bases. This thesis systematically evaluates the utility of a wide range of different features for the web table to knowledge base matching task using a single gold standard. The results of the evaluation are used afterwards to design a holistic matching method which covers all matching tasks and outperforms state-of-the-art web table to knowledge base matching systems. In order to achieve these goals, we first propose the T2K Match algorithm which addresses all three matching tasks in an integrated fashion. In addition, we introduce the T2D gold standard which covers a wide variety of challenges. By evaluating T2K Match against the T2D gold standard, we identify that only considering the table content is insufficient. Hence, we include features of three categories: features found in the table, in the table context like the page title, and features that base on external resources like a synonym dictionary. We analyze the utility of the features for each matching task. The analysis shows that certain problems cannot be overcome by matching each table in isolation to the knowledge base. In addition, relying on the features is not enough for the property matching task. Based on these findings, we extend T2K Match into T2K Match++ which exploits indirect matches to web tables about the same topic and uses knowledge derived from the knowledge base. We show that T2K Match++ outperforms all state-of-the-art web table to knowledge base matching approaches on the T2D and Limaye gold standard. Most systems show good results on one matching task but T2K Match++ is the only system that achieves F-measure scores above 0:8 for all tasks. Compared to results of the best performing system TableMiner+, the F-measure for the difficult property matching task is increased by 0.08, for the class and instance matching task by 0.05 and 0.03, respectively

    Minimal Definition Signatures: Computation and Application to Ontology Alignment

    Get PDF
    In computer science, ontologies define a domain to facilitate knowledge representation and sharing, in a machine processable way. Ontologies approximate an actual world representation, and thus ontologies will differ for many reasons. Therefore knowledge sharing, and in general semantic interoperability, is inherently hindered or even precluded between heterogenous ontologies. Ontology matching addresses this fundamental issue by producing alignments, i.e. sets of correspondences that describe relations between semantically related entities of different ontologies. However, alignments are typically incomplete. In order to support and improve ontology alignment, and semantic interoperability in general, this thesis exploits the notion of implicit definability. Implicit definability is a semantic property of ontologies, signatures, and concepts (and roles) stating that whenever the signature is fixed under a given ontology then the definition of a particular concept (or role) is also fixed. This thesis introduces the notion of minimal definition signature (MDS) from which a given entity is implicitly definable, and presents a novel approach that provides an efficient way to compute in practice all MDSs of the definable entities. Furthermore, it investigates the application of MDSs in the context of alignment generation, evaluation, and negotiation (whereby agents cooperatively establish a mutually acceptable alignment to support opportunistic communication within open environments). As implicit definability permits defined entities to be removed without semantic loss, this thesis argues, that if the meaning of the defined entity is wholly fixed by the terms of its definition, only the terms in the definition are required to be mapped in order to map the defined entity itself; thus implicit definability entails a new type of definability-based correspondence correspondence. Therefore this thesis defines and explores the properties of definability- based correspondences, and extends several ontology alignment evaluation metrics in order to accommodate their assessment. As task signature coverage is a prerequisite of many knowledge-based tasks (e.g. service invocation), a definability-based, efficient approximation approach to obtaining minimal signature cover sets is presented. Moreover, this thesis outlines a specific alignment negotiation approach and shows that by considering definability, agents are better equipped to: (i) determine whether an alignment provides the necessary coverage to achieve a particular task (align the whole ontology, formulate a message or query); (ii) adhere to privacy and confidentiality constraints; and (iii) minimalise the cardinality of the resulting mutual alignment

    Task-oriented complex alignments on conference organisation

    No full text
    This archive contains two complex alignment sets between 5 ontologies of the OAEI conference track dataset : cmt, conference, confOf, edas and ekaw. <br><br>One alignment set is designed for <i>ontology merging.<br></i>The other alignment set is designed for <i>query rewriting.</i><br><br>The alignments were manually created following a methodology.<br><br>The dataset is structured as follow:<br><b><i>- Readme.md</i></b>: Readme file detailing the structure of the files <br><i><b>- Conference_track_ontologies</b></i>: The ontologies of the conference track dataset<br><i>- <b>Ontology_merging</b></i>: The alignments created in the purpose of ontology merging in 3 formats: First order logic (FOL), EDOAL and OWL axioms.<br><i>- <b>Query_rewriting</b>:</i> The alignments created in the purpose of query rewriting in 2 formats: FOL and EDOAL.<div>- <i>approaches_output.ods</i>: Output of 3 approaches on this dataset:<br> - <i>Ritze2009: </i>Ritze, D., Meilicke, C., Sváb-Zamazal, O., & Stuckenschmidt, H. (2009, October). A pattern-based ontology matching approach for detecting complex correspondences. In <i>ISWC Workshop on Ontology Matching, Chantilly (VA US)</i> (pp. 25-36).<br> - <i>Ritze 2010: </i>Ritze, D., Völker, J., Meilicke, C., & Sváb-Zamazal, O. (2010). Linguistic analysis for complex ontology matching. In <i>CEUR Workshop Proceedings</i> (Vol. 689, pp. Paper-1). RWTH.<br> - <i>Jiang2016: </i>Jiang, S., Lowd, D., Kafle, S., & Dou, D. (2016). Ontology matching with knowledge rules. In <i>Transactions on Large-Scale Data-and Knowledge-Centered Systems XXVIII</i> (pp. 75-95). Springer, Berlin, Heidelberg.<br><br><br><br></div
    corecore