7,493 research outputs found

    Using distributional similarity to organise biomedical terminology

    We investigate an application of distributional similarity techniques to the problem of structural organisation of biomedical terminology. Our application domain is the relatively small GENIA corpus. Using terms that have been accurately marked-up by hand within the corpus, we consider the problem of automatically determining semantic proximity. Terminological units are defined for our purposes as normalised classes of individual terms. Syntactic analysis of the corpus data is carried out using the Pro3Gres parser and provides the data required to calculate distributional similarity using a variety of different measures. Evaluation is performed against a hand-crafted gold standard for this domain in the form of the GENIA ontology. We show that distributional similarity can be used to predict semantic type with a good degree of accuracy.
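As a rough illustration of the idea in this abstract, the sketch below scores two terms by the cosine of their syntactic-context count vectors. It is only a minimal example under stated assumptions: the observations are invented toy data, the helper names `context_vector` and `cosine` are hypothetical, and cosine is just one of many possible measures, not necessarily one the authors used.

```python
import math
from collections import Counter

def context_vector(observations, term):
    """Count the (relation, word) contexts in which `term` was observed."""
    return Counter(ctx for t, ctx in observations if t == term)

def cosine(u, v):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(u[k] * v[k] for k in u.keys() & v.keys())
    norm = math.sqrt(sum(x * x for x in u.values()) * sum(x * x for x in v.values()))
    return dot / norm if norm else 0.0

# Toy (term, (dependency relation, head word)) pairs; invented for illustration,
# standing in for what a dependency parser would extract from a corpus.
observations = [
    ("IL-2", ("subj_of", "activate")),
    ("IL-2", ("obj_of", "express")),
    ("IL-4", ("subj_of", "activate")),
    ("IL-4", ("obj_of", "express")),
    ("T cell", ("obj_of", "activate")),
    ("T cell", ("subj_of", "express")),
]

sim_same_type = cosine(context_vector(observations, "IL-2"),
                       context_vector(observations, "IL-4"))
sim_diff_type = cosine(context_vector(observations, "IL-2"),
                       context_vector(observations, "T cell"))
print(sim_same_type, sim_diff_type)
```

In this toy data the two interleukins share all of their contexts and the cell type shares none, so the first score is high and the second is zero. A real corpus would give graded values in between.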

    Dealing with uncertain entities in ontology alignment using rough sets

    This is the author's accepted manuscript. The final published article is available from the link below. Copyright © 2012 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works for resale or redistribution to servers or lists, or reuse of any copyrighted components of this work in other works.
    Ontology alignment facilitates the exchange of knowledge among heterogeneous data sources. Many approaches to ontology alignment use multiple similarity measures to map entities between ontologies. However, dealing with uncertain entities, for which the employed similarity measures produce conflicting results, remains a key challenge. This paper presents OARS, a rough-set-based approach to ontology alignment that achieves a high degree of accuracy in situations where uncertainty arises from the conflicting results generated by different similarity measures. OARS employs a combinational approach and considers both lexical and structural similarity measures. OARS is extensively evaluated with the benchmark ontologies of the Ontology Alignment Evaluation Initiative (OAEI) 2010; it achieves the best recall among a number of alignment systems while delivering comparable precision.
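To make the rough-set notion concrete, the sketch below computes the standard lower and upper approximations of a set of known matches, where candidate mappings are indiscernible when two similarity measures give them the same discretised verdict. This is not the OARS procedure, which the abstract does not detail. The candidate names, the "high"/"low" verdicts, and the function `approximations` are all hypothetical.

```python
from collections import defaultdict

def approximations(objects, target):
    """Return the rough-set (lower, upper) approximations of `target`.

    objects: dict mapping an object name to its tuple of attribute values.
    target:  set of object names to approximate.
    Objects with identical attribute tuples are indiscernible.
    """
    classes = defaultdict(set)
    for name, attrs in objects.items():
        classes[attrs].add(name)
    lower, upper = set(), set()
    for members in classes.values():
        if members <= target:   # class lies entirely inside the target
            lower |= members
        if members & target:    # class overlaps the target
            upper |= members
    return lower, upper

# Hypothetical candidate mappings with (lexical, structural) verdicts.
candidates = {
    "Author~Writer":   ("low", "high"),
    "Paper~Article":   ("high", "high"),
    "Chair~Chairman":  ("high", "high"),
    "Review~Reviewer": ("high", "low"),
    "Topic~Track":     ("low", "high"),
    "Fee~Paper":       ("low", "low"),
}
# Hypothetical reference matches used as the target set.
known_matches = {"Paper~Article", "Chair~Chairman", "Author~Writer"}

lower, upper = approximations(candidates, known_matches)
boundary = upper - lower  # mappings the two measures cannot decide on
print(sorted(lower), sorted(boundary))
```

Here the mappings on which both measures agree fall in the lower approximation, while those with conflicting verdicts that only partly coincide with the known matches fall in the boundary region. The boundary region is where the uncertainty described in the abstract shows up.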

    A model of the dynamics of organizational communication

    We propose a model of the dynamics of organizational communication. Our model specifies the mechanics by which communication impact is fed back to communication inputs and closes the gap between sender and receiver of messages. We draw on language critique, a branch of language philosophy, and derive joint linguistic actions of interlocutors to explain the emergence and adaptation of communication on the group level. The model is framed by Te'eni's cognitive-affective model of organizational communication

    On the similarity relation within fuzzy ontology components

    Ontology reuse is an important research issue. Ontology merging, integration, mapping, alignment and versioning are some of its subprocesses, and considerable research has been conducted on them. One issue common to these subprocesses is the problem of defining similarity relations among ontology components. Crisp ontologies become less suitable in domains where the concepts to be represented have vague, uncertain and imprecise definitions. Fuzzy ontologies are developed to cope with these aspects, and they are equally concerned with the problem of ontology reuse. Similarity relations within a fuzzy context may be defined based on the linguistic similarity among ontology components, or may be deduced from their intentional definitions. The latter approach needs to be dealt with differently in crisp and fuzzy ontologies. This is the scope of this paper.

    MultiFarm: A benchmark for multilingual ontology matching

    In this paper we present the MultiFarm dataset, which has been designed as a benchmark for multilingual ontology matching. The MultiFarm dataset is composed of a set of ontologies translated in different languages and the corresponding alignments between these ontologies. It is based on the OntoFarm dataset, which has been used successfully for several years in the Ontology Alignment Evaluation Initiative (OAEI). By translating the ontologies of the OntoFarm dataset into eight different languages – Chinese, Czech, Dutch, French, German, Portuguese, Russian, and Spanish – we created a comprehensive set of realistic test cases. Based on these test cases, it is possible to evaluate and compare the performance of matching approaches with a special focus on multilingualism

    LabelTranslator: A Tool to Automatically Localize an Ontology

    This demo proposal briefly presents LabelTranslator, a system that suggests translations of ontology labels, with the purpose of localizing ontologies. LabelTranslator takes as input an ontology whose labels are described in a source natural language and obtains the most probable translation of each ontology label into a target natural language. Our main contribution is the automation of this process, which reduces the human effort needed to localize the ontology manually.

    An iterative approach for lexicon characterization in juridical context

    In the juridical context, knowledge management applications have a central role. In order to improve the effectiveness of document management procedures, techniques for automatic comprehension of textual content are required. In this work, a methodology for semi-automatic derivation of knowledge from document collections is proposed. In order to extract relevant information from document text, a process integrating both statistical and lexical approaches is applied. Moreover, we propose a system for evaluating the quality of the extracted peculiar lexicon. The system is used to process a heterogeneous corpus of documents from Italy’s juridical domain.