278 research outputs found

    Semantic enrichment of knowledge sources supported by domain ontologies

    Get PDF
    This thesis introduces a novel conceptual framework to support the creation of knowledge representations based on enriched Semantic Vectors, using the classical vector space model approach extended with ontological support. One of the primary research challenges addressed here relates to the process of formalization and representation of document contents, where most existing approaches are limited and only take into account the explicit, word-based information in the document. This research explores how traditional knowledge representations can be enriched through incorporation of implicit information derived from the complex relationships (semantic associations) modelled by domain ontologies with the addition of information presented in documents. The relevant achievements pursued by this thesis are the following: (i) conceptualization of a model that enables the semantic enrichment of knowledge sources supported by domain experts; (ii) development of a method for extending the traditional vector space, using domain ontologies; (iii) development of a method to support ontology learning, based on the discovery of new ontological relations expressed in non-structured information sources; (iv) development of a process to evaluate the semantic enrichment; (v) implementation of a proof-of-concept, named SENSE (Semantic Enrichment kNowledge SourcEs), which enables to validate the ideas established under the scope of this thesis; (vi) publication of several scientific articles and the support to 4 master dissertations carried out by the department of Electrical and Computer Engineering from FCT/UNL. It is worth mentioning that the work developed under the semantic referential covered by this thesis has reused relevant achievements within the scope of research European projects, in order to address approaches which are considered scientifically sound and coherent and avoid “reinventing the wheel”.European research projects - CoSpaces (IST-5-034245), CRESCENDO (FP7-234344) and MobiS (FP7-318452

    An ontology-based recommender system using scholar's background knowledge

    Get PDF
    Scholar’s recommender systems recommend scientific articles based on the similarity of articles to scholars’ profiles, which are a collection of keywords that scholars are interested in. Recent profiling approaches extract keywords from the scholars’ information such as publications, searching keywords, and homepages, and train a reference ontology, which is often a general-purpose ontology, in order to profile the scholars’ interests. However, such approaches do not consider the scholars’ knowledge because the recommender system only recommends articles which are syntactically similar to articles that scholars have already visited, while scholars are interested in articles which contain comparatively new knowledge. In addition, the systems do not support multi-area property of scholars’ knowledge as researchers usually do research in multiple topics simultaneously and are expected to receive focused-topic articles in each recommendation. To address these problems, this study develops a domain-specific reference ontology by merging six Web taxonomies and exploits Wikipedia as a conflict resolver of ontologies. Then, the knowledge items from the scholars’ information are extracted, transformed by DBpedia, and clustered into relevant topics in order to model the multi-area property of scholars’ knowledge. Finally, the clustered knowledge items are mapped to the reference ontology by using DBpedia to create clustered profiles. In addition a semantic similarity algorithm is adapted to the clustered profiles, which enables recommendation of focused-topic articles that contain new knowledge. To evaluate performance of the proposed approach, three different data sets from scholars’ information in Computer Science domain are created, and the precisions in different cases are measured. The proposed method, in comparison with the baseline methods, improves the average precision by 6% when the new reference ontology along with the full scholars’ knowledge is utilized, by an extra 7.2% when scholars’ knowledge is transformed by DBpedia, and further 8.9% when clustered profile is applied. Experimental results certify that using knowledge items instead of keywords for profiling as well as transforming the knowledge items by DBpedia can significantly improve the recommendation performance. Besides, the domain-specific reference ontology can effectively capture the full scholars’ knowledge which results to more accurate profiling

    Interim research assessment 2003-2005 - Computer Science

    Get PDF
    This report primarily serves as a source of information for the 2007 Interim Research Assessment Committee for Computer Science at the three technical universities in the Netherlands. The report also provides information for others interested in our research activities

    Formal Linguistic Models and Knowledge Processing. A Structuralist Approach to Rule-Based Ontology Learning and Population

    Get PDF
    2013 - 2014The main aim of this research is to propose a structuralist approach for knowledge processing by means of ontology learning and population, achieved starting from unstructured and structured texts. The method suggested includes distributional semantic approaches and NL formalization theories, in order to develop a framework, which relies upon deep linguistic analysis... [edited by author]XIII n.s

    Deep learning based semantic textual similarity for applications in translation technology

    Get PDF
    A thesis submitted in partial fulfilment of the requirements of the University of Wolverhampton for the degree of Doctor of Philosophy.Semantic Textual Similarity (STS) measures the equivalence of meanings between two textual segments. It is a fundamental task for many natural language processing applications. In this study, we focus on employing STS in the context of translation technology. We start by developing models to estimate STS. We propose a new unsupervised vector aggregation-based STS method which relies on contextual word embeddings. We also propose a novel Siamese neural network based on efficient recurrent neural network units. We empirically evaluate various unsupervised and supervised STS methods, including these newly proposed methods in three different English STS datasets, two non- English datasets and a bio-medical STS dataset to list the best supervised and unsupervised STS methods. We then embed these STS methods in translation technology applications. Firstly we experiment with Translation Memory (TM) systems. We propose a novel TM matching and retrieval method based on STS methods that outperform current TM systems. We then utilise the developed STS architectures in translation Quality Estimation (QE). We show that the proposed methods are simple but outperform complex QE architectures and improve the state-of-theart results. The implementations of these methods have been released as open source

    Ontology mapping with auxiliary resources

    Get PDF
    • …
    corecore