32 research outputs found

    Preliminary results in tag disambiguation using DBpedia

    Get PDF
    The availability of tag-based user-generated content for a variety of Web resources (music, photos, videos, text, etc.) has largely increased in the last years. Users can assign tags freely and then use them to share and retrieve information. However, tag-based sharing and retrieval is not optimal due to the fact that tags are plain text labels without an explicit or formal meaning, and hence polysemy and synonymy should be dealt with appropriately. To ameliorate these problems, we propose a context-based tag disambiguation algorithm that selects the meaning of a tag among a set of candidate DBpedia entries, using a common information retrieval similarity measure. The most similar DBpedia en-try is selected as the one representing the meaning of the tag. We describe and analyze some preliminary results, and discuss about current challenges in this area

    Social tags and linked data for ontology development: a case study in the financial domain

    Get PDF
    We describe a domain ontology development approach that extracts domain terms from folksonomies and enrich them with data and vocabularies from the Linked Open Data cloud. As a result, we obtain lightweight domain ontologies that combine the emergent knowledge of social tagging systems with formal knowledge from Ontologies. In order to illustrate the feasibility of our approach, we have produced an ontology in the financial domain from tags available in Delicious, using DBpedia, OpenCyc and UMBEL as additional knowledge sources

    Review of the state of the art: discovering and associating semantics to tags in folksonomies

    Get PDF
    This paper describes and compares the most relevant approaches for associating tags with semantics in order to make explicit the meaning of those tags. We identify a common set of steps that are usually considered across all these approaches and frame our descriptions according to them, providing a unified view of how each approach tackles the different problems that appear during the semantic association process. Furthermore, we provide some recommendations on (a) how and when to use each of the approaches according to the characteristics of the data source, and (b) how to improve results by leveraging the strengths of the different approaches

    Linking Folksonomies and Ontologies for Supporting Knowledge Sharing: a State of the Art

    Get PDF
    Deliverable of ISICIL ANR-funded projectSocial tagging systems have recently become very popular as a means to classify large sets of resources shared among on-line communities over the social Web. However, the folksonomies resulting from the use of these systems revealed limitations: tags are ambiguous and their spelling may vary, and folksonomies are difficult to exploit in order to retrieve or exchange information. This report compares the recent attempts to overcome these limitations and to support the use of folksonomies with formal languages and ontologies from the Semantic Web

    Learning Structured Knowledge from Social Tagging Data A critical review of methods and techniques

    Get PDF
    For more than a decade, researchers have been proposing various methods and techniques to mine social tagging data and to learn structured knowledge. It is essential to conduct a comprehensive survey on the related work, which would benefit the research community by providing better understanding of the state-of-the-art and insights into the future research directions. The paper first defines the spectrum of Knowledge Organization Systems, from unstructured with less semantics to highly structured with richer semantics. It then reviews the related work by classifying the methods and techniques into two main categories, namely, learning term lists and learning relations. The method and techniques originated from natural language processing, data mining, machine learning, social network analysis, and the Semantic Web are discussed in detail under the two categories. We summarize the prominent issues with the current research and highlight future directions on learning constantly evolving knowledge from social media data

    Ontology Learning and Knowledge Discovery Using the Web: Challenges and Recent Advances

    Get PDF
    By providing interoperability and shared meaning across actors and domains, lightweight domain ontologies are a cornerstone technology of the Semantic Web. This chapter investigates evidence sources for ontology learning and describes a generic and extensible approach to ontology learning that combines such evidence sources to extract domain concepts, identify relations between the ontology’s concepts, and detect relation labels automatically. An implementation illustrates the presented ontology learning and relation labeling framework and serves as the basis for dis- cussing possible pitfalls in ontology learning. Afterwards, three use cases demonstrate the usefulness of the presented framework and its application to real-world problems

    A Survey of the First 20 Years of Research on Semantic Web and Linked Data

    Get PDF
    International audienceThis paper is a survey of the research topics in the field of Semantic Web, Linked Data and Web of Data. This study looks at the contributions of this research community over its first twenty years of existence. Compiling several bibliographical sources and bibliometric indicators , we identify the main research trends and we reference some of their major publications to provide an overview of that initial period. We conclude with some perspectives for the future research challenges.Cet article est une étude des sujets de recherche dans le domaine du Web sémantique, des données liées et du Web des données. Cette étude se penche sur les contributions de cette communauté de recherche au cours de ses vingt premières années d'existence. En compilant plusieurs sources bibliographiques et indicateurs bibliométriques, nous identifions les principales tendances de la recherche et nous référençons certaines de leurs publications majeures pour donner un aperçu de cette période initiale. Nous concluons avec une discussion sur les tendances et perspectives de recherche

    Deliverable D4.1 Specification of user profiling and contextualisation

    Get PDF
    This deliverable presents a comprehensive research of past work in the field of capturing and interpreting user preferences and context and an overview of relevant digital media-specific techniques, aiming to provide insights and ideas for innovative context-aware user preference learning and to justify the user modelling strategies considered within LinkedTV’s WP4. Based on this research and a study over the specific technical and conceptual requirements of LinkedTV, a prototypical design for profiling and contextualizing user needs in a linked media environment is specified

    Co-evolución entre la Web Social y la Web Semántica

    Get PDF
    La Web Social y la Web Semántica han impactado en la forma en que la creación de conocimiento se ha llevado a cabo en la Web. La Web Social promociona la participación de los usuarios para crear y editar contenido y conocimiento en la Web. La proliferación de contenido y la necesidad de tener una administración automatizada de esta información disparó la aparición de la Web Semántica. Actualmente, la Web Social y la Web Semántica conviven y comparten un mismo tema: un mejor manejo del conocimiento. Sin embargo, la mayoría de la información en la Web Social no es parte de la Web Semántica, y la información de la Web Semántica no es utilizada para mejorar a la Web Social. Esta tesis presenta un enfoque innovador para estimular una co-evolución entre la Web Semántica y la Web Social: las fuerzas que impulsan la Web Social y las herramientas que llevan a cabo la Web Semántica trabajando en conjunto con el fin de tener beneficios mutuos. En este trabajo afirmamos que la co-evolución entre la Web Social y la Web Semántica mejorará la generación de información semántica en la Web Semántica, y mejorará la producción de conocimiento en la Web Social. Esto invita a responder las siguientes preguntas: ¿Cómo puede incluirse la generación de datos semánticos en las actividades de los usuarios de la Web Social? ¿Como puede definirse la semántica de un recurso web en un entorno social? ¿Cómo puede inyectarse en la Web Social las nuevas piezas de información extraídas de la Web Semántica? ¿Poseen las comunidades de la Web Social convenciones generales que deban ser respetadas? Con el fin de mejorar la Web Semántica con las fuerzas de la Web Social, en este trabajo se proponen dos enfoques de Social Semantic Tagging: P-Swooki que permite a usuarios de una wiki semántica gestionar anotaciones semánticas permitiendo completar el proceso de construcción de conocimiento, y Semdrops que permite a los usuarios describir en forma semántica cualquier recurso de la Web tanto en un espacio de conocimiento personal como en un espacio compartido. Además, con el fin de mejorar el contenido de la Web Social, proponemos BlueFinder: un sistema de recomendación que detecta y recomienda la mejor manera de representar en un sitio de la Web Social, información que es extraída de la Web Semántica. En particular, BlueFinder recomienda la manera de representar una propiedad semántica de DBpedia en Wikipedia, respetando las convenciones de la comunidad de usuarios de Wikipedia.Tesis realizada en co-tutela con la Universidad de Nantes (Francia). Director de tesis por la Universidad de Nantes: Pascal Molli; co-director de tesis por la Universidad de Nantes: Hala Skaf-Molli. Grado alcanzado por la Universidad de Nantes: Docteur de l'Université de Nantes.Facultad de Informátic

    Semantic web approach for italian graduates' surveys: the AlmaLaurea ontology proposal

    Get PDF
    Il crescente sviluppo e la promozione della trasparenza dei dati nell’ambito della pubblica amministrazione copre molteplici aspetti, fra cui l’educazione universitaria. Attualmente sono difatti numerosi i dataset rilasciati in formato Linked Open Data disponibili a livello nazionale ed internazionale. Fra le informazioni pubblicamente disponibili spiccano concetti riguardo l’occupazione e la numerosità dei laureati. Nonostante il progresso riscontrato, la mancanza di una metodologia standard per la descrizione di informazioni statistiche sui laureati rende difficoltoso un confronto di determinati fatti a partire da differenti sorgenti di dati. Sul piano nazionale, le indagini AlmaLaurea colmano il gap informativo dell’eterogeneità delle fonti proponendo statistiche centralizzate su profilo dei laureati e relativa condizione occupazionale, aggiornate annualmente. Scopo del progetto di tesi è la realizzazione di un’ontologia di dominio che descriva diverse peculiarità dei laureati, promuovendo allo stesso tempo la definizione strutturata dei dati AlmaLaurea e la successiva pubblicazione nel contesto Linked Open Data. Il progetto, realizzato con l’ausilio delle tecnologie del Web Semantico, propone infine la creazione di un endpoint SPARQL e di una interfaccia web per l'interrogazione e la visualizzazione dei dati strutturati
    corecore