14 research outputs found

    Data-Driven RDF Property Semantic-Equivalence Detection Using NLP Techniques

    Full text link
    DBpedia extracts most of its data from Wikipedia’s infoboxes. Manually-created “mappings” link infobox attributes to DBpedia ontology properties (dbo properties) producing most used DBpedia triples. However, infoxbox attributes without a mapping produce triples with properties in a different namespace (dbp properties). In this position paper we point out that (a) the number of triples containing dbp properties is significant compared to triples containing dbo properties for the DBpedia instances analyzed, (b) the SPARQL queries made by users barely use both dbp and dbo properties simultaneously, (c) as an exploitation example we show a method to automatically enhance SPARQL queries by using syntactic and semantic similarities between dbo properties and dbp properties

    Kaposi sarcoma in unusual locations

    Full text link

    Gli strumenti informatici. Sviluppo e risultati

    No full text
    Per riconoscere i tratti linguistici di interesse su un corpus composto da quasi tremila temi e per annotarli in modo coerente si è reso necessario lo sviluppo di diversi strumenti informatici. Tali software appartengono a due tipologie: da un lato, si sono sviluppati alcuni moduli per l'analisi del testo, che in modo automatico riconoscono dei tratti o estraggono delle informazioni parziali utili a riconoscere i tratti in modo manuale. Dall'altro, si è adattata al progetto una piattaforma online che permette di effettuare annotazione linguistica multilivello con diversi annotatori al lavoro in parallelo su porzioni diverse del corpus di temi

    Gli strumenti informatici. Sviluppo e risultati

    No full text
    Per riconoscere i tratti linguistici di interesse su un corpus composto da quasi tremila temi e per annotarli in modo coerente si è reso necessario lo sviluppo di diversi strumenti informatici. Tali software appartengono a due tipologie: da un lato, si sono sviluppati alcuni moduli per l'analisi del testo, che in modo automatico riconoscono dei tratti o estraggono delle informazioni parziali utili a riconoscere i tratti in modo manuale. Dall'altro, si è adattata al progetto una piattaforma online che permette di effettuare annotazione linguistica multilivello con diversi annotatori al lavoro in parallelo su porzioni diverse del corpus di temi

    From Conditional Random Field (CRF) to Rhetorical Structure Theory (RST): incorporating context information in sentiment analysis

    No full text
    This paper investigates a method based on Conditional Random Fields (CRFs) to incorporate sentence structure (syntax and semantics) and context information to identify sentiments of sentences. It also demonstrates the usefulness of the Rhetorical Structure Theory (RST) taking into consideration the discourse role of text segments. Thus, this paper’s aim is to reconsider the effectiveness of CRF and RST methods in incorporating the contextual information into Sentiment Analysis systems. Both methods are evaluated on two, different in size and genre of information sources, the Movie Review Dataset and the Finegrained Sentiment Dataset (FSD). Finally, we discuss the lessons learned from these experimental settings w.r.t. addressing the following key research questions such as whether there is an appropriate type of social media repository to incorporate contextual information, whether extending the pool of the selected features could improve context incorporation into SA systems and which is the best performing feature combination to achieve such improved performance

    Opinion Mining with a Clause-Based Approach

    No full text
    With different social media and commercial platforms, users express their opinion about products in a textual form. Automatically extracting the polarity (i.e. whether the opinion is positive or negative) of a user can be useful for both actors: the online platform incorporating the feedback to improve their product as well as the client who might get recommendations according to his or her preferences. Different approaches for tackling the problem, have been suggested mainly using syntactic features. The “Challenge on Semantic Sentiment Analysis” aims to go beyond the word-level analysis by using semantic information. In this paper we propose a novel approach by employing the semantic information of grammatical unit called preposition. We try to derive the target of the review from the summary information, which serves as an input to identify the proposition in it. Our implementation relies on the hypothesis that the proposition expressing the target of the summary, usually containing the main polarity information

    Aspect-Based Opinion Mining Using Knowledge Bases

    No full text
    In the last decade, the focus of the Opinion Mining field moved to detection of the pairs “aspect-polarity” instead of limiting approaches in the computation of the general polarity of a text. In this work, we propose an aspect-based opinion mining system based on the use of semantic resources for the extraction of the aspects from a text and for the computation of their polarities. The proposed system participated at the third edition of the Semantic Sentiment Analysis (SSA) challenge took place during ESWC 2017 achieving the runner-up place in the Task #2 concerning the aspect-based sentiment analysis. Moreover, a further evaluation performed on the SemEval 2015 benchmarks demonstrated the feasibility of the proposed approach

    The NeuroSent System at ESWC-2018 Challenge on Semantic Sentiment Analysis

    No full text
    Multi-domain sentiment analysis consists in estimating the polarity of a given text by exploiting domain-specific information. One of the main issues common to the approaches discussed in the literature is their poor capabilities of being applied on domains which are different from those used for building the opinion model. In this paper, we will present an approach exploiting the linguistic overlap between domains to build sentiment models supporting polarity inference for documents belonging to every domain. Word embeddings together with a deep learning architecture have been implemented for enabling the building of multi-domain sentiment model. The proposed technique is validated by following the Dranziera protocol in order to ease the repeatability of the experiments and the comparison of the results. The outcomes demonstrate the effectiveness of the proposed approach and also set a plausible starting point for future work

    The CLAUSY System at ESWC-2018 Challenge on Semantic Sentiment Analysis

    No full text
    With different social media and commercial platforms, users express their opinion about products in a textual form. Automatically extracting the polarity(i.e. whether the opinion is positive or negative) of a user can be useful for both actors: the online platform incorporating the feedback to improve their product as well as the client who might get recommendations according to his or her preferences. Different approaches for tackling the problem, have been suggested mainly using syntactic features. The “Challenge on Semantic Sentiment Analysis” aims to go beyond the word-level analysis by using semantic information. In this paper we propose a novel approach by employing the semantic information of grammatical unit called preposition. We try to derive the target of the review from the summary information, which serves as an input to identify the proposition in it. Our implementation relies on the hypothesis that the proposition expressing the target of the summary, usually containing the main polarity information

    The IRMUDOSA System at ESWC-2017 Challenge on Semantic Sentiment Analysis

    No full text
    Multi-Domain opinion mining consists in estimating the polarity of a document by exploiting domain-specific information. One of the main issue of the approaches discussed in literature is their poor capability of being applied on domains that have not been used for building the opinion model. In this paper, we present an approach exploiting the linguistic overlap between domains for building models enabling the estimation of polarities for documents belonging to any other domain. The system implementing such an approach has been presented at the third edition of the Semantic Sentiment Analysis Challenge co-located with ESWC 2017. Fuzzy representation of features polarity supports the modeling of information uncertainty learned from training set and integrated with knowledge extracted from two well-known resources used in the opinion mining field, namely Sentic.Net and the General Inquirer. The proposed technique has been validated on a multi-domain dataset and the results demonstrated the effectiveness of the proposed approach by setting a plausible starting point for future work
    corecore