144 research outputs found

Temporal Evolution of the Migration-related Topics on Social Media

Author: Alam M.
Chen Y.
Gesese G. A.
Sack H.
Seneviratne O.
Sequeda J.
Etcheverry L.
Pesquita C.
Publication venue: RWTH Aachen
Publication date: 01/01/2021

This poster focuses on capturing the temporal evolution of migration-related topics in relevant tweets. It uses the Dynamic Embedded Topic Model (DETM) as a learning algorithm to perform a quantitative and qualitative analysis of these emerging topics. TweetsKB is extended with the extracted Twitter dataset along with the results of DETM, which takes temporality into account. These results are then further analyzed and visualized. The analysis reveals that the trajectories of the migration-related topics are in agreement with historical events.

KITopen

Results of the ontology alignment evaluation initiative 2019

Author: Algergawy A.
Faria D.
Ferrara A.
Fundulaki I.
Harrow I.
Hertling S.
Jimenez-Ruiz E.
Karam N.
Khiat A.
Lambrix P.
Li H.
Montanelli S.
Paulheim H.
Pesquita C.
Saveta T.
Shvaiko P.
Splendiani A.
Thiéblin E.
Trojahn C.
Vataščinová J.
Zamazal O.
Zhou L.
Publication date: 01/01/2019

The Ontology Alignment Evaluation Initiative (OAEI) aims at comparing ontology matching systems on precisely defined test cases. These test cases can be based on ontologies of different levels of complexity (from simple thesauri to expressive OWL ontologies) and use different evaluation modalities (e.g., blind evaluation, open evaluation, or consensus). The OAEI 2019 campaign offered 11 tracks with 29 test cases, and was attended by 20 participants. This paper is an overall presentation of that campaign.

City Research Online

MAnnheim DOCument Server

Crowd-assessing quality in uncertain data linking datasets

Author: Faria D.
Ferrara A.
Jimenez-Ruiz E.
Montanelli S.
Pesquita C.
Publication venue: 'Cambridge University Press (CUP)'
Publication date: 01/01/2020

The quality of a dataset used for evaluating data linking methods, techniques, and tools depends on the availability of a set of mappings, called the reference alignment, that is known to be correct. In particular, it is crucial that mappings effectively represent relations between pairs of entities that are indeed similar because they denote the same object. Since the reliability of mappings is decisive for a fair evaluation of automatic linking methods and tools, we call this property of mappings mapping fairness. In this article, we propose a crowd-based approach, called Crowd Quality (CQ), for assessing the quality of data linking datasets by measuring the fairness of the mappings in the reference alignment. Moreover, we present a real experiment in which we evaluate two state-of-the-art data linking tools before and after the refinement of the reference alignment based on the CQ approach, in order to show the benefits deriving from the crowd assessment of mapping fairness.

City Research Online

AIR Universita degli studi di Milano

NORA - Norwegian Open Research Archives

An improved method for scoring protein-protein interactions using semantic similarity within the gene ontology

Author: A del Pozo
A Hofer
A Patil
A Schlicker
AC Gavin
AJ Faller
C Pesquita
C Pesquita
C Pesquita
C Pesquita
CM Deane
D Li
D Lin
D Warde-Farley
DP Eisinger
DR Rhodes
F Azuaje
FM Couto
G Alterovitz
G van Rossum
G Yu
Gary D Bader
H Wu
H Yu
H Yu
I Xenarios
J Cheng
JJ Jiang
JL Sevilla
JZ Wang
K Strasser
K Xia
LJ Jensen
M Ashburner
M West
N Nariai
NJ Krogan
P Resnik
P Uetz
P Zhang
R Gentleman
R Shen
S Chavez
S Razick
Shobhit Jain
T Xu
U Stelzl
X Guo
Y Chen
Y Tao
Z Lei
Publication venue: BioMed Central
Publication date: 01/01/2010

Background: Semantic similarity measures are useful to assess the physiological relevance of protein-protein interactions (PPIs). They quantify similarity between proteins based on their function using annotation systems like the Gene Ontology (GO). Proteins that interact in the cell are likely to be in similar locations or involved in similar biological processes compared to proteins that do not interact. Thus, the more semantically similar the gene function annotations are among the interacting proteins, the more likely the interaction is physiologically relevant. However, most semantic similarity measures used for PPI confidence assessment do not consider the unequal depth of term hierarchies in different classes of cellular location, molecular function, and biological process ontologies of GO and thus may over- or under-estimate similarity. Results: We describe an improved algorithm, Topological Clustering Semantic Similarity (TCSS), to compute semantic similarity between GO terms annotated to proteins in interaction datasets. Our algorithm considers the unequal depth of biological knowledge representation in different branches of the GO graph. The central idea is to divide the GO graph into sub-graphs and score PPIs higher if the participating proteins belong to the same sub-graph than if they belong to different sub-graphs. Conclusions: The TCSS algorithm performs better than the other semantic similarity measurement techniques that we evaluated in terms of their performance on distinguishing true from false protein interactions, and correlation with gene expression and protein families. We show an average improvement of 4.6 times the F1 score over Resnik, the next best method, on our Saccharomyces cerevisiae PPI dataset and 2 times on our Homo sapiens PPI dataset using cellular component, biological process and molecular function GO annotations.

University of Toronto Research Repository

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Global Network Alignment Using Multiscale Spectral Signatures

Author: Altschul
Ashburner
Banerjee
C. Kingsford
Fields
Gavin
Jaeger
Kanehisa
Milenkovic
Ovaska
Parrish
Peregrin-Alvarez
Pesquita
R. Patro
Publication date: 01/01/2011

Motivation: Protein interaction networks provide an important system-level view of biological processes. One of the fundamental problems in biological network analysis is the global alignment of a pair of networks, which puts the proteins of one network into correspondence with the proteins of another network in a manner that conserves their interactions while respecting other evidence of their homology. By providing a mapping between the networks of different species, alignments can be used to inform hypotheses about the functions of unannotated proteins, the existence of unobserved interactions, the evolutionary divergence between the two species and the evolution of complexes and pathways. Results: We introduce GHOST, a global pairwise network aligner that uses a novel spectral signature to measure topological similarity across disparate networks. It exhibits state-of-the-art performance on several network alignment tasks. We show that the spectral signature used by GHOST is highly discriminative, while the alignments it produces are also robust to experimental noise. When compared with other recent approaches, we find that GHOST is able to recover larger and biologically significant shared subnetworks between species. Availability: An efficient and parallelized implementation of GHOST, released under the Apache 2.0 license, is available at http://cbcb.umd.edu/kingsford-group/ghost. Funding: This work was supported by the National Science Foundation [CCF-1053918, EF-0849899, and IIS-0812111]; the National Institutes of Health [1R21AI085376]; and a University of Maryland Institute for Advanced Studies New Frontiers Award.

Crossref

PubMed Central

Digital Repository at the University of Maryland

Semantic Similarity for Automatic Classification of Chemical Compounds

Author: A Mehta
AM Richard
B Chandrasekaran
C Cortes
C Pesquita
C Pesquita
C Pesquita
D Healy
DR Flower
Francisco M. Couto
FV So
G Lehne
GW Bemis
GW Bemis
H Wolosker
IH Witten
JD Amsterdam
JE Penzotti
John B. O. Mitchell
João D. Ferreira
JP Keogh
JW Raymond
JW Raymond
L Markiewicz
LG Ranilla
M Kanehisa
MF Ullah
N Nikolova
P Baldi
P De Matos
P Jaccard
P Resnik
P Willett
PW Lord
R Dias
R Gentleman
R Guha
R Mishra
RJ Miksicek
RM Harris
RSR Zand
S Doniger
SK Kearsley
SM Ross
SQ Le
T Grego
T Joachims
V Svetnik
W Tong
Y Fukunishi
Y Fukunishi
Y Tohsato
Y Xue
YC Martin
Publication venue: Public Library of Science
Publication date: 01/09/2010

With the increasing amount of data made available in the chemical field, there is a strong need for systems capable of comparing and classifying chemical compounds in an efficient and effective way. The best approaches existing today are based on the structure-activity relationship premise, which states that the biological activity of a molecule is strongly related to its structural or physicochemical properties. This work presents a novel approach to the automatic classification of chemical compounds by integrating semantic similarity with existing structural comparison methods. Our approach was assessed based on the Matthews Correlation Coefficient for the prediction, and achieved values of 0.810 when used as a prediction of blood-brain barrier permeability, 0.694 for P-glycoprotein substrate, and 0.673 for estrogen receptor binding activity. These results show a significant improvement over the currently existing methods, whose best performances were 0.628, 0.591, and 0.647 respectively. It was demonstrated that the integration of semantic similarity is a feasible and effective way to improve existing chemical compound classification systems. Among other possible uses, this tool helps the study of the evolution of metabolic pathways, the study of the correlation of metabolic networks with properties of those networks, or the improvement of ontologies that represent chemical information.
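
For readers unfamiliar with the evaluation metric named in this abstract, the following is a minimal sketch of the standard Matthews Correlation Coefficient for a binary classifier. The function name and the confusion-matrix counts are illustrative assumptions and are not taken from the paper; returning 0.0 when the denominator is zero is a common convention, also assumed here.

```python
import math


def matthews_corrcoef(tp: int, tn: int, fp: int, fn: int) -> float:
    """Matthews Correlation Coefficient from binary confusion-matrix counts.

    Ranges from -1 (total disagreement) through 0 (no better than chance)
    to +1 (perfect prediction). Returns 0.0 when the denominator is zero,
    which is a convention assumed for this sketch.
    """
    denominator = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    if denominator == 0:
        return 0.0
    return (tp * tn - fp * fn) / denominator


if __name__ == "__main__":
    # Hypothetical counts for illustration only; not data from the paper.
    print(matthews_corrcoef(tp=45, tn=45, fp=5, fn=5))
```

Unlike plain accuracy, the coefficient uses all four cells of the confusion matrix, which is why it is often preferred when the two classes are of unequal size.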

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Metrics for GO based protein semantic similarity: a systematic evaluation

Author: A Schlicker
A Valencia
André O Falcão
António EN Ferreira
C Pesquita
C Wu
Catia Pesquita
D Devos
D Devos
D Faria
D Lin
Daniel Faria
E Camon
EB Camon
F Azuaje
F Azuaje
F Couto
F Couto
FM Couto
Francisco M Couto
Gentleman
Hugo Bastos
J Chabalier
J Jiang
J Tuikkala
JL Sevilla
L Stein
P Lord
P Lord
P Resnik
PH Lee
RM Othman
RM Riensche
S Cao
T Joshi
X Guo
X Wu
Y Tao
Z Lei
ZH Duan
Publication venue: BioMed Central
Publication date: 01/01/2008

Background: Several semantic similarity measures have been applied to gene products annotated with Gene Ontology terms, providing a basis for their functional comparison. However, it is still unclear which is the best approach to semantic similarity in this context, since there is no conclusive evaluation of the various measures. Another issue is whether electronic annotations should be used in semantic similarity calculations. Results: We conducted a systematic evaluation of GO-based semantic similarity measures using the relationship with sequence similarity as a means to quantify their performance, and assessed the influence of electronic annotations by testing the measures in the presence and absence of these annotations. We verified that the relationship between semantic and sequence similarity is not linear, but can be well approximated by a rescaled Normal cumulative distribution function. Given that the majority of the semantic similarity measures capture an identical behaviour, but differ in resolution, we used the latter as the main criterion of evaluation. Conclusions: This work has provided a basis for the comparison of several semantic similarity measures, and can aid researchers in choosing the most adequate measure for their work. We have found that the hybrid simGIC was the measure with the best overall performance, followed by Resnik's measure using a best-match average combination approach. We have also found that the average and maximum combination approaches are problematic since both are inherently influenced by the number of terms being combined. We suspect that there may be a direct influence of data circularity in the behaviour of the results including electronic annotations, as a result of functional inference from sequence similarity.
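
The remark in this abstract about combination approaches can be illustrated with a small sketch. It assumes one common formulation of each approach (average over all term pairs, maximum over all term pairs, and best-match average as the mean of the two directional best-match means); the paper's exact definitions may differ. The term identifiers and the exact-match `toy_sim` function are hypothetical stand-ins, not Resnik's measure.

```python
from itertools import product
from typing import Callable, Sequence

TermSim = Callable[[str, str], float]


def average_combination(a: Sequence[str], b: Sequence[str], sim: TermSim) -> float:
    """Mean term similarity over every pair (x, y) with x in a and y in b."""
    scores = [sim(x, y) for x, y in product(a, b)]
    return sum(scores) / len(scores)


def maximum_combination(a: Sequence[str], b: Sequence[str], sim: TermSim) -> float:
    """Highest term similarity over all pairs."""
    return max(sim(x, y) for x, y in product(a, b))


def best_match_average(a: Sequence[str], b: Sequence[str], sim: TermSim) -> float:
    """Mean of each term's best match in the other set, averaged over both directions."""
    best_for_a = [max(sim(x, y) for y in b) for x in a]
    best_for_b = [max(sim(x, y) for x in a) for y in b]
    return (sum(best_for_a) / len(best_for_a) + sum(best_for_b) / len(best_for_b)) / 2


def toy_sim(x: str, y: str) -> float:
    """Hypothetical term similarity: 1.0 for identical terms, 0.0 otherwise."""
    return 1.0 if x == y else 0.0


if __name__ == "__main__":
    two_terms = ["t1", "t2"]
    four_terms = ["t1", "t2", "t3", "t4"]
    # Each gene product is compared with an identically annotated one.
    for terms in (two_terms, four_terms):
        print(len(terms),
              average_combination(terms, terms, toy_sim),
              maximum_combination(terms, terms, toy_sim),
              best_match_average(terms, terms, toy_sim))
```

Under these toy assumptions, two identically annotated gene products score lower under the average approach as the number of annotated terms grows, while the best-match average stays at 1.0. The maximum approach has the opposite weakness: a single shared term is enough to give a perfect score, however many other terms differ.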

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Universidade de Lisboa: Repositório.UL

Knowledge Graphs for the Life Sciences: Recent Developments, Challenges and Opportunities

Author: Chen J.
Dong H.
Hastings J.
Jimenez-Ruiz E.
Lopez V.
Monnin P.
Pesquita C.
Tamma V.
Škoda P.
Publication date: 28/03/2024

The term life sciences refers to the disciplines that study living organisms and life processes, and include chemistry, biology, medicine, and a range of other related disciplines. Research efforts in life sciences are heavily data-driven, as they produce and consume vast amounts of scientific data, much of which is intrinsically relational and graph-structured. The volume of data and the complexity of scientific concepts and relations referred to therein promote the application of advanced knowledge-driven technologies for managing and interpreting data, with the ultimate aim to advance scientific discovery. In this survey and position paper, we discuss recent developments and advances in the use of graph-based technologies in life sciences and set out a vision for how these technologies will impact these fields into the future. We focus on three broad topics: the construction and management of Knowledge Graphs (KGs), the use of KGs and associated technologies in the discovery of new knowledge, and the use of KGs in artificial intelligence applications to support explanations (explainable AI). We select a few exemplary use cases for each topic, discuss the challenges and open research questions within these topics, and conclude with a perspective and outlook that summarizes the overarching challenges and their potential solutions as a guide for future research.

City Research Online

Open Research Exeter

Results of the ontology alignment evaluation initiative 2017

Author: Achichi M.
Cheatham M.
Dragisic Z.
Euzenat J.
Faria D.
Ferrara A.
Flouris G.
Fundulaki I.
Harrow I.
Ivanova V.
Jimenez-Ruiz E.
Kolthoff K.
Kuss E.
Lambrix P.
Leopold H.
Li H.
Meilicke C.
Mohammadi M.
Montanelli S.
Pesquita C.
Saveta T.
Shvaiko P.
Splendiani A.
Stuckenschmidt H.
Thiéblin E.
Todorov K.
Trojahn C.
Zamazal O.
Publication date: 01/01/2016

Ontology matching consists of finding correspondences between semantically related entities of different ontologies. The Ontology Alignment Evaluation Initiative (OAEI) aims at comparing ontology matching systems on precisely defined test cases. These test cases can be based on ontologies of different levels of complexity (from simple thesauri to expressive OWL ontologies) and use different evaluation modalities (e.g., blind evaluation, open evaluation, or consensus). The OAEI 2017 campaign offered 9 tracks with 23 test cases, and was attended by 21 participants. This paper is an overall presentation of that campaign.

City Research Online

Scientific Publications of the University of Toulouse II Le Mirail

Hal - Université Grenoble Alpes

AIR Universita degli studi di Milano

TU Delft Repository

INRIA a CCSD electronic archive server

Open Archive Toulouse Archive Ouverte

MAnnheim DOCument Server

Hal-Diderot

Results of SemTab 2021

Author: Abdelmageed N.
Chen J.
Cutrona V.
Efthymiou V.
Hassanzadeh O.
Hulsebos M.
Jimenez-Ruiz E.
Oliveira D.
Pesquita C.
Sequeda J.
Srinivas K.
Publication venue: CEUR Workshop Proceedings
Publication date: 09/03/2022

SemTab 2021 was the third edition of the Semantic Web Challenge on Tabular Data to Knowledge Graph Matching, successfully collocated with the 20th International Semantic Web Conference (ISWC) and the 16th Ontology Matching (OM) Workshop. SemTab provides a common framework to conduct a systematic evaluation of state-of-the-art systems.

City Research Online