46 research outputs found

Risk-Averse Matchings over Uncertain Graph Databases

Author: A Khan
AE Roth
B Bollobás
D Liben-Nowell
G Kollios
J Edmonds
LG Valiant
M Kargar
M Kearns
M Potamias
N Bansal
N Chen
NJ Krogan
NN Dalvi
P Berman
P Boldi
RM Karp
S Asthana
YH Chan
Publication date: 09/01/2018

A large number of applications such as querying sensor networks, and analyzing protein-protein interaction (PPI) networks, rely on mining uncertain graph and hypergraph databases. In this work we study the following problem: given an uncertain, weighted (hyper)graph, how can we efficiently find a (hyper)matching with high expected reward, and low risk? This problem naturally arises in the context of several important applications, such as online dating, kidney exchanges, and team formation. We introduce a novel formulation for finding matchings with maximum expected reward and bounded risk under a general model of uncertain weighted (hyper)graphs that we introduce in this work. Our model generalizes probabilistic models used in prior work, and captures both continuous and discrete probability distributions, thus allowing to handle privacy related applications that inject appropriately distributed noise to (hyper)edge weights. Given that our optimization problem is NP-hard, we turn our attention to designing efficient approximation algorithms. For the case of uncertain weighted graphs, we provide a $\frac{1}{3}$-approximation algorithm, and a $\frac{1}{5}$-approximation algorithm with near optimal run time. For the case of uncertain weighted hypergraphs, we provide a $\Omega(\frac{1}{k})$-approximation algorithm, where $k$ is the rank of the hypergraph (i.e., any hyperedge includes at most $k$ nodes), that runs in almost (modulo log factors) linear time. We complement our theoretical results by testing our approximation algorithms on a wide variety of synthetic experiments, where we observe in a controlled setting interesting findings on the trade-off between reward, and risk. We also provide an application of our formulation for providing recommendations of teams that are likely to collaborate, and have high impact. Comment: 25 page

arXiv.org e-Print Archive

Crossref

Direction-preserving trajectory simplification

Author: Brakatsoulas S.
Brakatsoulas S.
Cao H.
Chen M.
Chen Y.
Douglas D.
Giannotti F.
Gudmundsson J.
Hung C.-C.
Kellaris G.
Kolesnikov A.
Lange R.
Lee J. G.
Lee J. G.
Lee J. G.
Meratnia N.
Muckell J.
Patel D.
Pelekis N.
Potamias M.
Singh M.
Yuan J.
Zheng Y.
Publication venue: VLDB Endowment

Crossref

Reviewing the integration of patient data: how systems are evolving in practice to meet patient needs

Author: A Berler
A Taddei
A Taddei
A Taddei
A Takada
A Zuker
AAT Bui
Altamiro M Costa-Pereira
Ana M Ferreira
BK Stewart
C Breant
C Carpeggiani
C Safran
CJ McDonald
CJ McDonald
D Bandon
D Kalra
DG Katehakis
F Borst
F Chiarugi
F Malamateniou
F Malamateniou
F Malamateniou
F Uckert
Filipa C Almeida
G Aloisio
G Hripcsak
G Potamias
G Quade
H Heathfield
H Li
H Munch
HJ Lowe
IK Kim
J Bergmann
J Bernarding
J Ferranti
J Grimson
J Grimson
J Grimson
J Overhage
J Reina-Tosina
J Zhang
JC Wyatt
JD Halamka
JD Kay
JD Kay
Jeremy C Wyatt
JM Overhage
JR Scherrer
K Bernstein
L Bird
M Berg
M Hägglund
M Poulymenopoulou
M Tsiknakis
M Wagner
M Zviran
MA Morales
ML Muller
ML Muller
ML Muller
ML Müller
MYY Law
NA Goulas
NT Cheunga
P Gong
P Hanzlicek
P Littlejohns
PD Clayton
Pedro M Vieira-Marques
PG Biondich
PM Kuzmak
R Cruz-Correia
R Lenz
R Strom
RE Dayhoff
RG Duncan
RG Jost
Ricardo J Cruz-Correia
S Bouam
S Koch
S Mycek
S Orphanoudakis
SC Lin
SS Spyrou
T Kaae
T Kalinski
T Schabetsberger
T Schabetsberger
U Altmann
U Tachinardi
W Grimson
W Grimson
WB Lober
Y Xu
Publication venue: BioMed Central
Publication date: 01/01/2007

Background: The integration of Information Systems (IS) is essential to support shared care and to provide consistent, patient-centred care to individuals. This paper identifies, appraises and summarises studies examining different approaches to integrating patient data from heterogeneous IS. Methods: The literature from 1995 to 2005 was systematically reviewed to identify articles mentioning patient records, computers and data integration or sharing. Results: Of 3124 articles, 84 were included, describing 56 distinct projects. Most of the projects were on a regional scale. Integration was most commonly accomplished by messaging with pre-defined templates and middleware solutions. HL7 was the most widely used messaging standard. Direct database access and web services were the most common communication methods. The user interface for most systems was a Web browser. Regarding the type of medical data shared, 77% of projects integrated diagnosis and problems, 67% medical images and 65% lab results. More recently, significantly more IS are extending to primary care and integrating referral letters. Conclusion: It is clear that Information Systems are evolving to meet people's needs by implementing regional networks, allowing patient access and integration of ever more items of patient data. Many distinct technological solutions coexist to integrate patient data, using differing standards and data architectures, which may hinder further interoperability.

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Efficient simrank-based similarity join over large graphs

Author: Abbassi Z.
Albert Réka
Chan E. P. F.
Cheng J.
Dinur I.
Fan W.
Fogaras D.
Gentle J. E.
Jeh G.
Karakostas G.
Karypis G.
Kessler M. M.
Khan A.
Khot S.
Li C.
Li P.
Li P.
Liben-Nowell D.
Lizorkin D.
McCallum A.
McPherson M.
Potamias M.
Small H.
Sun L.
Tretyakov K.
Trißl S.
Wang H.
Zhang S.
Zhang S.
Zhao P.
Zou L.
Publication venue: VLDB Endowment

Crossref

Grid-based knowledge discovery in clinico-genomic data

Author: May M.
Potamias G.
Rüping S.
Publication date: 01/01/2006

Fraunhofer-ePrints

Nearest neighbor retrieval using distance-based hashing

Author: Athitsos V.
Kollios G.
Papapetrou Panagiotis
Potamias M.
Publication venue: Institute of Electrical and Electronics Engineers (IEEE)
Publication date: 01/01/2008

A method is proposed for indexing spaces with arbitrary distance measures, so as to achieve efficient approximate nearest neighbor retrieval. Hashing methods, such as Locality Sensitive Hashing (LSH), have been successfully applied for similarity indexing in vector spaces and string spaces under the Hamming distance. The key novelty of the hashing technique proposed here is that it can be applied to spaces with arbitrary distance measures, including non-metric distance measures. First, we describe a domain-independent method for constructing a family of binary hash functions. Then, we use these functions to construct multiple multibit hash tables. We show that the LSH formalism is not applicable for analyzing the behavior of these tables as index structures. We present a novel formulation that uses statistical observations from sample data to analyze retrieval accuracy and efficiency for the proposed indexing method. Experiments on several real-world data sets demonstrate that our method produces good trade-offs between accuracy and efficiency, and significantly outperforms VP-trees, which are a well-known method for distance-based indexing.
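
The abstract does not spell out how the binary hash functions are built, so the following is only a minimal sketch of one possible construction that needs nothing but a distance function: project an object onto the "line" through two pivot objects and test whether the projection falls inside an interval. The names `line_projection`, `make_binary_hash` and `hash_key`, the pivots, and the thresholds `t1`, `t2` are all illustrative assumptions, not taken from the listed paper.

```python
def line_projection(dist, x, p1, p2):
    """Project x onto the pseudo-line through pivots p1 and p2 using distances only.

    Assumes dist(p1, p2) > 0. In a Euclidean space this equals the true
    projection coordinate; for arbitrary distances it is just a real-valued score.
    """
    d12 = dist(p1, p2)
    return (dist(x, p1) ** 2 + d12 ** 2 - dist(x, p2) ** 2) / (2 * d12)


def make_binary_hash(dist, p1, p2, t1, t2):
    """Return a function mapping an object to 1 if its projection lies in [t1, t2], else 0."""
    def h(x):
        return 1 if t1 <= line_projection(dist, x, p1, p2) <= t2 else 0
    return h


def hash_key(x, functions):
    """Concatenate several binary hash values into one multibit table key."""
    return tuple(h(x) for h in functions)


# Toy usage with numbers and absolute difference as the (hypothetical) distance.
abs_dist = lambda a, b: abs(a - b)
h_a = make_binary_hash(abs_dist, 0, 10, 0.0, 5.0)
h_b = make_binary_hash(abs_dist, 0, 10, 2.5, 7.5)
key_for_3 = hash_key(3, [h_a, h_b])
key_for_8 = hash_key(8, [h_a, h_b])
```

In this toy setting, objects that are close under the distance tend to receive the same key and so land in the same bucket; how pivots and thresholds should be chosen in practice is left open here.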

CiteSeerX

Crossref

Birkbeck Institutional Research Online

Mining Interesting Clinico-Genomic Associations: The HealthObs Approach

Author: A. Amir
A. Analyti
D. Kanterakis
F. Cardoso
G. Potamias
G. Potamias
G. Potamias
H.P. Eich
J. Grimson
L.J. Veer van t
M. May
M. Tsiknakis
M. Tsiknakis
O.M. San
R.J. Bayardo Jr.
S. Gupta
S.K. Gruvberger
Publication venue: Springer Science and Business Media LLC
Publication date: 01/01/2007

Crossref

Modeling susceptibility to periodontitis

Author: Koumakis L.
Laine M.L.
Loos B.G.
Moustakis V.
Potamias G.
Publication venue: SAGE Publications
Publication date: 01/01/2013

Chronic inflammatory diseases like periodontitis have a complex pathogenesis and a multifactorial etiology, involving complex interactions between multiple genetic loci and infectious agents. We aimed to investigate the influence of genetic polymorphisms and bacteria on chronic periodontitis risk. We determined the prevalence of 12 single-nucleotide polymorphisms (SNPs) in immune response candidate genes and 7 bacterial species of potential relevance to periodontitis etiology, in chronic periodontitis patients and non-periodontitis control individuals (N = 385). Using decision tree analysis, we identified the presence of bacterial species Tannerella forsythia, Porphyromonas gingivalis, Aggregatibacter actinomycetemcomitans, and SNPs TNF -857 and IL-1A -889 as discriminators between periodontitis and non-periodontitis. The model reached an accuracy of 80%, sensitivity of 85%, specificity of 73%, and AUC of 73%. This pilot study shows that, on the basis of 3 periodontal pathogens and SNPs, patterns may be recognized to identify patients at risk for periodontitis. Modern bioinformatics tools are valuable in modeling the multifactorial and complex nature of periodontitis.

VU Research Portal

International Migration, Integration and Social Cohesion online publications

Embedding-based subsequence matching in time-series databases

Author: Papapetrou P.
Athitsos V.
Potamias M.
Kollios G.
Gunopulos D.
Publication date: 01/01/2011

We propose an embedding-based framework for subsequence matching in time-series databases that improves the efficiency of processing subsequence matching queries under the Dynamic Time Warping (DTW) distance measure. This framework partially reduces subsequence matching to vector matching, using an embedding that maps each query sequence to a vector and each database time series into a sequence of vectors. The database embedding is computed offline, as a preprocessing step. At runtime, given a query object, an embedding of that object is computed online. Relatively few areas of interest are efficiently identified in the database sequences by comparing the embedding of the query with the database vectors. Those areas of interest are then fully explored using the exact DTW-based subsequence matching algorithm. We apply the proposed framework to define two specific methods. The first method focuses on time-series subsequence matching under unconstrained Dynamic Time Warping. The second method targets subsequence matching under constrained Dynamic Time Warping (cDTW), where warping paths are not allowed to stray too much off the diagonal. In our experiments, good trade-offs between retrieval accuracy and retrieval efficiency are obtained for both methods, and the results are competitive with respect to current state-of-the-art methods. © 2011 ACM
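
For readers unfamiliar with the distance measures named in this abstract, here is a minimal sketch of the textbook dynamic-programming computation of DTW between two whole sequences, with an optional band that keeps the warping path near the diagonal (the constrained variant). It is not the embedding framework or the subsequence-matching algorithm of the listed paper; the function name `dtw_distance`, the `window` parameter and the absolute-difference point cost are assumptions made for illustration.

```python
import math


def dtw_distance(query, series, window=None):
    """Dynamic Time Warping cost between two numeric sequences.

    window=None gives unconstrained DTW. An integer window restricts the
    warping path to cells with |i - j| <= window (a diagonal band), widened
    if needed so that sequences of different lengths can still be aligned.
    """
    n, m = len(query), len(series)
    if window is not None:
        window = max(window, abs(n - m))
    # cost[i][j] = best cost of aligning query[:i] with series[:j]
    cost = [[math.inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        lo = 1 if window is None else max(1, i - window)
        hi = m if window is None else min(m, i + window)
        for j in range(lo, hi + 1):
            step = abs(query[i - 1] - series[j - 1])
            cost[i][j] = step + min(cost[i - 1][j],      # repeat series[j-1]
                                    cost[i][j - 1],      # repeat query[i-1]
                                    cost[i - 1][j - 1])  # advance both
    return cost[n][m]


unconstrained = dtw_distance([1, 1, 1, 5], [1, 5, 5, 5])
constrained = dtw_distance([1, 1, 1, 5], [1, 5, 5, 5], window=1)
```

In the example, unconstrained warping aligns the two sequences at zero cost, while the band forces at least one mismatched pairing, so the constrained cost is higher. This table costs time proportional to the product of the two lengths, which is the per-comparison expense that filtering candidates before exact matching is meant to avoid.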

Pergamos : Unified Institutional Repository / Digital Library Platform of the National and Kapodistrian University of Athens