Search CORE

74 research outputs found

Information Discovery on Electronic Health Records Using Authority Flow Techniques

Author: A Balmin
A Singhal
A Singhal
AK Sehgal
CJ McDonald
DL Shepelyansky
F Farfán
H Hwang
J Savoy
JF Fontaine
L Guo
M Brinkmeier
MG Weiner
MI Lieberman
Michael Weiner
Paul Biondich
R Moskovitch
R Motwani
R Varadarajan
Ramakrishna R Varadarajan
RM Podowski
S Agrawal
S Brin
SE Robertson
SE Robertson
T Haveliwala
T Matsunaga
V Hristidis
V Hristidis
Vagelis Hristidis
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Abstract Background As the use of electronic health records (EHRs) becomes more widespread, so does the need to search and provide effective information discovery within them. Querying by keyword has emerged as one of the most effective paradigms for searching. Most work in this area is based on traditional Information Retrieval (IR) techniques, where each document is compared individually against the query. We compare the effectiveness of two fundamentally different techniques for keyword search of EHRs. Methods We built two ranking systems. The traditional BM25 system exploits the EHRs' content without regard to association among entities within. The Clinical ObjectRank (CO) system exploits the entities' associations in EHRs using an authority-flow algorithm to discover the most relevant entities. BM25 and CO were deployed on an EHR dataset of the cardiovascular division of Miami Children's Hospital. Using sequences of keywords as queries, sensitivity and specificity were measured by two physicians for a set of 11 queries related to congenital cardiac disease. Results Our pilot evaluation showed that CO outperforms BM25 in terms of sensitivity (65% vs. 38%) by 71% on average, while maintaining the specificity (64% vs. 61%). The evaluation was done by two physicians. Conclusions Authority-flow techniques can greatly improve the detection of relevant information in EHRs and hence deserve further study.</p

Crossref

IUPUIScholarWorks

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

DigitalCommons@Florida International University

Keyword search on external memory data graphs

Author: Balmin Andrey
Bijay Kumar Gaurav
Buchsbaum A. L.
Buchsbaum A. L.
Graupmann J.
Gupta Nitin
Hristidis V.
Hristidis V.
Kacholia Varun
Raghavan Sriram
Publication venue: 'VLDB Endowment'
Publication date
Field of study

Crossref

Processing top-N relational queries by learning

Author: A. Marian
A. Motro
A. Silberschatz
B. L. Bowerman
Chunnian Liu
Dazhong Liu
I. Ilyas
K. Zhao
L. Zhu
Liang Zhu
M. Zhu
N. Bruno
S. Chaudhuri
S.-W. Hwang
S.-W. Hwang
V. Hristidis
W. Fleming
Weiyi Meng
Wenzhu Yang
Y. Chen
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

MeanKS

Author: Hristidis V.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date
Field of study

Crossref

Structured search result differentiation

Author: Hristidis V.
Kacholia V.
Publication venue: 'VLDB Endowment'
Publication date
Field of study

Crossref

Templated Search over Relational Databases

Author: Hristidis V.
Robinson I.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date
Field of study

Crossref

REGAL +

Author: Hristidis V.
Panev K.
Publication venue: 'VLDB Endowment'
Publication date
Field of study

Crossref

Finding patterns in a knowledge base using keywords to compose table answers

Author: Hristidis V.
Kacholia V.
Li F.
Publication venue: 'VLDB Endowment'
Publication date
Field of study

Crossref

Extracting k most important groups from data efficiently

Author: Hristidis V
Mamoulis N
Yiu ML
Publication venue: 'Elsevier BV'
Publication date: 01/01/2008
Field of study

We study an important data analysis operator, which extracts the k most important groups from data (i.e., the k groups with the highest aggregate values). In a data warehousing context, an example of the above query is "find the 10 combinations of product-type and month with the largest sum of sales". The problem is challenging as the potential number of groups can be much larger than the memory capacity. We propose on-demand methods for efficient top-k groups processing, under limited memory size. In particular, we design top-k groups retrieval techniques for three representative scenarios as follows. For the scenario with data physically ordered by measure, we propose the write-optimized multi-pass sorted access algorithm (WMSA), that exploits available memory for efficient top-k groups computation. Regarding the scenario with unordered data, we develop the recursive hash algorithm (RHA), which applies hashing with early aggregation, coupled with branch-and-bound techniques and derivation heuristics for tight score bounds of hash partitions. Next, we design the clustered groups algorithm (CGA), which accelerates top-k groups processing for the case where data is clustered by a subset of group-by attributes. Extensive experiments with real and synthetic datasets demonstrate the applicability and efficiency of the proposed algorithms. © 2008 Elsevier B.V. All rights reserved.link_to_subscribed_fulltex

VBN

HKU Scholars Hub

Discovering queries based on example tuples

Author: Golovin D.
Golovin D.
Hristidis V.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date
Field of study

Crossref