Search CORE

2 research outputs found

An Extensible Light-Weight XML-Based Monitoring System for Sequence Databases

Author: E. Cerami
F. Neven
J. Bleiholder
J. Bleiholder
J.F. Wilson
K. Hokamp
K. Hokamp
M. Altinel
M. Shultz
M.R. Garey
S.F. Altschul
T.J. Green
Y. Diao
Y. Diao
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2006
Field of study

Abstract. Life science researchers want biological information in their interest to become available to them as soon as possible. A monitoring system is a solution that relieves biologists from periodic exploration of databases. In particular, it allows them to express their interest in cer-tain data by means of queries/constraints; they are then notified when new data arrives satisfying these queries/constraints. We describe a se-quence monitoring system XSeqM where users can combine metadata queries on sequence records with constraints on an alignment against a given source sequence. The system is an XML-based solution where con-straints are specified through search fields in a user-friendly web interface and which are then translated to corresponding XPath-expressions. The system is easily extensible as addition of new databases to the system then only amounts to the specification of new mappings from search fields to XPath-expressions. To protect private source sequences obtained in labs, it is imperative that researchers do not have to upload their se-quences to a general untrusted system, but that they can run XSeqM locally. To keep the system light-weight, we therefore introduce an op-timization technique based on query containment to reduce the number of XPath-evaluations which constitutes the bottleneck of the system. We experimentally validate this technique and show that it can drastically improve the running time.

CiteSeerX

Crossref