Search CORE

146 research outputs found

Multi-Step Processing of Spatial Joins

Author: Bernhard Seeger
Brinkhoff T.
Dori D.
Gtinther O.
Hans-Peter Kriegel
Kriegel H.-P.
Ralf Schneider
Samet H.
Schneider R.
Sellis T.
Thomas Brinkhoff
Welzl E.
Publication venue
Publication date: 01/01/1994
Field of study

Spatial joins are one of the most important operations for combining spatial objects of several relations. In this paper, spatial join processing is studied in detail for extended spatial objects in twodimensional data space. We present an approach for spatial join processing that is based on three steps. First, a spatial join is performed on the minimum bounding rectangles of the objects returning a set of candidates. Various approaches for accelerating this step of join processing have been examined at the last year’s conference [BKS 93a]. In this paper, we focus on the problem how to compute the answers from the set of candidates which is handled by the following two steps. First of all, sophisticated approximations are used to identify answers as well as to filter out false hits from the set of candidates. For this purpose, we investigate various types of conservative and progressive approximations. In the last step, the exact geometry of the remaining candidates has to be tested against the join predicate. The time required for computing spatial join predicates can essentially be reduced when objects are adequately organized in main memory. In our approach, objects are first decomposed into simple components which are exclusively organized by a main-memory resident spatial data structure. Overall, we present a complete approach of spatial join processing on complex spatial objects. The performance of the individual steps of our approach is evaluated with data sets from real cartographic applications. The results show that our approach reduces the total execution time of the spatial join by factors

CiteSeerX

Crossref

Open Access LMU ( Ludwig-Maximilians-Univ. München)

Detecting Redundancy in Data Warehouse Evolution

Author: A. Gupta
D. Theodoratos
D. Theodoratos
D. Theodoratos
T. K. Sellis
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Информационная система сопровождения деятельности муниципального бюджетного учреждения "Центр технического контроля и обслуживания учреждений Управления Образования Администрации города Юрги"

Author: C. Brochhaus
C. Faloutsos
D. Yang
H. Ding
H. Sakoe
I. Assent
I. Assent
J. Aach
S. Babu
S. Salvador
T.K. Sellis
W.H. Tok
Publication venue
Publication date: 01/01/2011
Field of study

The analysis of most widespread modern software products and the choice of programming environments. In the capacity of the object of automation is considered: process control and maintenance institutions of education management Yurga city Administration

Electronic archive of Tomsk Polytechnic University

Crossref

Towards a better solution to the shortest common supersequence problem: the deposition and reduction algorithm

Author: D Gusfield
D Sankoff
DE Foulser
EA Hubbell
G Nicosia
Hon Wai Leong
J Branke
JA Storer
K Ning
Kang Ning
P Barone
R Michels
RW Irving
S Kasif
T Jiang
TH Cormen
TK Sellis
VG Timkovsky
Publication venue: BioMed Central
Publication date: 01/01/2006
Field of study

BACKGROUND: The problem of finding a Shortest Common Supersequence (SCS) of a set of sequences is an important problem with applications in many areas. It is a key problem in biological sequences analysis. The SCS problem is well-known to be NP-complete. Many heuristic algorithms have been proposed. Some heuristics work well on a few long sequences (as in sequence comparison applications); others work well on many short sequences (as in oligo-array synthesis). Unfortunately, most do not work well on large SCS instances where there are many, long sequences. RESULTS: In this paper, we present a Deposition and Reduction (DR) algorithm for solving large SCS instances of biological sequences. There are two processes in our DR algorithm: deposition process, and reduction process. The deposition process is responsible for generating a small set of common supersequences; and the reduction process shortens these common supersequences by removing some characters while preserving the common supersequence property. Our evaluation on simulated data and real DNA and protein sequences show that our algorithm consistently produces the best results compared to many well-known heuristic algorithms, and especially on large instances. CONCLUSION: Our DR algorithm provides a partial answer to the open problem of designing efficient heuristic algorithm for SCS problem on many long sequences. Our algorithm has a bounded approximation ratio. The algorithm is efficient, both in running time and space complexity and our evaluation shows that it is practical even for SCS problems on many long sequences

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

ScholarBank@NUS

Qingdao Institute of Bioenergy and Bioprocess Technology, Chinese Academy of Sciences

Approximate spatio-temporal retrieval

Author: ARGE L.
BACCHUS F.
BACCHUS F.
BERCHTOLD S.
BRINKHOFF T.
BRUNS T.
CHANG S.
Dimitris Papadias
EGENHOFER M.J.
GUTTMAN A.
HARALICK R.M.
HUANG Y. -W.
KOUDAS N.
LEE S.
Nikos Mamoulis
PAPADIAS D.
PAPADIAS D.
PAPADIAS D.
PAPADIAS D.
PARK H.
PREPARATA F.P.
ROTEM D.
ROUSSOPOULOS N.
SEIDL T.
SELLIS T.
Vasilis Delis
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date
Field of study

Crossref

SOWL QL: Querying Spatio - Temporal Ontologies in OWL

Author: A Artale
A Krokhin
AG Cohn
B Nebel
C Daskalakis
C Gutierrez
D Montello
E Sirin
I Budak Arpinar
J Allen
J Prez
J Renz
M Bodirsky
M Yannakakis
MJ Egenhofer
P Beek van
P Jonsson
R Guting
S Bykau
S Skiadopoulos
T Sellis
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 17/05/2016
Field of study

We introduce SOWL QL, a query language for spatio-temporal information in ontologies. Buildingupon SOWL (Spatio-Temporal OWL), an ontology for handling spatio-temporal information in OWL, SOWL QL supports querying over qualitative spatio-temporal information (expressed using natural language expressions such as “before”, “after”, “north of”, “south of”) rather than merely quantitative information (exact dates, times, locations). SOWL QL extends SPARQL with a powerful set of temporal and spatial operators, including temporal Allen topological, spatial directional and topological operations or combinations of the above. SOWL QL maintains simplicity of expression and also, upward and downward compatibility with SPARQL. Query translation in SOWL QL yields SPARQL queries implying that, querying spatio-temporal ontologies using SPARQL is still feasible but suffers from several drawbacks the most important of them being that, queries in SPARQL become particularly complicated and users must be familiar with the underlying spatio-temporal representation (the “N-ary relations” or the “4D-fluents” approach in this work). Finally, querying in SOWL QL is supported by the SOWL reasoner which is not part of the standard SPARQL translation. The run-time performance of SOWL QL has been assessed experimentally in a real data setting. A critical analysis of its performance is also presented

Crossref

University of Huddersfield Repository

Institutional Repository of the Technical University of Crete

Huddersfield Research Portal

Accurate microRNA target prediction correlates with protein repression levels

Author: A Grimson
Artemis G Hatzigeorgiou
BP Lewis
BP Lewis
D Baek
D Gaidatzis
D Karolchik
D Long
DP Bartel
Evangelos Koukis
George Giannopoulos
George Goumas
Giorgio L Papadopoulos
J Brennecke
J Liu
Kornilios Kourtis
LP Lim
M Kertesz
M Kiriakidou
M Lagos-Quintana
M Rehmsmeier
M Selbach
Manolis Maragkakis
Martin Reczko
NC Lau
Nectarios Koziris
P Sethupathy
P Sood
Panagiotis Alexiou
Panagiotis Tsanakas
Praveen Sethupathy
RC Lee
RC Lee
S Lall
Thanasis Vergoulis
Theodore Dalamagas
Timos Sellis
Victor A Simossis
Publication venue: BioMed Central
Publication date: 01/01/2009
Field of study

MicroRNAs are small endogenously expressed non-coding RNA molecules that regulate target gene expression through translation repression or messenger RNA degradation. MicroRNA regulation is performed through pairing of the microRNA to sites in the messenger RNA of protein coding genes. Since experimental identification of miRNA target genes poses difficulties, computational microRNA target prediction is one of the key means in deciphering the role of microRNAs in development and diseas

OAR@UM

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

RMIT Research Repository

DSpace@NTUA (National Technical Univ. of Athens)

A Genome-Wide Analysis of FRT-Like Sequences in the Human Genome

Author: AA Mills
AW Rufer
CI Rodriguez
D Sellis
D Wirth
ES Lander
Eugenia Voziyanova
F Buchholz
H Skaletsky
I Sarkar
J Qiao
J Sambrook
Jay H. Konieczka
Jeffry L. Shultz
JF Senecoff
JH Konieczka
KR Kalari
N Guex
NJ Kilby
Robert Oshima
S Bolusani
S Glaser
SR Eddy
SW Santoro
SW Umlauf
T Yada
Y Chen
Y Tay
Y Voziyanov
Y Voziyanov
Yuri Voziyanov
Publication venue: Public Library of Science
Publication date: 01/03/2011
Field of study

Efficient and precise genome manipulations can be achieved by the Flp/FRT system of site-specific DNA recombination. Applications of this system are limited, however, to cases when target sites for Flp recombinase, FRT sites, are pre-introduced into a genome locale of interest. To expand use of the Flp/FRT system in genome engineering, variants of Flp recombinase can be evolved to recognize pre-existing genomic sequences that resemble FRT and thus can serve as recombination sites. To understand the distribution and sequence properties of genomic FRT-like sites, we performed a genome-wide analysis of FRT-like sites in the human genome using the experimentally-derived parameters. Out of 642,151 identified FRT-like sequences, 581,157 sequences were unique and 12,452 sequences had at least one exact duplicate. Duplicated FRT-like sequences are located mostly within LINE1, but also within LTRs of endogenous retroviruses, Alu repeats and other repetitive DNA sequences. The unique FRT-like sequences were classified based on the number of matches to FRT within the first four proximal bases pairs of the Flp binding elements of FRT and the nature of mismatched base pairs in the same region. The data obtained will be useful for the emerging field of genome engineering

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Query Processing in Spatial Databases Containing Obstacles

Author: Becker B.
Brinkhoff T.
Dimitris Papadias
Estivill‐Castro V.
Ghosh S.
Guttman A.
Jun Zhang
Kung R.
Kyriakos Mouratidis
Papadias D.
Pocchiola M.
Rivière S.
Sellis T.
Sharir M.
Zhu Manli
Publication venue: 'Informa UK Limited'
Publication date: 01/01/2005
Field of study

Despite the existence of obstacles in many database applications, traditional spatial query processing assumes that points in space are directly reachable and utilizes the Euclidean distance metric. In this paper, we study spatial queries in the presence of obstacles, where the obstructed distance between two points is defined as the length of the shortest path that connects them without crossing any obstacles. We propose efficient algorithms for the most important query types, namely, range search, nearest neighbours, e-distance joins, closest pairs and distance semi-joins, assuming that both data objects and obstacles are indexed by R-trees. The effectiveness of the proposed solutions is verified through extensive experiments

CiteSeerX

Crossref

Institutional Knowledge at Singapore Management University

Alu distribution and mutation types of cancer genes

Author: A Tsirigos
A Viel
Andrea Edwards
B Dennis
D Grover
D Sellis
D Srikanta
DW Huang
E Teugels
EM Kvikstad
G Abrusan
G Casella
G Franke
GE Novick
H van der Klift
J Jurka
J Jurka
J Jurka
J O'Neil
J Wang
JE Stenger
JR Korenberg
K Debacker
K Zhang
Kun Zhang
L Lin
MA Batzer
MP Strout
MW McCoy
N Gilbert
N Sela
N. J. D. NAGELKERKE
P Medstrand
PA Callinan
PA Futreal
PL Deininger
Prescott Deininger
RJ Hu
S Myers
S Schwartz
S Volinia
S Yang
SJ Baker
SK Sen
SK Sen
SY Hsieh
TA Morrish
TJ Gu
V Ricci
VP Belancio
Wei Fan
Wensheng Zhang
Y Benjamini
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Background: Alu elements are the most abundant retrotransposable elements comprising ~11% of the human genome. Many studies have highlighted the role that Alu elements have in genetic instability and how their contribution to the assortment of mutagenic events can lead to cancer. As of yet, little has been done to quantitatively assess the association between Alu distribution and genes that are causally implicated in oncogenesis.Results: We have investigated the effect of various Alu densities on the mutation type based classifications of cancer genes. In order to establish the direct relationship between Alus and the cancer genes of interest, genome wide Alu-related densities were measured using genes rather than the sliding windows of fixed length as the units. Several novel genomic features, such as the density of the adjacent Alu pairs and the number of Alu-Exon-Alu triplets, were developed in order to extend the investigation via the multivariate statistical analysis toward more advanced biological insight. In addition, we characterized the genome-wide intron Alu distribution with a mixture model that distinguished genes containing Alu elements from those with no Alus, and evaluated the gene-level effect of the 5\u27-TTAAAA motif associated with Alu insertion sites using a two-step regression analysis method.Conclusions: The study resulted in several novel findings worthy of further investigation. They include: (1) Recessive cancer genes (tumor suppressor genes) are enriched with Alu elements (p \u3c 0.01) compared to dominant cancer genes (oncogenes) and the entire set of genes in the human genome; (2) Alu-related genomic features can be used to cluster cancer genes into biological meaningful groups; (3) The retention of exon Alus has been restricted in the human genome development, and an upper limit to the chromosome-level exon Alu densities is suggested by the distribution profile; (4) For the genes with at least one intron Alu repeat in individual chromosomes, the intron Alu densities can be well fitted by a Gamma distribution; (5) The effect of the 5\u27-TTAAAA motif on Alu densities varies across different chromosomes

Crossref

Springer - Publisher Connector

PubMed Central

Xavier University of Louisiana: XULA Digital Commons