Search CORE

46,123 research outputs found

Pattern based processing of XPath queries

Author: Marks Gerard
Roantree Mark
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2008
Field of study

As the popularity of areas including document storage and distributed systems continues to grow, the demand for high performance XML databases is increasingly evident. This has led to a number of research eorts aimed at exploiting the maturity of relational database systems in order to in- crease XML query performance. In our approach, we use an index structure based on a metamodel for XML databases combined with relational database technology to facilitate fast access to XML document elements. The query process involves transforming XPath expressions to SQL which can be executed over our optimised query engine. As there are many dierent types of XPath queries, varying processing logic may be applied to boost performance not only to indi- vidual XPath axes, but across multiple axes simultaneously. This paper describes a pattern based approach to XPath query processing, which permits the execution of a group of XPath location steps in parallel

CiteSeerX

Crossref

DCU Online Research Access Service

The Effelsberg-Bonn HI Survey: Data reduction

Author: Adelman-McCarthy
B. Winkel
Bradley
Briggs
Flöer
Galassi
Giovanelli
Goldsmith
Heiles
Heiles
J. Kerp
Kalberla
Kalberla
Kalberla
Keller
Klein
L. Flöer
McClure-Griffiths
McClure-Griffiths
P. M. W. Kalberla
Peek
Taylor
Winkel
Publication venue: 'IOP Publishing'
Publication date: 25/05/2010
Field of study

Starting in winter 2008/2009 an L-band 7-Feed-Array receiver is used for a 21-cm line survey performed with the 100-m telescope, the Effelsberg-Bonn HI survey (EBHIS). The EBHIS will cover the whole northern hemisphere for decl.>-5 deg comprising both the galactic and extragalactic sky out to a distance of about 230 Mpc. Using state-of-the-art FPGA-based digital fast Fourier transform spectrometers, superior in dynamic range and temporal resolution to conventional correlators, allows us to apply sophisticated radio frequency interference (RFI) mitigation schemes. In this paper, the EBHIS data reduction package and first results are presented. The reduction software consists of RFI detection schemes, flux and gain-curve calibration, stray-radiation removal, baseline fitting, and finally the gridding to produce data cubes. The whole software chain is successfully tested using multi-feed data toward many smaller test fields (1--100 square degrees) and recently applied for the first time to data of two large sky areas, each covering about 2000 square degrees. The first large area is toward the northern galactic pole and the second one toward the northern tip of the Magellanic Leading Arm. Here, we demonstrate the data quality of EBHIS Milky Way data and give a first impression on the first data release in 2011.Comment: 17 pages, 14 figures; to be published in ApJ

arXiv.org e-Print Archive

Crossref

MatriVasha: A Multipurpose Comprehensive Database for Bangla Handwritten Compound Characters

Author: Ferdous Jannatul
Hossain Syed Akhter
Karmaker Suvrajit
Rabby A K M Shahariar Azad
Publication venue
Publication date: 06/05/2020
Field of study

At present, recognition of the Bangla handwriting compound character has been an essential issue for many years. In recent years there have been application-based researches in machine learning, and deep learning, which is gained interest, and most notably is handwriting recognition because it has a tremendous application such as Bangla OCR. MatrriVasha, the project which can recognize Bangla, handwritten several compound characters. Currently, compound character recognition is an important topic due to its variant application, and helps to create old forms, and information digitization with reliability. But unfortunately, there is a lack of a comprehensive dataset that can categorize all types of Bangla compound characters. MatrriVasha is an attempt to align compound character, and it's challenging because each person has a unique style of writing shapes. After all, MatrriVasha has proposed a dataset that intends to recognize Bangla 120(one hundred twenty) compound characters that consist of 2552(two thousand five hundred fifty-two) isolated handwritten characters written unique writers which were collected from within Bangladesh. This dataset faced problems in terms of the district, age, and gender-based written related research because the samples were collected that includes a verity of the district, age group, and the equal number of males, and females. As of now, our proposed dataset is so far the most extensive dataset for Bangla compound characters. It is intended to frame the acknowledgment technique for handwritten Bangla compound character. In the future, this dataset will be made publicly available to help to widen the research.Comment: 19 fig, 2 tabl

arXiv.org e-Print Archive

Don't Thrash: How to Cache Your Hash on Flash

Author: Bender Michael A.
Farach-Colton Martin
Johnson Rob
Kraner Russell
Kuszmaul Bradley C.
Medjedovic Dzejla
Montes Pablo
Shetty Pradeep
Spillane Richard P.
Zadok Erez
Publication venue
Publication date: 01/01/2012
Field of study

This paper presents new alternatives to the well-known Bloom filter data structure. The Bloom filter, a compact data structure supporting set insertion and membership queries, has found wide application in databases, storage systems, and networks. Because the Bloom filter performs frequent random reads and writes, it is used almost exclusively in RAM, limiting the size of the sets it can represent. This paper first describes the quotient filter, which supports the basic operations of the Bloom filter, achieving roughly comparable performance in terms of space and time, but with better data locality. Operations on the quotient filter require only a small number of contiguous accesses. The quotient filter has other advantages over the Bloom filter: it supports deletions, it can be dynamically resized, and two quotient filters can be efficiently merged. The paper then gives two data structures, the buffered quotient filter and the cascade filter, which exploit the quotient filter advantages and thus serve as SSD-optimized alternatives to the Bloom filter. The cascade filter has better asymptotic I/O performance than the buffered quotient filter, but the buffered quotient filter outperforms the cascade filter on small to medium data sets. Both data structures significantly outperform recently-proposed SSD-optimized Bloom filter variants, such as the elevator Bloom filter, buffered Bloom filter, and forest-structured Bloom filter. In experiments, the cascade filter and buffered quotient filter performed insertions 8.6-11 times faster than the fastest Bloom filter variant and performed lookups 0.94-2.56 times faster.Comment: VLDB201

arXiv.org e-Print Archive

CiteSeerX