450 research outputs found

    Cross-Document Pattern Matching

    We study a new variant of the string matching problem called cross-document string matching, which is the problem of indexing a collection of documents to support an efficient search for a pattern in a selected document, where the pattern itself is a substring of another document. Several variants of this problem are considered, and efficient linear-space solutions are proposed with query time bounds that either do not depend at all on the pattern size or depend on it in a very limited way (doubly logarithmic). As a side result, we propose an improved solution to the weighted level ancestor problem.

    Low Space External Memory Construction of the Succinct Permuted Longest Common Prefix Array

    The longest common prefix (LCP) array is a versatile auxiliary data structure in indexed string matching. It can be used to speed up searching using the suffix array (SA) and provides an implicit representation of the topology of an underlying suffix tree. The LCP array of a string of length $n$ can be represented as an array of length $n$ words, or, in the presence of the SA, as a bit vector of $2n$ bits plus asymptotically negligible support data structures. External memory construction algorithms for the LCP array have been proposed, but those proposed so far have a space requirement of $O(n)$ words (i.e. $O(n \log n)$ bits) in external memory. This space requirement is in some practical cases prohibitively expensive. We present an external memory algorithm for constructing the $2n$ bit version of the LCP array which uses $O(n \log \sigma)$ bits of additional space in external memory when given a (compressed) BWT with alphabet size $\sigma$ and a sampled inverse suffix array at sampling rate $O(\log n)$. This is often a significant space gain in practice where $\sigma$ is usually much smaller than $n$ or even constant. We also consider the case of computing succinct LCP arrays for circular strings.
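
    To make the $2n$-bit representation concrete, the following is a minimal in-memory sketch, not the external memory algorithm of the paper. It assumes the standard encoding of the permuted LCP (PLCP) array, in which the $i$-th one bit is placed at position $\mathrm{PLCP}[i] + 2i$; the function names (`suffix_array`, `plcp_array`, `encode_plcp`, `decode_plcp`) are hypothetical and the suffix array is built naively for illustration only.

```python
def suffix_array(s):
    """Naive suffix array: sort suffix start positions (illustration only)."""
    return sorted(range(len(s)), key=lambda i: s[i:])


def plcp_array(s, sa):
    """PLCP[i] = LCP of suffix i and its lexicographic predecessor.

    Uses the fact that PLCP[i + 1] >= PLCP[i] - 1, so the matched
    length can be carried over from one text position to the next.
    """
    n = len(s)
    phi = [-1] * n                      # phi[i] = suffix preceding i in SA order
    for r in range(1, n):
        phi[sa[r]] = sa[r - 1]
    plcp = [0] * n
    l = 0
    for i in range(n):
        j = phi[i]
        if j == -1:                     # smallest suffix has no predecessor
            l = 0
            continue
        while i + l < n and j + l < n and s[i + l] == s[j + l]:
            l += 1
        plcp[i] = l
        l = max(l - 1, 0)
    return plcp


def encode_plcp(plcp):
    """Bit vector of 2n bits: the i-th one sits at position PLCP[i] + 2i."""
    n = len(plcp)
    bits = [0] * (2 * n)
    for i, v in enumerate(plcp):
        bits[v + 2 * i] = 1
    return bits


def decode_plcp(bits):
    """Recover PLCP from the bit vector (a linear scan instead of select)."""
    ones = [p for p, b in enumerate(bits) if b]
    return [p - 2 * i for i, p in enumerate(ones)]


if __name__ == "__main__":
    s = "banana"
    sa = suffix_array(s)
    plcp = plcp_array(s, sa)
    bits = encode_plcp(plcp)
    lcp = [plcp[i] for i in sa]         # LCP in suffix array order needs the SA
    print(sa, plcp, bits, lcp)
```

    The encoding works because $\mathrm{PLCP}[i] + 2i$ is strictly increasing in $i$ and never exceeds $2n - 1$, so each value can be read back with a select query on the bit vector; the sketch replaces that query with a linear scan.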

    Document Retrieval on Repetitive Collections

    Document retrieval aims at finding the most important documents where a pattern appears in a collection of strings. Traditional pattern-matching techniques yield brute-force document retrieval solutions, which has motivated the research on tailored indexes that offer near-optimal performance. However, an experimental study establishing which alternatives are actually better than brute force, and which perform best depending on the collection characteristics, has not been carried out. In this paper we address this shortcoming by exploring the relationship between the nature of the underlying collection and the performance of current methods. Via extensive experiments we show that established solutions are often beaten in practice by brute-force alternatives. We also design new methods that offer superior time/space trade-offs, particularly on repetitive collections. Comment: Accepted to ESA 2014. Implementation and experiments at http://www.cs.helsinki.fi/group/suds/rlcsa
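
    As an illustration of what a brute-force solution can look like, the sketch below lists the documents containing a pattern by locating every occurrence in a suffix array of the concatenated collection and then collapsing the occurrences to distinct document identifiers. It is an assumption about the general approach, not the implementation evaluated in the paper; the names `build_index`, `sa_range` and `list_documents` are hypothetical, the suffix array is built naively, and the pattern is assumed to be non-empty and free of the separator character.

```python
from bisect import bisect_right


def build_index(docs, sep="\x00"):
    """Concatenate documents with a separator and build a naive suffix array."""
    text = sep.join(docs) + sep
    starts, pos = [], 0
    for d in docs:
        starts.append(pos)              # start offset of each document
        pos += len(d) + 1
    sa = sorted(range(len(text)), key=lambda i: text[i:])
    return text, sa, starts


def sa_range(text, sa, pattern):
    """Half-open suffix array range of the suffixes prefixed by pattern."""
    m = len(pattern)
    lo, hi = 0, len(sa)
    while lo < hi:                      # first suffix whose prefix >= pattern
        mid = (lo + hi) // 2
        if text[sa[mid]:sa[mid] + m] < pattern:
            lo = mid + 1
        else:
            hi = mid
    left, hi = lo, len(sa)
    while lo < hi:                      # first suffix whose prefix > pattern
        mid = (lo + hi) // 2
        if text[sa[mid]:sa[mid] + m] <= pattern:
            lo = mid + 1
        else:
            hi = mid
    return left, lo


def list_documents(index, pattern):
    """Brute force: visit every occurrence, keep the distinct document ids."""
    text, sa, starts = index
    left, right = sa_range(text, sa, pattern)
    return sorted({bisect_right(starts, sa[r]) - 1 for r in range(left, right)})


if __name__ == "__main__":
    index = build_index(["banana", "bandana", "cabana"])
    print(list_documents(index, "and"))
```

    The cost of this approach grows with the total number of occurrences rather than with the number of matching documents, which is the inefficiency that tailored document retrieval indexes try to avoid.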

    Searching for Radio Pulsars in 3EG Sources at Urumqi Observatory

    Since mid-2005, a pulsar searching system has been operating at 18 cm on the 25-m radio telescope of Urumqi Observatory. Test observations on known pulsars show that the system can perform the intended task. The prospect of using this system to observe 3EG sources and other target searching tasks is discussed. Comment: a training project about MSc thesis

    The spectroscopic evolution of the recurrent nova T Pyxidis during its 2011 outburst I. The optically thick phase and the origin of moving lines in novae

    The nova T Pyx was observed with high resolution spectroscopy ($R \sim 65000$), beginning 1 day after discovery of the outburst and continuing through the last visibility of the star at the end of May 2011. The interstellar absorption lines of Na I, Ca II, CH, CH$^+$, and archival H I 21 cm emission line observations have been used to determine a kinematic distance. Interstellar diffuse absorption features have been used to determine the extinction independent of previous assumptions. Sample Fe-peak line profiles show the optical depth and radial velocity evolution of the discrete components. We propose a distance to T Pyx $\geq 4.5$ kpc, with a strict lower limit of 3.5 kpc (the previously accepted distance). We derive an extinction, $E(B-V) \approx 0.5 \pm 0.1$, that is higher than previous estimates. The first observation, Apr. 15, displayed He I, He II, C III, and N III emission lines and a maximum velocity on P Cyg profiles of the Balmer and He I lines of $\approx 2500$ km s$^{-1}$ characteristic of the fireball stage. These ions were undetectable in the second spectrum, Apr. 23, and we use the recombination time to estimate the mass of the ejecta, $10^{-5} f\,M_\odot$ for a filling factor $f$. Numerous absorption line systems were detected on the Balmer, Fe-peak, Ca II, and Na I lines, mirrored in broader emission line components, that showed an "accelerated" displacement in velocity. We also show that the time sequence of these absorptions, which are common to all lines and arise only in the ejecta, can be described by a recombination front moving outward in the expanding gas without either a stellar wind or circumstellar collisions. Comment: Accepted for publication in Astronomy & Astrophysics Letters (17/8/11)

    Five-Year Optical and Near Infrared Observations of the Extremely Slow Nova V1280 Scorpii

    We present optical ($B$, $V$, $R_{\rm c}$, $I_{\rm c}$ and $y$) and near infrared ($J$, $H$ and $K_{\rm s}$) photometric and spectroscopic observations of the classical nova V1280 Scorpii for five years from 2007 to 2011. Our photometric observations show a declining event in optical bands shortly after the maximum light which continues for $\sim 250$ days. The event is most probably caused by dust formation. The event is accompanied by a short ($\sim 30$ days) re-brightening episode ($\sim 2.5$ mag in $V$), which suggests a re-ignition of the surface nuclear burning. After 2008, the $y$ band observations show a very long plateau at around $y = 10.5$ for more than 1000 days until April 2011 ($\sim 1500$ days after the maximum light). The nova had taken a very long time ($\sim 50$ months) before entering the nebular phase (clear detection of both [O III] 4959 and 5007) and is still continuing to generate the wind caused by H-burning. The finding suggests that V1280 Sco is going through the historically slowest evolution. The interval from the maximum light (2007 February 16) to the beginning of the nebular phase is longer than that of any previously known slow nova: V723 Cas (18 months), RR Pic (10 months), or HR Del (8 months). It suggests that the mass of the white dwarf in the V1280 Sco system might be 0.6 $M_\odot$ or smaller. The distance, based on our measurements of the expansion velocity combined with the directly measured size of the dust shell, is estimated to be $1.1 \pm 0.5$ kpc. Comment: 17 pages, 14 figures, accepted for publication in A&

    Wave Energy: a Pacific Perspective

    This is the author's peer-reviewed final manuscript, as accepted by the publisher. The published article is copyrighted by The Royal Society and can be found at: http://rsta.royalsocietypublishing.org/. This paper illustrates the status of wave energy development in Pacific Rim countries by characterizing the available resource and introducing the region's current and potential future leaders in wave energy converter development. It also describes the existing licensing and permitting process as well as potential environmental concerns. Capabilities of Pacific Ocean testing facilities are described in addition to the region's vision of the future of wave energy.

    Implementation of the Backlund transformations for the Ablowitz-Ladik hierarchy

    The derivation of the Backlund transformations (BTs) is a standard problem in the theory of integrable systems. Here, I discuss the equations describing the BTs for the Ablowitz-Ladik hierarchy (ALH), which have already been obtained by several authors. The main aim of this work is to solve these equations. This can be done in the framework of the so-called functional representation of the ALH, in which an infinite number of evolutionary equations are replaced, using Miwa's shifts, with a few equations linking tau-functions with different arguments. It is shown that, starting from these equations, it is possible to obtain explicit solutions of the BT equations. In other words, the main result of this work is a presentation of the discrete BTs as a superposition of an infinite number of evolutionary flows of the hierarchy. These results are used to derive the superposition formulae for the BTs as well as pure soliton solutions. Comment: 20 pages