Cross-Document Pattern Matching
We study a new variant of the string matching problem called cross-document
string matching, which is the problem of indexing a collection of documents to
support an efficient search for a pattern in a selected document, where the
pattern itself is a substring of another document. Several variants of this
problem are considered, and efficient linear-space solutions are proposed with
query time bounds that either do not depend at all on the pattern size or
depend on it in a very limited way (doubly logarithmic). As a side result, we
propose an improved solution to the weighted level ancestor problem.
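To make the query in this abstract concrete, here is a minimal brute-force sketch of what a cross-document query asks for. It is not the indexed solution the abstract describes: it scans the target document on every query, so its cost depends on both the document and the pattern length. The function name and the (target, source, start, end) parameter layout are assumptions chosen for illustration.

```python
def cross_document_occurrences(docs, target, source, start, end):
    """Return all start positions in docs[target] where the pattern
    docs[source][start:end] occurs (overlapping matches included).

    Brute-force baseline only: the pattern is given as a substring of
    another document, as in the problem statement, but no index is built.
    """
    pattern = docs[source][start:end]
    text = docs[target]
    if not pattern:
        return []
    hits = []
    pos = text.find(pattern)
    while pos != -1:
        hits.append(pos)
        # Advance by one so that overlapping occurrences are also reported.
        pos = text.find(pattern, pos + 1)
    return hits


docs = ["abracadabra", "cadabra", "bra"]
# Pattern is docs[1][4:7] == "bra", searched for in docs[0].
print(cross_document_occurrences(docs, target=0, source=1, start=4, end=7))
```

An index in the spirit of the abstract would answer the same query without rescanning the target document; the sketch only fixes the input and output of the problem.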
Low Space External Memory Construction of the Succinct Permuted Longest Common Prefix Array
The longest common prefix (LCP) array is a versatile auxiliary data structure
in indexed string matching. It can be used to speed up searching using the
suffix array (SA) and provides an implicit representation of the topology of an
underlying suffix tree. The LCP array of a string of length $n$ can be
represented as an array of length $n$ words, or, in the presence of the SA, as
a bit vector of $2n$ bits plus asymptotically negligible support data
structures. External memory construction algorithms for the LCP array have been
proposed, but those proposed so far have a space requirement of $O(n)$ words
(i.e. $O(n \log n)$ bits) in external memory. This space requirement is in some
practical cases prohibitively expensive. We present an external memory
algorithm for constructing the $2n$ bit version of the LCP array which uses
$O(n \log \sigma)$ bits of additional space in external memory when given a
(compressed) BWT with alphabet size $\sigma$ and a sampled inverse suffix array
at sampling rate $O(\log n)$. This is often a significant space gain in
practice where $\sigma$ is usually much smaller than $n$ or even constant. We
also consider the case of computing succinct LCP arrays for circular strings.
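The bit-vector representation mentioned in this abstract can be illustrated with a small in-memory sketch. It is not the external memory algorithm of the paper. It assumes the standard permuted LCP (PLCP) encoding, in which the i-th one bit sits at position PLCP[i] + 2i, so the vector has at most 2n bits. All function names are hypothetical, and the suffix array construction and the select operation are deliberately naive.

```python
def suffix_array(text):
    # Naive construction by sorting suffixes; adequate for a small example.
    return sorted(range(len(text)), key=lambda i: text[i:])


def plcp_array(text, sa):
    # PLCP[i] = longest common prefix of suffix i and the suffix that
    # precedes it in lexicographic order (0 for the smallest suffix).
    n = len(text)
    phi = [-1] * n
    for rank in range(1, n):
        phi[sa[rank]] = sa[rank - 1]
    plcp = [0] * n
    length = 0
    for i in range(n):
        j = phi[i]
        if j < 0:
            length = 0
        else:
            while i + length < n and j + length < n and text[i + length] == text[j + length]:
                length += 1
        plcp[i] = length
        # PLCP[i + 1] >= PLCP[i] - 1, so matching can resume from here.
        length = max(length - 1, 0)
    return plcp


def encode_plcp_bits(plcp):
    # Place the i-th one bit at position PLCP[i] + 2 * i. These positions
    # are strictly increasing, so zeros simply pad the gaps.
    bits = []
    for i, value in enumerate(plcp):
        bits.extend([0] * (value + 2 * i - len(bits)))
        bits.append(1)
    return bits


def decode_plcp(bits, i):
    # Naive select: find the i-th one bit (0-based) and undo the 2 * i offset.
    seen = -1
    for pos, bit in enumerate(bits):
        if bit:
            seen += 1
            if seen == i:
                return pos - 2 * i
    raise IndexError(i)


text = "banana"
sa = suffix_array(text)
bits = encode_plcp_bits(plcp_array(text, sa))
# With the suffix array available, LCP[r] is recovered as PLCP[SA[r]].
lcp = [decode_plcp(bits, s) for s in sa]
print(len(bits), lcp)
```

A practical structure would replace the linear scan in `decode_plcp` with a constant-time select structure, which is the "asymptotically negligible support data" the abstract refers to.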
Document Retrieval on Repetitive Collections
Document retrieval aims at finding the most important documents where a
pattern appears in a collection of strings. Traditional pattern-matching
techniques yield brute-force document retrieval solutions, which has motivated
the research on tailored indexes that offer near-optimal performance. However,
an experimental study establishing which alternatives are actually better than
brute force, and which perform best depending on the collection
characteristics, has not been carried out. In this paper we address this
shortcoming by exploring the relationship between the nature of the underlying
collection and the performance of current methods. Via extensive experiments we
show that established solutions are often beaten in practice by brute-force
alternatives. We also design new methods that offer superior time/space
trade-offs, particularly on repetitive collections.
Comment: Accepted to ESA 2014. Implementation and experiments at
http://www.cs.helsinki.fi/group/suds/rlcsa
Searching for Radio Pulsars in 3EG Sources at Urumqi Observatory
Since mid-2005, a pulsar searching system has been operating at 18 cm on the
25-m radio telescope of Urumqi Observatory. Test observations on known pulsars
show that the system can perform the intended task. The prospect of using this
system to observe 3EG sources and other target searching tasks is discussed.
Comment: a training project about MSc thesis
The spectroscopic evolution of the recurrent nova T Pyxidis during its 2011 outburst I. The optically thick phase and the origin of moving lines in novae
The nova T Pyx was observed with high resolution (R ~ 65000)
spectroscopy, beginning 1 day after discovery of the outburst and continuing
through the last visibility of the star at the end of May 2011. The
interstellar absorption lines of Na I, Ca II, CH, CH$^+$, and archival H I 21
cm emission line observations have been used to determine a kinematic distance.
Interstellar diffuse absorption features have been used to determine the
extinction independent of previous assumptions. Sample Fe-peak line profiles
show the optical depth and radial velocity evolution of the discrete
components. We propose a distance to T Pyx $\geq$ 4.5 kpc, with a strict lower
limit of 3.5 kpc (the previously accepted distance). We derive an extinction,
E(B-V) $\approx 0.5 \pm 0.1$, that is higher than previous estimates. The first
observation, Apr. 15, displayed He I, He II, C III, and N III emission lines
and a maximum velocity on P Cyg profiles of the Balmer and He I lines of
about 2500 km s$^{-1}$ characteristic of the fireball stage. These ions were
undetectable in the second spectrum, Apr. 23, and we use the recombination time
to estimate the mass of the ejecta as a function of the filling factor.
Numerous absorption line systems were detected on the Balmer, Fe-peak, Ca
II, and Na I lines, mirrored in broader emission line components, that showed
an "accelerated" displacement in velocity. We also show that the time sequence
of these absorptions, which are common to all lines and arise only in the
ejecta, can be described by a recombination front moving outward in the expanding
gas without either a stellar wind or circumstellar collisions.
Comment: Accepted for publication in Astronomy & Astrophysics Letters
(17/8/11)
Five-Year Optical and Near Infrared Observations of the Extremely Slow Nova V1280 Scorpii
We present optical and near infrared photometric and spectroscopic observations
of the classical nova V1280 Scorpii for five years from 2007 to 2011. Our
photometric observations show a declining event in optical bands shortly after
the maximum light which continues for about 250 days. The event is most probably
caused by dust formation. The event is accompanied by a short (about 30
days) re-brightening episode (about 2.5 mag), which suggests a
re-ignition of the surface nuclear burning. After 2008, the
observations show a very long plateau at around magnitude 10.5 for more than 1000
days until April 2011 (about 1500 days after the maximum light). The nova
took a very long time (about 50 months) before entering the nebular phase
(clear detection of both [O III] 4959 and 5007) and is still continuing
to generate the wind caused by H-burning. The finding suggests that V1280 Sco
is going through the historically slowest evolution. The interval from the
maximum light (2007 February 16) to the beginning of the nebular phase is
longer than any previously known slow novae: V723 Cas (18 months), RR Pic (10
months), or HR Del (8 months). It suggests that the mass of a white dwarf in
the V1280 Sco system might be 0.6 $M_\odot$ or smaller. The distance,
based on our measurements of the expansion velocity combined with the directly
measured size of the dust shell, is estimated to be 1.1 $\pm$ 0.5 kpc.
Comment: 17 pages, 14 figures, accepted for publication in A&A
Wave Energy: a Pacific Perspective
This is the author's peer-reviewed final manuscript, as accepted by the publisher. The published article is copyrighted by The Royal Society and can be found at: http://rsta.royalsocietypublishing.org/.
This paper illustrates the status of wave energy development in Pacific Rim countries by characterizing the available resource and introducing the region's current and potential future leaders in wave energy converter development. It also describes the existing licensing and permitting process as well as potential environmental concerns. Capabilities of Pacific Ocean testing facilities are described in addition to the region's vision of the future of wave energy.
Implementation of the Backlund transformations for the Ablowitz-Ladik hierarchy
The derivation of the Backlund transformations (BTs) is a standard problem of
the theory of integrable systems. Here, I discuss the equations describing
the BTs for the Ablowitz-Ladik hierarchy (ALH), which have already been
obtained by several authors. The main aim of this work is to solve these
equations. This can be done in the framework of the so-called functional
representation of the ALH, in which an infinite number of evolutionary
equations are replaced, using Miwa's shifts, with a few equations linking
tau-functions with different arguments. It is shown that starting from these
equations it is possible to obtain explicit solutions of the BT equations. In
other words, the main result of this work is a presentation of the discrete BTs
as a superposition of an infinite number of evolutionary flows of the
hierarchy. These results are used to derive the superposition formulae for the
BTs as well as pure soliton solutions.
Comment: 20 pages