Fast Searching in Packed Strings
Given strings $P$ and $Q$, the (exact) string matching problem is to find all
positions of substrings in $Q$ matching $P$. The classical Knuth-Morris-Pratt
algorithm [SIAM J. Comput., 1977] solves the string matching problem in linear
time, which is optimal if we can only read one character at a time. However,
most strings are stored in a computer in a packed representation with several
characters in a single word, giving us the opportunity to read multiple
characters simultaneously. In this paper we study the worst-case complexity of
string matching on strings given in packed representation. Let $m \leq n$ be
the lengths of $P$ and $Q$, respectively, and let $\sigma$ denote the size of the
alphabet. On a standard unit-cost word-RAM with logarithmic word size we
present an algorithm using time $O\left(\frac{n}{\log_\sigma n} + m +
\occ\right)$. Here $\occ$ is the number of occurrences of $P$ in $Q$. For
$m = o(n)$ this improves the $O(n)$ bound of the Knuth-Morris-Pratt algorithm.
Furthermore, if $m = O(n/\log_\sigma n)$ our algorithm is optimal since any
algorithm must spend at least $\Omega(\frac{(n+m)\log
\sigma}{\log n} + \occ) = \Omega(\frac{n}{\log_\sigma n} + \occ)$ time to
read the input and report all occurrences. The result is obtained by a novel
automaton construction based on the Knuth-Morris-Pratt algorithm combined with
a new compact representation of subautomata allowing an optimal
tabulation-based simulation.
Comment: To appear in Journal of Discrete Algorithms. Special Issue on CPM
200
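For orientation, the classical linear-time baseline that the abstract compares against can be sketched as follows. This is a minimal, textbook-style Knuth-Morris-Pratt matcher that reads one character at a time; it is not the packed-string algorithm of the paper, and the function names `failure_function` and `kmp_search` are illustrative choices, not taken from the source.

```python
def failure_function(pattern):
    """fail[i] = length of the longest proper border of pattern[:i + 1]."""
    fail = [0] * len(pattern)
    k = 0
    for i in range(1, len(pattern)):
        # Fall back through shorter borders until the next character fits.
        while k > 0 and pattern[i] != pattern[k]:
            k = fail[k - 1]
        if pattern[i] == pattern[k]:
            k += 1
        fail[i] = k
    return fail


def kmp_search(pattern, text):
    """Return the start positions of all occurrences of pattern in text."""
    if not pattern:
        return []  # assumption: an empty pattern reports no occurrences
    fail = failure_function(pattern)
    occurrences = []
    k = 0  # number of pattern characters currently matched
    for i, ch in enumerate(text):
        while k > 0 and ch != pattern[k]:
            k = fail[k - 1]
        if ch == pattern[k]:
            k += 1
        if k == len(pattern):
            occurrences.append(i - k + 1)
            k = fail[k - 1]  # continue, allowing overlapping matches
    return occurrences
```

Each text character is inspected a constant amortized number of times, which gives the linear bound; the packed-string result improves on it by processing several characters per machine word.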
Multi-ancestry meta-analysis of host genetic susceptibility to tuberculosis identifies shared genetic architecture
The heritability of susceptibility to tuberculosis (TB) disease has been well recognized. Over 100 genes have been studied as candidates for TB susceptibility, and several variants were identified by genome-wide association studies (GWAS), but few replicate. We established the International Tuberculosis Host Genetics Consortium to perform a multi-ancestry meta-analysis of GWAS, including 14,153 cases and 19,536 controls of African, Asian, and European ancestry. Our analyses demonstrate a substantial degree of heritability (pooled polygenic h² = 26.3%, 95% CI 23.7–29.0%) for susceptibility to TB that is shared across ancestries, highlighting an important host genetic influence on disease. We identified one global host genetic correlate for TB at genome-wide significance (p < 5 × 10⁻⁸) in the human leukocyte antigen (HLA)-II region (rs28383206, p-value = 5.2 × 10⁻⁹) but failed to replicate variants previously associated with TB susceptibility. These data demonstrate the complex shared genetic architecture of susceptibility to TB and the importance of large-scale GWAS analysis across multiple ancestries experiencing different levels of infection pressure.
Experimental Study of the Shortest Reset Word of Random Automata
In this paper we describe an approach to finding the shortest reset word of a
finite synchronizing automaton by using a SAT solver. We use this approach to
perform an experimental study of the length of the shortest reset word of a
finite synchronizing automaton. The largest automata we considered had 100
states. The results of the experiments allow us to formulate a hypothesis that
the length of the shortest reset word of a random finite automaton with $n$
states and 2 input letters with high probability is sublinear with respect to
$n$ and can be estimated as $1.95 n^{0.55}$.
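To make the notion of a shortest reset word concrete, here is a small sketch that finds one by breadth-first search over subsets of states. This is a swapped-in brute-force technique, not the SAT-based approach of the paper, and it is exponential in the worst case, so it only suits small automata. The transition-table layout and the names `shortest_reset_word` and `cerny` are assumptions made for illustration.

```python
from collections import deque


def shortest_reset_word(delta, alphabet):
    """Shortest word mapping all states to one state, or None if none exists.

    delta[q][letter] is the successor of state q; states are 0..len(delta)-1.
    """
    n = len(delta)
    if n <= 1:
        return ""
    start = frozenset(range(n))
    parent = {start: None}  # subset -> (previous subset, letter used)
    queue = deque([start])
    while queue:
        subset = queue.popleft()
        for letter in alphabet:
            image = frozenset(delta[q][letter] for q in subset)
            if image in parent:
                continue
            parent[image] = (subset, letter)
            if len(image) == 1:
                # Walk back to the full state set to recover the word.
                word = []
                node = image
                while parent[node] is not None:
                    node, used = parent[node]
                    word.append(used)
                return "".join(reversed(word))
            queue.append(image)
    return None  # the automaton is not synchronizing


def cerny(n):
    """Cerny automaton: 'a' is a cyclic shift, 'b' only moves state n-1 to 0."""
    return [{"a": (q + 1) % n, "b": 0 if q == n - 1 else q} for q in range(n)]
```

For the Černý automaton with 4 states, `len(shortest_reset_word(cerny(4), "ab"))` is 9, matching the known value $(n-1)^2$ for this extremal family; random automata, as the abstract reports, tend to have much shorter reset words.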
Two-proton correlations from 158 AGeV Pb+Pb central collisions
The two-proton correlation function at midrapidity from Pb+Pb central
collisions at 158 AGeV has been measured by the NA49 experiment. The results
are compared to model predictions from static thermal Gaussian proton source
distributions and transport models RQMD and VENUS. An effective proton source
size is determined by minimizing $\chi^2$/ndf between the correlation
functions of the data and those calculated for the Gaussian sources, yielding
3.85 ± 0.15 (stat.) +0.60/-0.25 (syst.) fm. Both the RQMD and the VENUS models are
consistent with the data within the error in the correlation peak region.
Comment: RevTeX style, 6 pages, 4 figures, 1 table. More discussion is added
about the structure on the tail of the correlation function. The systematic
error is revised. To appear in Phys. Lett.
Event-by-event fluctuations of average transverse momentum in central Pb+Pb collisions at 158 GeV per nucleon
We present first data on event-by-event fluctuations in the average
transverse momentum of charged particles produced in Pb+Pb collisions at the
CERN SPS. This measurement provides previously unavailable information, allowing
sensitive tests of microscopic and thermodynamic collision models and a search
for fluctuations expected to occur in the vicinity of the predicted QCD phase
transition. We find that the observed variance of the event-by-event average
transverse momentum is consistent with independent particle production modified
by the known two-particle correlations due to quantum statistics and final
state interactions and folded with the resolution of the NA49 apparatus. For
two specific models of non-statistical fluctuations in transverse momentum,
limits are derived in terms of fluctuation amplitude. We show that a
significant part of the parameter space for a model of isospin fluctuations
predicted as a consequence of chiral symmetry restoration in a non-equilibrium
scenario is excluded by our measurement.
Comment: 6 pages, 2 figures, submitted to Phys. Lett.
NA49 Results on Single Particle and Correlation Measurements in Central Pb+Pb Collisions
Single-particle spectra and two-particle correlation functions measured by the NA49 collaboration in central Pb+Pb collisions at 158 GeV/nucleon are presented. These measurements are used to study the kinetic and chemical freeze-out conditions in heavy ion collisions. We conclude that large baryon stopping, high baryon density and strong transverse radial flow are achieved in central Pb+Pb collisions at the SPS.
Xi and Xi-bar Production in 158 GeV/Nucleon Pb+Pb Collisions
We report measurements of Xi and Xi-bar hyperon absolute yields as a function
of rapidity in 158 GeV/c Pb+Pb collisions. At midrapidity, dN/dy = 2.29 +/-
0.12 for Xi, and 0.52 +/- 0.05 for Xi-bar, leading to the ratio of Xi-bar/Xi =
0.23 +/- 0.03. Inverse slope parameters fitted to the measured transverse mass
spectra are of the order of 300 MeV near mid-rapidity. The estimated total
yield of Xi particles in Pb+Pb central interactions amounts to 7.4 +/- 1.0 per
collision. Comparison to Xi production in properly scaled p+p reactions at the
same energy reveals a dramatic enhancement (about one order of magnitude) of Xi
production in Pb+Pb central collisions over elementary hadron interactions.
Comment: 15 page