3,071,853 research outputs found
Efficient Computation of Sequence Mappability
Sequence mappability is an important task in genome re-sequencing. In the
-mappability problem, for a given sequence of length , our goal
is to compute a table whose th entry is the number of indices such
that length- substrings of starting at positions and have at
most mismatches. Previous works on this problem focused on heuristic
approaches to compute a rough approximation of the result or on the case of
. We present several efficient algorithms for the general case of the
problem. Our main result is an algorithm that works in time and space for
. It requires a carefu l adaptation of the technique of Cole
et al.~[STOC 2004] to avoid multiple counting of pairs of substrings. We also
show -time algorithms to compute all results for a fixed
and all or a fixed and all . Finally we show
that the -mappability problem cannot be solved in strongly subquadratic
time for unless the Strong Exponential Time Hypothesis
fails.Comment: Accepted to SPIRE 201
Efficient Algorithms for Scheduling Moldable Tasks
We study the problem of scheduling independent moldable tasks on
processors that arises in large-scale parallel computations. When tasks are
monotonic, the best known result is a -approximation
algorithm for makespan minimization with a complexity linear in and
polynomial in and where is
arbitrarily small. We propose a new perspective of the existing speedup models:
the speedup of a task is linear when the number of assigned
processors is small (up to a threshold ) while it presents
monotonicity when ranges in ; the bound
indicates an unacceptable overhead when parallelizing on too many processors.
For a given integer , let . In this paper, we propose a -approximation algorithm for makespan minimization with a
complexity where
(). As
a by-product, we also propose a -approximation algorithm for
throughput maximization with a common deadline with a complexity
Vinogradov systems with a slice off
Let denote the number of integral solutions of the modified
Vinogradov system of equations with .
By exploiting sharp estimates for an auxiliary mean value, we obtain bounds for
for . In particular, when
satisfy and , we establish the essentially
diagonal behaviour .Comment: 19 page
Efficient self-sustained pulsed CO laser
In this paper a simple sealed-off TEA CO laser is described with a self-sustained discharge without an external UV preionization source. At 77 K this system yields more than 600 mJ from a lasing volume of about 60 cm3 CO-N2-He mixture (45 J/ℓ atm. with 15.6% efficiency)
Cache-Oblivious Selection in Sorted X+Y Matrices
Let X[0..n-1] and Y[0..m-1] be two sorted arrays, and define the mxn matrix A
by A[j][i]=X[i]+Y[j]. Frederickson and Johnson gave an efficient algorithm for
selecting the k-th smallest element from A. We show how to make this algorithm
IO-efficient. Our cache-oblivious algorithm performs O((m+n)/B) IOs, where B is
the block size of memory transfers
Statistics of Partial Minima
Motivated by multi-objective optimization, we study extrema of a set of N
points independently distributed inside the d-dimensional hypercube. A point in
this set is k-dominated by another point when at least k of its coordinates are
larger, and is a k-minimum if it is not k-dominated by any other point. We
obtain statistical properties of these partial minima using exact probabilistic
methods and heuristic scaling techniques. The average number of partial minima,
A, decays algebraically with the total number of points, A ~ N^{-(d-k)/k}, when
1<=k<d. Interestingly, there are k-1 distinct scaling laws characterizing the
largest coordinates as the distribution P(y_j) of the jth largest coordinate,
y_j, decays algebraically, P(y_j) ~ (y_j)^{-alpha_j-1}, with
alpha_j=j(d-k)/(k-j) for 1<=j<=k-1. The average number of partial minima grows
logarithmically, A ~ [1/(d-1)!](ln N)^{d-1}, when k=d. The full distribution of
the number of minima is obtained in closed form in two-dimensions.Comment: 6 pages, 1 figur
- …