
    Deterministic Time-Space Tradeoffs for k-SUM

    Given a set of numbers, the k-SUM problem asks for a subset of k numbers that sums to zero. When the numbers are integers, the time and space complexity of k-SUM is generally studied in the word-RAM model; when the numbers are reals, the complexity is studied in the real-RAM model, and space is measured by the number of reals held in memory at any point. We present a time- and space-efficient deterministic self-reduction for the k-SUM problem which holds in both models and has many interesting consequences. To illustrate:
    * 3-SUM is in deterministic time $O(n^2 \lg\lg(n)/\lg(n))$ and space $O\left(\sqrt{\frac{n \lg(n)}{\lg\lg(n)}}\right)$. In general, any polylogarithmic-time improvement over quadratic time for 3-SUM can be converted into an algorithm with an identical time improvement but low space complexity as well.
    * 3-SUM is in deterministic time $O(n^2)$ and space $O(\sqrt{n})$, derandomizing an algorithm of Wang.
    * A popular conjecture states that 3-SUM requires $n^{2-o(1)}$ time on the word-RAM. We show that the 3-SUM Conjecture is in fact equivalent to the (seemingly weaker) conjecture that every $O(n^{0.51})$-space algorithm for 3-SUM requires at least $n^{2-o(1)}$ time on the word-RAM.
    * For $k \ge 4$, k-SUM is in deterministic $O(n^{k-2+2/k})$ time and $O(\sqrt{n})$ space.
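    For context, the quadratic-time baseline that these results improve on is the textbook two-pointer scan over a sorted array. The sketch below is that classic algorithm, not the paper's self-reduction; note that it also uses linear working space for the sorted copy, in contrast with the paper's sublinear-space bounds.

```python
def three_sum(nums):
    """Return a triple (a, b, c) from nums with a + b + c == 0, or None.

    Classic O(n^2)-time baseline: sort once, then for each fixed element
    scan the remainder with two pointers.
    """
    nums = sorted(nums)  # O(n) extra space for the sorted copy
    n = len(nums)
    for i in range(n - 2):
        lo, hi = i + 1, n - 1
        while lo < hi:
            s = nums[i] + nums[lo] + nums[hi]
            if s == 0:
                return (nums[i], nums[lo], nums[hi])
            if s < 0:
                lo += 1  # sum too small: advance the left pointer
            else:
                hi -= 1  # sum too large: retreat the right pointer
    return None

print(three_sum([8, -25, 4, 10, -7, 3]))  # (-7, 3, 4)
```

    The sorted copy is what dominates the space here; the paper's contribution is precisely to keep (and even slightly beat) the quadratic running time while shrinking the footprint to $O(\sqrt{n})$.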

    Massively-Parallel Feature Selection for Big Data

    We present the Parallel, Forward-Backward with Pruning (PFBP) algorithm for feature selection (FS) in Big Data settings (high dimensionality and/or sample size). To tackle the challenges of Big Data FS, PFBP partitions the data matrix both in terms of rows (samples, training examples) and columns (features). By employing the concepts of p-values of conditional independence tests and meta-analysis techniques, PFBP manages to rely only on computations local to a partition while minimizing communication costs. It then employs powerful and safe (asymptotically sound) heuristics to make early, approximate decisions, such as Early Dropping of features from consideration in subsequent iterations, Early Stopping of consideration of features within the same iteration, and Early Return of the winner in each iteration. PFBP provides asymptotic guarantees of optimality for data distributions faithfully representable by a causal network (Bayesian network or maximal ancestral graph). Our empirical analysis confirms a super-linear speedup of the algorithm with increasing sample size and linear scalability with respect to the number of features and processing cores, while dominating other competitive algorithms in its class.
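    To illustrate just the Early Dropping heuristic on a single, non-partitioned data set, here is a minimal sketch of p-value-based forward selection. The partial-correlation test is a standard Gaussian conditional-independence test chosen for self-containedness, an assumption rather than the tests PFBP itself uses; the row/column partitioning, meta-analysis combination of local p-values, Early Stopping, and Early Return are not shown.

```python
import numpy as np
from scipy import stats

def partial_corr_pvalue(x, y, Z):
    """p-value for H0: corr(x, y | Z) == 0, via residualization on Z and
    Fisher's z-transform (standard Gaussian CI test; illustrative only)."""
    if Z.shape[1] > 0:
        A = np.column_stack([np.ones(len(x)), Z])
        x = x - A @ np.linalg.lstsq(A, x, rcond=None)[0]
        y = y - A @ np.linalg.lstsq(A, y, rcond=None)[0]
    r = np.corrcoef(x, y)[0, 1]
    n, k = len(x), Z.shape[1]
    z = np.arctanh(np.clip(r, -0.9999, 0.9999)) * np.sqrt(max(n - k - 3, 1))
    return 2 * stats.norm.sf(abs(z))

def forward_select_early_drop(X, y, alpha=0.01, max_features=10):
    """Forward selection with Early Dropping: features whose conditional
    p-value exceeds alpha are removed from all subsequent iterations."""
    selected, remaining = [], set(range(X.shape[1]))
    while remaining and len(selected) < max_features:
        Z = X[:, selected]
        pvals = {j: partial_corr_pvalue(X[:, j], y, Z) for j in remaining}
        remaining = {j for j, p in pvals.items() if p <= alpha}  # Early Dropping
        if not remaining:
            break
        best = min(remaining, key=pvals.get)  # most significant feature wins
        selected.append(best)
        remaining.discard(best)
    return selected

rng = np.random.default_rng(1)
X = rng.normal(size=(500, 20))
y = 2 * X[:, 3] - X[:, 7] + rng.normal(size=500)
print(forward_select_early_drop(X, y))  # typically [3, 7]; a false positive may slip in
```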

    Fast Deterministic Selection

    The Median of Medians (also known as BFPRT) algorithm, although a landmark theoretical achievement, is seldom used in practice because it and its variants are slower than simple approaches based on sampling. The main contribution of this paper is a fast linear-time deterministic selection algorithm, QuickselectAdaptive, based on a refined definition of MedianOfMedians. The algorithm's performance brings deterministic selection, along with its desirable properties of reproducible runs, predictable run times, and immunity to pathological inputs, into the range of practicality. We demonstrate results on independent and identically distributed random inputs and on normally-distributed inputs. Measurements show that QuickselectAdaptive is faster than state-of-the-art baselines.
    Comment: Pre-publication draft
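    For reference, the textbook MedianOfMedians selection that QuickselectAdaptive refines looks roughly as follows. This is the classic groups-of-five formulation, not the paper's refined pivot rule.

```python
def select(a, k):
    """Return the k-th smallest element (0-indexed) of list a in
    deterministic O(n) time via the textbook median-of-medians pivot."""
    if len(a) <= 5:
        return sorted(a)[k]
    # The median of each group of 5, then (recursively) the median of
    # those medians, guarantees that roughly 30% of the elements fall
    # on each side of the pivot, bounding the recursion.
    medians = [sorted(a[i:i + 5])[len(a[i:i + 5]) // 2]
               for i in range(0, len(a), 5)]
    pivot = select(medians, len(medians) // 2)
    lo = [x for x in a if x < pivot]
    hi = [x for x in a if x > pivot]
    n_eq = len(a) - len(lo) - len(hi)  # copies equal to the pivot
    if k < len(lo):
        return select(lo, k)
    if k < len(lo) + n_eq:
        return pivot
    return select(hi, k - len(lo) - n_eq)

print(select([9, 1, 7, 3, 8, 5, 2, 6, 4, 0], 4))  # 4
```

    The constant factors hidden in this recursion are exactly what make the classic version slow in practice and what the paper's refinements target.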

    Parallel Wavelet Tree Construction

    We present parallel algorithms for wavelet tree construction with polylogarithmic depth, improving upon the linear depth of the recent parallel algorithms by Fuentes-Sepulveda et al. We show experimentally, on a 40-core machine with two-way hyper-threading, that we outperform the existing parallel algorithms by 1.3-5.6x and achieve up to 27x speedup over the sequential algorithm on a variety of real-world and artificial inputs. Our algorithms show good scalability with increasing thread count, input size, and alphabet size. We also discuss extensions to variants of the standard wavelet tree.
    Comment: This is a longer version of the paper that appears in the Proceedings of the IEEE Data Compression Conference, 201
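    For orientation, here is a minimal sketch of the standard sequential levelwise construction that such parallel algorithms accelerate; the function name and levelwise (pointerless) representation are illustrative assumptions, not the paper's implementation. Each level stores one bit per symbol, and a stable sort by the bits seen so far routes every symbol to its node at the next level.

```python
import math

def wavelet_tree_levels(seq, sigma):
    """Sequential construction of a levelwise wavelet tree over alphabet
    [0, sigma): returns one concatenated bitvector per level. Level l
    stores bit (levels-1-l) of each symbol, with symbols ordered stably
    by their higher bits, i.e., grouped by wavelet-tree node."""
    levels = max(1, math.ceil(math.log2(max(sigma, 2))))
    cur, out = list(seq), []
    for lvl in range(levels):
        bit = levels - 1 - lvl
        out.append([(c >> bit) & 1 for c in cur])
        # Python's sort is stable, so sorting by the prefix of bits seen
        # so far refines the previous grouping without reordering nodes.
        cur = sorted(cur, key=lambda c: c >> bit)
    return out

print(wavelet_tree_levels([3, 0, 2, 1, 3, 0], 4))
# [[1, 0, 1, 0, 1, 0], [0, 1, 0, 1, 0, 1]]
```

    The whole construction is a stable MSD radix sort in disguise, which is why each level is amenable to parallelization with polylogarithmic depth.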

    Exploiting hybrid parallelism in the kinematic analysis of multibody systems based on group equations

    Computational kinematics is a fundamental tool for the design, simulation, control, optimization, and dynamic analysis of multibody systems. The analysis of complex multibody systems and the need for real-time solutions require kinematic and dynamic formulations that reduce computational cost, the selection and efficient use of the most appropriate solvers, and the exploitation of all available computer resources through parallel computing techniques. The topological approach based on group equations and natural coordinates reduces computation time in comparison with well-known global formulations and enables the use of parallelism techniques, which can be applied at different levels: simultaneous solution of equations, use of multithreading routines, or a combination of both. This paper studies and compares this topological formulation and these parallel techniques to ascertain which combination performs better in two applications. The first application uses dedicated systems for the real-time control of small multibody systems, defined by a small number of equations and small linear systems, so shared-memory parallelism in combination with linear algebra routines is analyzed on a small multicore system and on a Raspberry Pi; the control of a Stewart platform is used as a case study. The second application studies large multibody systems in which the kinematic analysis must be performed many times during the design process. A simulator that allows us to control the formulation, the solver, the parallel techniques, and the size of the problem has been developed and tested on more powerful computational systems with larger multicores and a GPU.
    This work was supported by the Spanish MINECO, as well as European Commission FEDER funds, under grant TIN2015-66972-C5-3-
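    The two parallelism levels mentioned above can be sketched minimally as follows; the names solve_group and solve_stage are hypothetical, and linearized group equations A q = b stand in for the actual natural-coordinate group equations, which are generally nonlinear and solved iteratively.

```python
import numpy as np
from concurrent.futures import ThreadPoolExecutor

def solve_group(args):
    """Solve one kinematic group's (linearized) equations A q = b.
    np.linalg.solve may itself run on multithreaded BLAS/LAPACK, which is
    the second, nested parallelism level discussed in the paper."""
    A, b = args
    return np.linalg.solve(A, b)

def solve_stage(groups, workers=4):
    """Groups in the same stage of the topological decomposition are
    mutually independent, so their systems can be solved simultaneously
    (first parallelism level); LAPACK releases the GIL, so threads scale."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return list(pool.map(solve_group, groups))

# Hypothetical stage with three independent 3x3 groups; adding 3*I makes
# each matrix strictly diagonally dominant, hence nonsingular.
rng = np.random.default_rng(0)
stage = [(rng.random((3, 3)) + 3 * np.eye(3), rng.random(3)) for _ in range(3)]
print(solve_stage(stage))
```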