
63,074 research outputs found

Analysis of An Approximate Median Selection Algorithm

Author: Cantone Domenico
Hofri Micha
Publication venue: Digital WPI
Publication date: 10/08/2006

We present an analysis of an efficient algorithm for the approximate median selection problem that has been rediscovered many times and is easy to implement. The contribution of the article is a precise characterization of the accuracy of the algorithm. We present analytical results on the performance of the algorithm, as well as experimental illustrations of its precision.
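The abstract does not say which algorithm is analyzed. As an illustration only, the sketch below assumes a generic "median of triplets" scheme: each group of three items is replaced by its median until fewer than three items remain. The function names `median_of_three` and `approximate_median` are hypothetical, and this may differ from the algorithm in the paper.

```python
def median_of_three(a, b, c):
    """Return the middle value of three items."""
    return sorted((a, b, c))[1]


def approximate_median(values):
    """Approximate the median by repeatedly taking medians of triplets.

    Assumption: this is a generic median-of-triplets scheme, not
    necessarily the exact algorithm analyzed in the paper.
    """
    items = list(values)
    if not items:
        raise ValueError("need at least one value")
    while len(items) >= 3:
        # One median per full triplet; up to two leftover items are dropped.
        items = [
            median_of_three(*items[i:i + 3])
            for i in range(0, len(items) - 2, 3)
        ]
    # One or two items remain; return the first as the estimate.
    return items[0]
```

The result is always an element of the input, but it is only an estimate: its rank depends on the input order, which is why an accuracy analysis is of interest.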

DigitalCommons@WPI

Optimal Gossip Algorithms for Exact and Approximate Quantile Computations

Author: Haeupler Bernhard
Mohapatra Jeet
Su Hsin-Hao
Publication date: 25/11/2017

This paper gives drastically faster gossip algorithms to compute exact and approximate quantiles. Gossip algorithms, which allow each node to contact a uniformly random other node in each round, have been intensely studied and adopted in many applications due to their fast convergence and their robustness to failures. Kempe et al. [FOCS'03] gave gossip algorithms to compute important aggregate statistics if every node is given a value. In particular, they gave a beautiful $O(\log n + \log \frac{1}{\epsilon})$ round algorithm to $\epsilon$-approximate the sum of all values and an $O(\log^2 n)$ round algorithm to compute the exact $\phi$-quantile, i.e., the $\lceil \phi n \rceil$-th smallest value. We give a quadratically faster and in fact optimal gossip algorithm for the exact $\phi$-quantile problem which runs in $O(\log n)$ rounds. We furthermore show that one can achieve an exponential speedup if one allows for an $\epsilon$-approximation. We give an $O(\log \log n + \log \frac{1}{\epsilon})$ round gossip algorithm which computes a value of rank between $\phi n$ and $(\phi+\epsilon)n$ at every node, for any $0 \leq \phi \leq 1$ and $0 < \epsilon < 1$. Our algorithms are extremely simple and very robust: they can be operated with the same running times even if every transmission fails with a, potentially different, constant probability. We also give a matching $\Omega(\log \log n + \log \frac{1}{\epsilon})$ lower bound which shows that our algorithm is optimal for all values of $\epsilon$.
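To make the gossip model concrete, here is a minimal simulation sketch of push-sum averaging, the kind of aggregate protocol the abstract credits to Kempe et al. It is not the quantile algorithm of this paper. The function name `push_sum_average`, the synchronous round structure, and the seeded random generator are assumptions made for illustration.

```python
import random


def push_sum_average(values, rounds, seed=0):
    """Simulate push-sum gossip; return each node's estimate of the average.

    Assumption: synchronous rounds and no message loss. Each node keeps
    half of its (sum, weight) pair and sends the other half to one
    uniformly random other node.
    """
    n = len(values)
    if n < 2:
        raise ValueError("need at least two nodes")
    rng = random.Random(seed)
    sums = [float(v) for v in values]
    weights = [1.0] * n
    for _ in range(rounds):
        new_sums = [0.0] * n
        new_weights = [0.0] * n
        for node in range(n):
            # Pick a uniformly random node other than `node`.
            target = rng.randrange(n - 1)
            if target >= node:
                target += 1
            half_sum, half_weight = sums[node] / 2, weights[node] / 2
            new_sums[node] += half_sum
            new_weights[node] += half_weight
            new_sums[target] += half_sum
            new_weights[target] += half_weight
        sums, weights = new_sums, new_weights
    # Total sum and total weight are conserved, so sum/weight tends to the mean.
    return [s / w for s, w in zip(sums, weights)]
```

Every node's ratio converges to the global average. Multiplying by a known node count would give the sum.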

arXiv.org e-Print Archive

Crossref

Fast Deterministic Selection

Author: Alexandrescu Andrei
Publication date: 04/08/2016

The Median of Medians (also known as BFPRT) algorithm, although a landmark theoretical achievement, is seldom used in practice because it and its variants are slower than simple approaches based on sampling. The main contribution of this paper is a fast linear-time deterministic selection algorithm QuickselectAdaptive based on a refined definition of MedianOfMedians. The algorithm's performance brings deterministic selection (along with its desirable properties of reproducible runs, predictable run times, and immunity to pathological inputs) into the range of practicality. We demonstrate results on independent and identically distributed random inputs and on normally-distributed inputs. Measurements show that QuickselectAdaptive is faster than state-of-the-art baselines. Comment: Pre-publication draft
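For reference, the classic Median of Medians selection that the abstract takes as its starting point can be sketched as follows. This is the textbook groups-of-five variant, not QuickselectAdaptive. The function name `select` and the list-based partitioning, which trades extra memory for clarity, are choices made for this sketch.

```python
def select(items, k):
    """Return the k-th smallest element (0-based) using median of medians.

    Textbook BFPRT with groups of five; a sketch, not the refined
    algorithm proposed in the paper.
    """
    items = list(items)
    if not 0 <= k < len(items):
        raise IndexError("k out of range")
    while True:
        if len(items) <= 5:
            return sorted(items)[k]
        # Median of each group of at most five elements.
        medians = []
        for start in range(0, len(items), 5):
            group = sorted(items[start:start + 5])
            medians.append(group[len(group) // 2])
        # Recursively pick the median of those medians as the pivot.
        pivot = select(medians, len(medians) // 2)
        lows = [x for x in items if x < pivot]
        highs = [x for x in items if x > pivot]
        num_equal = len(items) - len(lows) - len(highs)
        if k < len(lows):
            items = lows
        elif k < len(lows) + num_equal:
            return pivot
        else:
            k -= len(lows) + num_equal
            items = highs
```

The pivot is guaranteed to discard a constant fraction of the input each round, which gives the worst-case linear bound. The bookkeeping needed for that guarantee is the overhead the abstract refers to.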

arXiv.org e-Print Archive

Dagstuhl Research Online Publication Server

SOCP relaxation bounds for the optimal subset selection problem applied to robust linear regression

Author: Flores Salvador
Publication venue: 'Elsevier BV'
Publication date: 01/01/2015

This paper deals with the problem of finding the globally optimal subset of h elements from a larger set of n elements in d space dimensions so as to minimize a quadratic criterion, with a special emphasis on applications to computing the Least Trimmed Squares Estimator (LTSE) for robust regression. The computation of the LTSE is a challenging subset selection problem involving a nonlinear program with continuous and binary variables, linked in a highly nonlinear fashion. The selection of a globally optimal subset using the branch and bound (BB) algorithm is limited to problems in very low dimension, typically d<5, as the complexity of the problem increases exponentially with d. We introduce a bold pruning strategy in the BB algorithm that results in a significant reduction in computing time, at the price of a negligible loss of accuracy. The novelty of our algorithm is that the bounds at nodes of the BB tree come from pseudo-convexifications derived using a linearization technique with approximate bounds for the nonlinear terms. The approximate bounds are computed by solving an auxiliary semidefinite optimization problem. We show through a computational study that our algorithm performs well in a wide set of the most difficult instances of the LTSE problem. Comment: 12 pages, 3 figures, 2 tables
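As a small illustration of the criterion involved, the sketch below evaluates the usual Least Trimmed Squares objective for a fixed coefficient vector: the sum of the h smallest squared residuals. The function name `lts_objective` and the plain-list data layout are assumptions. The branch and bound search and the relaxation bounds described in the abstract are not shown.

```python
def lts_objective(X, y, beta, h):
    """Sum of the h smallest squared residuals of the linear fit X @ beta.

    Assumption: X is a list of rows, y a list of responses, beta a list
    of coefficients; this only evaluates the criterion, it does not
    optimize it.
    """
    if not 1 <= h <= len(y):
        raise ValueError("h must be between 1 and the number of observations")
    squared_residuals = sorted(
        (yi - sum(b * x for b, x in zip(beta, row))) ** 2
        for row, yi in zip(X, y)
    )
    # Trimming: only the h best-fitting observations contribute.
    return sum(squared_residuals[:h])
```

With an intercept column and one gross outlier, trimming removes the outlier's contribution entirely:

```python
X = [[1, 0], [1, 1], [1, 2], [1, 3]]
y = [0, 1, 2, 100]
lts_objective(X, y, [0, 1], h=3)  # the outlier at x = 3 is trimmed away
```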

arXiv.org e-Print Archive

Repositorio Académico de la Universidad de Chile

d-QPSO: A Quantum-Behaved Particle Swarm Technique for Finding D-Optimal Designs With Discrete and Continuous Factors and a Binary Response

Author: Lukemire Joshua
Mandal Abhyuday
Wong Weng Kee
Publication venue: eScholarship, University of California
Publication date: 23/10/2018

Identifying optimal designs for generalized linear models with a binary response can be a challenging task, especially when there are both discrete and continuous independent factors in the model. Theoretical results rarely exist for such models, and for the handful that do, they usually come with restrictive assumptions. In this article, we propose the d-QPSO algorithm, a modified version of quantum-behaved particle swarm optimization, to find a variety of D-optimal approximate and exact designs for experiments with discrete and continuous factors and a binary response. We show that the d-QPSO algorithm can efficiently find locally D-optimal designs even for experiments with a large number of factors and robust pseudo-Bayesian designs when nominal values for the model parameters are not available. Additionally, we investigate robustness properties of the d-QPSO algorithm-generated designs to various model assumptions and provide real applications to design a bio-plastics odor removal experiment, an electronic static experiment, and a 10-factor car refueling experiment. Supplementary materials for the article are available online.

eScholarship - University of California

Linear-Space Data Structures for Range Mode Query in Arrays

Author: Durocher Stephane
Morrison Jason
Publication date: 01/01/2011

A mode of a multiset $S$ is an element $a \in S$ of maximum multiplicity; that is, $a$ occurs at least as frequently as any other element in $S$. Given a list $A[1:n]$ of $n$ items, we consider the problem of constructing a data structure that efficiently answers range mode queries on $A$. Each query consists of an input pair of indices $(i, j)$ for which a mode of $A[i:j]$ must be returned. We present an $O(n^{2-2\epsilon})$-space static data structure that supports range mode queries in $O(n^\epsilon)$ time in the worst case, for any fixed $\epsilon \in [0,1/2]$. When $\epsilon = 1/2$, this corresponds to the first linear-space data structure to guarantee $O(\sqrt{n})$ query time. We then describe three additional linear-space data structures that provide $O(k)$, $O(m)$, and $O(|j-i|)$ query time, respectively, where $k$ denotes the number of distinct elements in $A$ and $m$ denotes the frequency of the mode of $A$. Finally, we examine generalizing our data structures to higher dimensions. Comment: 13 pages, 2 figures
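To pin down the query semantics, the following sketch answers a range mode query by scanning the range and counting, with no preprocessing. It is a naive baseline under assumed 1-based inclusive indices, matching the $A[i:j]$ notation. It is not one of the data structures from the paper, and the function name `range_mode` is hypothetical.

```python
from collections import Counter


def range_mode(A, i, j):
    """Return (mode, frequency) for A[i..j], using 1-based inclusive indices.

    Naive baseline: counts every element in the range on each query.
    Ties are broken in favor of the value that appears first in the range.
    """
    if not 1 <= i <= j <= len(A):
        raise IndexError("require 1 <= i <= j <= len(A)")
    counts = Counter(A[i - 1:j])
    return max(counts.items(), key=lambda pair: pair[1])
```

Each query costs time proportional to the range length (in expectation, because of hashing) and extra space proportional to the number of distinct values in the range.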

arXiv.org e-Print Archive

CiteSeerX