26,168 research outputs found
The total path length of split trees
We consider the model of random trees introduced by Devroye [SIAM J. Comput.
28 (1999) 409-432]. The model encompasses many important randomized algorithms
and data structures. The pieces of data (items) are stored in a randomized
fashion in the nodes of a tree. The total path length (sum of depths of the
items) is a natural measure of the efficiency of the algorithm/data structure.
Using renewal theory, we prove convergence in distribution of the total path
length toward a distribution characterized uniquely by a fixed point equation.
Our result covers, using a unified approach, many data structures such as
binary search trees, m-ary search trees, quad trees, median-of-(2k+1) trees,
and simplex trees.Comment: Published in at http://dx.doi.org/10.1214/11-AAP812 the Annals of
Applied Probability (http://www.imstat.org/aap/) by the Institute of
Mathematical Statistics (http://www.imstat.org
On the sub-permutations of pattern avoiding permutations
There is a deep connection between permutations and trees. Certain
sub-structures of permutations, called sub-permutations, bijectively map to
sub-trees of binary increasing trees. This opens a powerful tool set to study
enumerative and probabilistic properties of sub-permutations and to investigate
the relationships between 'local' and 'global' features using the concept of
pattern avoidance. First, given a pattern {\mu}, we study how the avoidance of
{\mu} in a permutation {\pi} affects the presence of other patterns in the
sub-permutations of {\pi}. More precisely, considering patterns of length 3, we
solve instances of the following problem: given a class of permutations K and a
pattern {\mu}, we ask for the number of permutations whose
sub-permutations in K satisfy certain additional constraints on their size.
Second, we study the probability for a generic pattern to be contained in a
random permutation {\pi} of size n without being present in the
sub-permutations of {\pi} generated by the entry . These
theoretical results can be useful to define efficient randomized pattern-search
procedures based on classical algorithms of pattern-recognition, while the
general problem of pattern-search is NP-complete
Random binary search tree with equal elements
AbstractWe consider random binary search trees when the input consists of a multiset, i.e. a set with multiple occurrences of equal elements, and prove that the randomized insertion and deletion algorithms given by Martínez and Roura (1998) [4] produce random search trees regardless of multiplicities; even if all the elements are equal during the tree updates, a search tree will maintain its randomness. Thus, equal elements do not degenerate a random search tree and they need not to be handled in any special way. We consider also stability of a search tree with respect to its inorder traversal and prove that the algorithms used produce stable trees. This implies an implicit indexing of equal elements giving another proof that multiplicities do not pose problems when maintaining random binary search trees
Random Binary Search Tree with Equal Elements
Computing Reviews (1998) Categories and Subject Descriptors:
E.1 Data Structures — trees
F.2.2 Analysis of Algorithms and Problem Complexity: Nonnumerical Algorithms and
Problems — sorting and searchingWe consider random binary search trees when the input consists of a multiset, i.e. a set with multiple occurrences of equal elements, and prove that the randomized insertion and deletion algorithms produce random search trees regardless of multiplicities; even if all the elements are equal during the tree updates, a search tree will maintain its randomness. Thus, equal elements do not degenerate a random search tree and they need not to be handled in any special way. We consider also stability of a search tree with respect to its inorder traversal and prove that the algorithms used produce stable trees. This introduces an implicit indexing of equal elements giving another proof that multiplicities do not pose problems when maintaining random binary search trees
Median-of-k Jumplists and Dangling-Min BSTs
We extend randomized jumplists introduced by Br\"onnimann et al. (STACS 2003) to choose jump-pointer targets as median of a small sample for better search costs, and present randomized algorithms with expected time complexity that maintain the probability distribution of jump pointers upon insertions and deletions. We analyze the expected costs to search, insert and delete a random element, and we show that omitting jump pointers in small sublists hardly affects search costs, but significantly reduces the memory consumption. We use a bijection between jumplists and "dangling-min BSTs", a variant of (fringe-balanced) binary search trees for the analysis. Despite their similarities, some standard analysis techniques for search trees fail for dangling-min trees (and hence for jumplists)
Recommended from our members
Metaheuristic approaches for the quartet method of hierarchical clustering
Given a set of objects and their pairwise distances, we wish to determine a visual representation of the data. We use the quartet paradigm to compute a hierarchy of clusters of the objects. The method is based on an NP-hard graph optimization problem called the Minimum Quartet Tree Cost problem. This paper presents and compares several metaheuristic approaches to approximate the optimal hierarchy. The performance of the algorithms is tested through extensive computational experiments and it is shown that the Reduced Variable Neighbourhood Search metaheuristic is the most effective approach to the problem, obtaining high quality solutions in short computational running times
- …