5,181 research outputs found

    Searching by approximate personal-name matching

    Get PDF
    We discuss the design, building and evaluation of a method to access theinformation of a person, using his name as a search key, even if it has deformations. We present a similarity function, the DEA function, based on the probabilities of the edit operations accordingly to the involved letters and their position, and using a variable threshold. The efficacy of DEA is quantitatively evaluated, without human relevance judgments, very superior to the efficacy of known methods. A very efficient approximate search technique for the DEA function is also presented based on a compacted trie-tree structure.Postprint (published version

    Fast Approximate Reconciliation of Set Differences

    Full text link
    We present new, simple, efficient data structures for approximate reconciliation of set differences, a useful standalone primitive for peer-to-peer networks and a natural subroutine in methods for exact reconciliation. In the approximate reconciliation problem, peers A and B respectively have subsets of elements SA and SB of a large universe U. Peer A wishes to send a short message M to peer B with the goal that B should use M to determine as many elements in the set SB–SA as possible. To avoid the expense of round trip communication times, we focus on the situation where a single message M is sent. We motivate the performance tradeoffs between message size, accuracy and computation time for this problem with a straightforward approach using Bloom filters. We then introduce approximation reconciliation trees, a more computationally efficient solution that combines techniques from Patricia tries, Merkle trees, and Bloom filters. We present an analysis of approximation reconciliation trees and provide experimental results comparing the various methods proposed for approximate reconciliation.National Science Foundation (ANI-0093296, ANI-9986397, CCR-0118701, CCR-0121154); Alfred P. Sloan Research Fellowshi

    Lattice template placement for coherent all-sky searches for gravitational-wave pulsars

    Full text link
    All-sky, broadband, coherent searches for gravitational-wave pulsars are restricted by limited computational resources. Minimizing the number of templates required to cover the search parameter space, of sky position and frequency evolution, is one important way to reduce the computational cost of a search. We demonstrate a practical algorithm which, for the first time, achieves template placement with a minimal number of templates for an all-sky search, using the reduced supersky parameter-space metric of Wette and Prix [Phys. Rev. D 88, 123005 (2013)]. The metric prescribes a constant template density in the signal parameters, which permits that templates be placed at the vertices of a lattice. We demonstrate how to ensure complete coverage of the parameter space, including in particular at its boundaries. The number of templates generated by the algorithm is compared to theoretical estimates, and to previous predictions by Brady et al. [Phys. Rev. D 57, 2101 (1998)]. The algorithm may be applied to any search parameter space with a constant template density, which includes semicoherent searches and searches targeting known low-mass X-ray binaries.Comment: 16 pages, 14 figure
    • …
    corecore