
    Deterministic and Probabilistic Binary Search in Graphs

    We consider the following natural generalization of Binary Search: in a given undirected, positively weighted graph, one vertex is a target. The algorithm's task is to identify the target by adaptively querying vertices. In response to querying a node q, the algorithm learns either that q is the target, or is given an edge out of q that lies on a shortest path from q to the target. We study this problem in a general noisy model in which each query independently receives a correct answer with probability p > \frac{1}{2} (a known constant), and an (adversarial) incorrect one with probability 1 - p. Our main positive result is that when p = 1 (i.e., all answers are correct), \log_2 n queries are always sufficient. For general p, we give an (almost information-theoretically optimal) algorithm that uses, in expectation, no more than (1 - \delta)\frac{\log_2 n}{1 - H(p)} + o(\log n) + O(\log^2(1/\delta)) queries, and identifies the target correctly with probability at least 1 - \delta. Here, H(p) = -(p \log p + (1 - p) \log(1 - p)) denotes the entropy. The first bound is achieved by the algorithm that iteratively queries a 1-median of the nodes not ruled out yet; the second bound by careful repeated invocations of a multiplicative weights algorithm. Even for p = 1, we show several hardness results for the problem of determining whether a target can be found using K queries. Our upper bound of \log_2 n implies a quasipolynomial-time algorithm for undirected connected graphs; we show that this is best-possible under the Strong Exponential Time Hypothesis (SETH). Furthermore, for directed graphs, or for undirected graphs with non-uniform node querying costs, the problem is PSPACE-complete. For a semi-adaptive version, in which one may query r nodes in each of k rounds, we show membership in \Sigma_{2k-1} in the polynomial hierarchy, and hardness for \Sigma_{2k-5}.
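
    The noiseless (p = 1) strategy admits a compact sketch. The following Python snippet (using networkx; the oracle callback is a hypothetical stand-in for the query model, and the exact distance comparison assumes integer edge weights) illustrates the repeated 1-median querying described above; it is an illustration of the idea, not the authors' implementation.

import networkx as nx

def find_target(G, oracle, weight="weight"):
    # Precompute shortest-path distances in the positively weighted graph.
    dist = dict(nx.all_pairs_dijkstra_path_length(G, weight=weight))
    candidates = set(G.nodes())
    queries = 0
    while True:
        # Query a 1-median of the nodes not ruled out yet: a candidate
        # minimizing the total distance to the remaining candidates.
        q = min(candidates, key=lambda v: sum(dist[v][t] for t in candidates))
        queries += 1
        answer = oracle(q)  # hypothetical oracle: "target", or a neighbour u of q on a shortest path to the target
        if answer == "target":
            return q, queries
        u = answer
        # Keep only targets t consistent with the answer: d(q, t) = w(q, u) + d(u, t).
        candidates = {t for t in candidates
                      if dist[q][t] == G[q][u][weight] + dist[u][t]}

    The stated \log_2 n bound reflects that each such 1-median query can be shown to at least halve the set of candidates still consistent with all answers.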

    Prioritizing Populations for Conservation Using Phylogenetic Networks

    In the face of inevitable future losses to biodiversity, ranking species by conservation priority seems more than prudent. Setting conservation priorities within species (i.e., at the population level) may be critical as species ranges become fragmented and connectivity declines. However, existing approaches to prioritization (e.g., scoring organisms by their expected genetic contribution) are based on phylogenetic trees, which may be poor representations of differentiation below the species level. In this paper we extend evolutionary isolation indices used in conservation planning from phylogenetic trees to phylogenetic networks. Such networks better represent population differentiation, and our extension allows populations to be ranked in order of their expected contribution to the set. We illustrate the approach using data from two imperiled species: the spotted owl Strix occidentalis in North America and the mountain pygmy-possum Burramys parvus in Australia. Using previously published mitochondrial and microsatellite data, we construct phylogenetic networks and score each population by its relative genetic distinctiveness. In both cases, our phylogenetic networks capture the geographic structure of each species: geographically peripheral populations harbor less-redundant genetic information, increasing their conservation rankings. We note that our approach can be used with all conservation-relevant distances (e.g., those based on whole-genome, ecological, or adaptive variation) and suggest it be added to the assortment of tools available to wildlife managers for allocating effort among threatened populations.
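
    As a toy illustration of the ranking idea only (not the paper's network-based isolation index), the following Python sketch scores populations by mean pairwise genetic distance, a simple distinctiveness proxy; the population names and distance matrix are hypothetical.

import numpy as np

def rank_populations(names, D):
    """Rank populations by mean genetic distance to the other populations."""
    D = np.asarray(D, dtype=float)
    n = len(names)
    scores = D.sum(axis=1) / (n - 1)   # self-distance is zero, so divide by n - 1
    order = np.argsort(-scores)        # most distinct (least redundant) first
    return [(names[i], float(scores[i])) for i in order]

# Hypothetical pairwise distances among four populations; the geographically
# peripheral population ends up ranked first, mirroring the pattern reported above.
names = ["north", "central", "south", "peripheral"]
D = [[0.00, 0.10, 0.12, 0.30],
     [0.10, 0.00, 0.08, 0.28],
     [0.12, 0.08, 0.00, 0.27],
     [0.30, 0.28, 0.27, 0.00]]
print(rank_populations(names, D))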

    Answering Complex Questions by Joining Multi-Document Evidence with Quasi Knowledge Graphs

    Direct answering of questions that involve multiple entities and relations is a challenge for text-based QA. This problem is most pronounced when answers can be found only by joining evidence from multiple documents. Curated knowledge graphs (KGs) may yield good answers, but are limited by their inherent incompleteness and potential staleness. This paper presents QUEST, a method that can answer complex questions directly from textual sources on-the-fly, by computing similarity joins over partial results from different documents. Our method is completely unsupervised, avoiding training-data bottlenecks and coping with rapidly evolving ad hoc topics and formulation styles in user questions. QUEST builds a noisy quasi KG with node and edge weights, consisting of dynamically retrieved entity names and relational phrases. It augments this graph with types and semantic alignments, and computes the best answers by an algorithm for Group Steiner Trees. We evaluate QUEST on benchmarks of complex questions, and show that it substantially outperforms state-of-the-art baselines.
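
    A rough Python sketch of the answering step, assuming the noisy quasi KG has already been built as a networkx graph over retrieved entity names and relational phrases, with an edge attribute "cost" where lower means more reliable. QUEST computes Group Steiner Trees; the sketch simplifies this to networkx's plain Steiner-tree approximation over one representative node per question-term group, so it shows the shape of the computation rather than the actual algorithm.

import networkx as nx
from networkx.algorithms.approximation import steiner_tree

def answer(quasi_kg, term_groups):
    """term_groups: for each question term, the set of KG nodes matching it."""
    # Simplification: pick the best-connected member of each group up front
    # (QUEST instead chooses among group members inside the Group Steiner Tree).
    terminals = [max(group, key=quasi_kg.degree) for group in term_groups]
    # Connect the chosen terminals as cheaply as possible in the noisy graph.
    tree = steiner_tree(quasi_kg, terminals, weight="cost")
    # Candidate answers are the non-terminal nodes of the connecting tree,
    # ranked here by their degree in the tree as a stand-in for QUEST's scoring.
    candidates = [v for v in tree.nodes() if v not in set(terminals)]
    return sorted(candidates, key=tree.degree, reverse=True)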

    Term-Specific Eigenvector-Centrality in Multi-Relation Networks

    Fuzzy matching and ranking are two information retrieval techniques widely used in web search. Their application to structured data, however, remains an open problem. This article investigates how eigenvector-centrality can be used for approximate matching in multi-relation graphs, that is, graphs where connections of many different types may exist. Based on an extension of the PageRank matrix, eigenvectors representing the distribution of a term after propagating term weights between related data items are computed. The result is an index which takes the document structure into account and can be used with standard document retrieval techniques. As the scheme takes the shape of an index transformation, all necessary calculations are performed at index time.
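
    A minimal sketch of this kind of index transformation, assuming the multi-relation data graph is available as a weighted networkx graph and that each term starts with an initial weight on the items containing it (e.g., tf-idf scores); the term-specific centrality vector is then computed here as a personalized PageRank with networkx, standing in for the article's extended PageRank matrix.

import networkx as nx

def term_specific_index(graph, term_weights, alpha=0.85):
    """term_weights: {term: {node: initial weight of that term at the node}}"""
    index = {}
    for term, weights in term_weights.items():
        # Propagate the term's weight along the (typed, weighted) relations;
        # the stationary distribution is the term-specific centrality vector.
        index[term] = nx.pagerank(graph, alpha=alpha,
                                  personalization=weights, weight="weight")
    return index

# The resulting per-term vectors can be stored like ordinary postings and
# queried with standard document-retrieval techniques, as described above.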