20,364 research outputs found

    Connectivity Oracles for Graphs Subject to Vertex Failures

    We introduce new data structures for answering connectivity queries in graphs subject to batched vertex failures. A deterministic structure processes a batch of $d \leq d_\star$ failed vertices in $\tilde{O}(d^3)$ time and thereafter answers connectivity queries in $O(d)$ time. It occupies space $O(d_\star m \log n)$. We develop a randomized Monte Carlo version of our data structure with update time $\tilde{O}(d^2)$, query time $O(d)$, and space $\tilde{O}(m)$ for any failure bound $d \le n$. This is the first connectivity oracle for general graphs that can efficiently deal with an unbounded number of vertex failures. We also develop a more efficient Monte Carlo edge-failure connectivity oracle. Using space $O(n \log^2 n)$, $d$ edge failures are processed in $O(d \log d \log\log n)$ time and thereafter, connectivity queries are answered in $O(\log\log n)$ time, and the answers are correct w.h.p. Our data structures are based on a new decomposition theorem for an undirected graph $G = (V, E)$, which is of independent interest. It states that for any terminal set $U \subseteq V$ we can remove a set $B$ of $|U|/(s-2)$ vertices such that the remaining graph contains a Steiner forest for $U - B$ with maximum degree $s$.
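
    For reference, the decomposition theorem can be written out in LaTeX as below. This is only a transcription of the statement in the abstract, reading the degree bound as "at most $s$" and assuming the parameter satisfies $s > 2$ so that the bound on $|B|$ is meaningful.

        % Decomposition theorem, transcribed from the abstract above.
        \begin{theorem}
        Let $G = (V, E)$ be an undirected graph and $U \subseteq V$ a set of
        terminals. For a degree parameter $s > 2$, there is a set
        $B \subseteq V$ of at most $|U|/(s-2)$ vertices such that $G - B$
        contains a Steiner forest for $U - B$ with maximum degree at most $s$.
        \end{theorem}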

    Weighted Min-Cut: Sequential, Cut-Query and Streaming Algorithms

    Consider the following 2-respecting min-cut problem. Given a weighted graph $G$ and a spanning tree $T$ of $G$, find the minimum cut among the cuts that contain at most two edges of $T$. This problem is an important subroutine in Karger's celebrated randomized near-linear-time min-cut algorithm [STOC'96]. We present a new approach to this problem which can be easily implemented in many settings, leading to the following randomized min-cut algorithms for weighted graphs.

    * An $O(m\frac{\log^2 n}{\log\log n} + n\log^6 n)$-time sequential algorithm: this improves Karger's $O(m\log^3 n)$ and $O(m\frac{(\log^2 n)\log(n^2/m)}{\log\log n} + n\log^6 n)$ bounds when the input graph is not extremely sparse or dense. Improvements over Karger's bounds were previously known only under the rather strong assumption that the input graph is simple [Henzinger et al., SODA'17; Ghaffari et al., SODA'20]. For unweighted graphs with parallel edges, our bound can be improved to $O(m\frac{\log^{1.5} n}{\log\log n} + n\log^6 n)$.

    * An algorithm requiring $\tilde{O}(n)$ cut queries to compute the min-cut of a weighted graph: this answers an open problem of Rubinstein et al. [ITCS'18], who obtained a similar bound for simple graphs.

    * A streaming algorithm that requires $\tilde{O}(n)$ space and $O(\log n)$ passes to compute the min-cut: the only previous non-trivial exact min-cut algorithm in this setting is the 2-pass, $\tilde{O}(n)$-space algorithm on simple graphs [Rubinstein et al., ITCS'18] (observed by Assadi et al., STOC'19).

    In contrast to Karger's 2-respecting min-cut algorithm, which deploys sophisticated dynamic programming techniques, our approach exploits some cute structural properties, so that it only needs to compute the values of $\tilde{O}(n)$ cuts corresponding to removing $\tilde{O}(n)$ pairs of tree edges, an operation that can be done quickly in many settings; a naive brute-force version of the 2-respecting subproblem is sketched below.

    Comment: Updates on this version: (1) minor corrections in Sections 5.1 and 5.2; (2) references to newer results by GMW SOSA'21 (arXiv:2008.02060v2), DEMN STOC'21 (arXiv:2004.09129v2) and LMN '21 (arXiv:2102.06565v1).
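
    To make the 2-respecting subproblem concrete, here is a naive brute-force sketch in Python. It follows only the problem statement above (enumerate every single tree edge and every pair of tree edges, and evaluate the resulting cut), runs in roughly $O(n^2 m)$ time, and is in no way the paper's algorithm; all names and the input encoding are illustrative.

        import itertools

        def min_2_respecting_cut(n, edges, parent):
            # Brute-force 2-respecting min-cut: minimum over all cuts whose
            # edge set contains at most two edges of the spanning tree T.
            #   n      -- vertices are 0..n-1, with vertex 0 the root of T
            #   edges  -- weighted edge list [(u, v, w), ...] of the graph G
            #   parent -- parent[v], for v >= 1, gives the tree edge (parent[v], v)
            children = [[] for _ in range(n)]
            for v in range(1, n):
                children[parent[v]].append(v)

            # subtree[v] = vertices below (and including) v; cutting the tree
            # edge (parent[v], v) separates exactly subtree[v] from the rest.
            subtree = [set() for _ in range(n)]
            def collect(v):
                subtree[v].add(v)
                for c in children[v]:
                    collect(c)
                    subtree[v] |= subtree[c]
            collect(0)

            def cut_weight(side):
                # Total weight of G-edges crossing the bipartition (side, V - side).
                return sum(w for u, v, w in edges if (u in side) != (v in side))

            best = float('inf')
            # Cuts containing exactly one tree edge.
            for v in range(1, n):
                best = min(best, cut_weight(subtree[v]))
            # Cuts containing exactly two tree edges: one side of the cut is
            # determined by how the two subtrees nest (laminar, so nested or disjoint).
            for u, v in itertools.combinations(range(1, n), 2):
                a, b = subtree[u], subtree[v]
                if a < b:
                    side = b - a      # u's tree edge lies inside v's subtree
                elif b < a:
                    side = a - b
                else:
                    side = a | b      # disjoint subtrees
                best = min(best, cut_weight(side))
            return best

        # Example: a weighted 4-cycle with the path 0-1-2-3 as spanning tree.
        print(min_2_respecting_cut(
            4, [(0, 1, 2), (1, 2, 3), (2, 3, 2), (3, 0, 1)],
            parent=[None, 0, 1, 2]))  # -> 3, the cut {0} vs {1, 2, 3}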

    The study of probability model for compound similarity searching

    The main task of an Information Retrieval (IR) system is to retrieve documents relevant to a user's query. One of the most popular retrieval models in IR is the Vector Space Model. This model assumes relevance based on similarity, defined as the distance between query and document in the concept space. All currently existing chemical compound database systems have adopted the vector space model to calculate the similarity of a database entry to a query compound. However, the model assumes that the fragments represented by the bits are independent of one another, which is not necessarily true. Hence, the possibility of applying another IR model, the Probabilistic Model, to chemical compound searching is explored. This model estimates the probability that a chemical structure has the same bioactivity as a target compound. It is envisioned that by ranking chemical structures in decreasing order of their probability of relevance to the query structure, the effectiveness of a molecular similarity searching system can be increased. Both the fragment-dependence and fragment-independence assumptions are taken into consideration in improving the compound similarity searching system. After conducting a series of simulated similarity searches, it is concluded that the Probabilistic Model approaches do perform better than the existing similarity searching, giving better results on all evaluation criteria. As for which probability model performs better, the BD model showed an improvement over the BIR model.
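
    As a concrete picture of the vector-space baseline the thesis compares against, the sketch below ranks fingerprint bit vectors by Tanimoto similarity to a query, a common instantiation of the similarity-as-distance idea in compound searching. This is an illustrative sketch only; the fingerprints and names are made up, and it does not reproduce the thesis's probabilistic model.

        def tanimoto(a, b):
            # Tanimoto coefficient of two fingerprints given as sets of
            # 'on' bit positions: |A & B| / |A | B|.
            union = len(a | b)
            return len(a & b) / union if union else 0.0

        def rank_by_similarity(query_fp, database):
            # Vector-space-style retrieval: compounds in decreasing order
            # of similarity to the query fingerprint.
            return sorted(database.items(),
                          key=lambda item: tanimoto(query_fp, item[1]),
                          reverse=True)

        # Hypothetical fingerprint database.
        db = {"cmpd_a": {1, 4, 7, 9}, "cmpd_b": {1, 2, 4}, "cmpd_c": {3, 5, 8}}
        for name, fp in rank_by_similarity({1, 4, 9}, db):
            print(name, round(tanimoto({1, 4, 9}, fp), 3))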