Search CORE

20,570 research outputs found

Between Subgraph Isomorphism and Maximum Common Subgraph

Author: Hoffmann Ruth
Mccreesh Ciaran
Reilly Craig
Publication venue
Publication date: 01/01/2017
Field of study

When a small pattern graph does not occur inside a larger target graph, we can ask how to find "as much of the pattern as possible" inside the target graph. In general, this is known as the maximum common subgraph problem, which is much more computationally challenging in practice than subgraph isomorphism. We introduce a restricted alternative, where we ask if all but k vertices from the pattern can be found in the target graph. This allows for the development of slightly weakened forms of certain invariants from subgraph isomorphism which are based upon degree and number of paths. We show that when k is small, weakening the invariants still retains much of their effectiveness. We are then able to solve this problem on the standard problem instances used to benchmark subgraph isomorphism algorithms, despite these instances being too large for current maximum common subgraph algorithms to handle. Finally, by iteratively increasing k, we obtain an algorithm which is also competitive for the maximum common subgraph

Enlighten

Association for the Advancement of Artificial Intelligence: AAAI Publications

University of St. Andrews - Pure

Pattern matching and pattern discovery algorithms for protein topologies

Author: C. Bron
C.A. Orengo
C.A. Orengo
D. Gilbert
D.R. Westhead
D.R. Westhead
H.M. Berman
I. Koch
J.J. McGregor
J.R. Ullmann
K. Hofmann
K. Zhang
L. Holm
P.A. Evans
T.P.J. Flores
Publication venue: 'Springer Fachmedien Wiesbaden GmbH'
Publication date: 01/01/2001
Field of study

We describe algorithms for pattern matching and pattern learning in TOPS diagrams (formal descriptions of protein topologies). These problems can be reduced to checking for subgraph isomorphism and finding maximal common subgraphs in a restricted class of ordered graphs. We have developed a subgraph isomorphism algorithm for ordered graphs, which performs well on the given set of data. The maximal common subgraph problem then is solved by repeated subgraph extension and checking for isomorphisms. Despite the apparent inefficiency such approach gives an algorithm with time complexity proportional to the number of graphs in the input set and is still practical on the given set of data. As a result we obtain fast methods which can be used for building a database of protein topological motifs, and for the comparison of a given protein of known secondary structure against a motif database

Crossref

Brunel University Research Archive

Graph theoretic methods for the analysis of structural relationships in biological macromolecules

Author: Altschul
Artymiuk
Artymiuk
Artymiuk
Artymiuk
Artymiuk
Barnard
Baxevanis
Benning
Berman
Bernstein
Brint
Brint
Bron
Bruno
Bryant
Crandell
Dean
Diestel
Doubet
Fan
Feizi
Figueras
Flores
Gardiner
Gati
Good
Gray
Groves
Gruer
Gund
Hagadone
Harrison
Holden
Hutchinson
Jasanoff
Johnson
Kanna
Klausner
Kleywegt
Koch
Kraulis
Lengauer
Lesk
Martin
Martin
McGregor
Messmer
Mitchell
Ollis
Pickering
Ray
Raymond
Read
Salton
Samudrala
Sayle
Simon
Srere
Sussenguth
Tesmer
Tinoco
Trinajstic
Tsukada
Ullmann
van Rijsbergen
Willett
Willett
Willett
Willett
Williams
Wilson
Zhang
Publication venue: 'Wiley'
Publication date: 01/01/2005
Field of study

Subgraph isomorphism and maximum common subgraph isomorphism algorithms from graph theory provide an effective and an efficient way of identifying structural relationships between biological macromolecules. They thus provide a natural complement to the pattern matching algorithms that are used in bioinformatics to identify sequence relationships. Examples are provided of the use of graph theory to analyze proteins for which three-dimensional crystallographic or NMR structures are available, focusing on the use of the Bron-Kerbosch clique detection algorithm to identify common folding motifs and of the Ullmann subgraph isomorphism algorithm to identify patterns of amino acid residues. Our methods are also applicable to other types of biological macromolecule, such as carbohydrate and nucleic acid structures

CiteSeerX

Crossref

White Rose Research Online

Sussex Research Online

Quantum Query Complexity of Subgraph Isomorphism and Homomorphism

Author: Kulkarni Raghav
Podder Supartha
Publication venue
Publication date: 21/09/2015
Field of study

Let

H

be a fixed graph on

n

vertices. Let

f_H(G) = 1

iff the input graph

G

n

vertices contains

H

as a (not necessarily induced) subgraph. Let

\alpha_H

denote the cardinality of a maximum independent set of

H

. In this paper we show:

Q(f_H) = \Omega\left(\sqrt{\alpha_H \cdot n}\right),

where

Q(f_H)

denotes the quantum query complexity of

f_H

. As a consequence we obtain a lower bounds for

Q(f_H)

in terms of several other parameters of

H

such as the average degree, minimum vertex cover, chromatic number, and the critical probability. We also use the above bound to show that

Q(f_H) = \Omega(n^{3/4})

for any

H

, improving on the previously best known bound of

\Omega(n^{2/3})

. Until very recently, it was believed that the quantum query complexity is at least square root of the randomized one. Our

\Omega(n^{3/4})

bound for

Q(f_H)

matches the square root of the current best known bound for the randomized query complexity of

f_H

, which is

\Omega(n^{3/2})

due to Gr\"oger. Interestingly, the randomized bound of

\Omega(\alpha_H \cdot n)

for

f_H

still remains open. We also study the Subgraph Homomorphism Problem, denoted by

f_{[H]}

, and show that

Q(f_{[H]}) = \Omega(n)

. Finally we extend our results to the

3

-uniform hypergraphs. In particular, we show an

\Omega(n^{4/5})

bound for quantum query complexity of the Subgraph Isomorphism, improving on the previously known

\Omega(n^{3/4})

bound. For the Subgraph Homomorphism, we obtain an

\Omega(n^{3/2})

bound for the same.Comment: 16 pages, 2 figure

arXiv.org e-Print Archive

DROPS Dagstuhl Research Online Publication Server

Efficient Subgraph Similarity Search on Large Probabilistic Graph Databases

Author: Chen Lei
Wang Guoren
Wang Haixun
Yuan Ye
Publication venue
Publication date: 01/01/2012
Field of study

Many studies have been conducted on seeking the efficient solution for subgraph similarity search over certain (deterministic) graphs due to its wide application in many fields, including bioinformatics, social network analysis, and Resource Description Framework (RDF) data management. All these works assume that the underlying data are certain. However, in reality, graphs are often noisy and uncertain due to various factors, such as errors in data extraction, inconsistencies in data integration, and privacy preserving purposes. Therefore, in this paper, we study subgraph similarity search on large probabilistic graph databases. Different from previous works assuming that edges in an uncertain graph are independent of each other, we study the uncertain graphs where edges' occurrences are correlated. We formally prove that subgraph similarity search over probabilistic graphs is #P-complete, thus, we employ a filter-and-verify framework to speed up the search. In the filtering phase,we develop tight lower and upper bounds of subgraph similarity probability based on a probabilistic matrix index, PMI. PMI is composed of discriminative subgraph features associated with tight lower and upper bounds of subgraph isomorphism probability. Based on PMI, we can sort out a large number of probabilistic graphs and maximize the pruning capability. During the verification phase, we develop an efficient sampling algorithm to validate the remaining candidates. The efficiency of our proposed solutions has been verified through extensive experiments.Comment: VLDB201

arXiv.org e-Print Archive

CiteSeerX

Crossref

Hong Kong University of Science and Technology Institutional Repository