Search CORE

15 research outputs found

Finding Frequent Patterns in a Large Sparse Graph*

Author: A. Inokuchi
B.D. McKay
D.J. Cook
D.J. Cook
D.J. Cook
D.S. Hochbaum
E.M. Mitchell
George Karypis
H.M. Berman
H.M. Grindley
I. Jonyer
I. Jonyer
I. Koch
J.M. Kleinberg
J.M. Kleinberg
J.M. Robson
J.W. Raymond
K. Yoshida
K. Yoshida
M. Kuramochi
M.M. Halldórsson
M.R. Garey
Michihiro Kuramochi
N. Leibowitz
P.R.J. Östergård
R.C. Read
S.H. Muggleton
W. Lee
X. Pennec
X. Wang
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Similarity searching in databases of three-dimensional molecules and macromolecules

Author: Allen F.H.
Artymiuk P.J.
Bath P.A.
Grindley H.M.
Pepperrell C.A.
Poirrette A.R.
Rice D.W.
Taylor R.
Thorner D.A.
Wild D.J.
Willett P.
Publication venue: 'American Chemical Society (ACS)'
Publication date: 01/11/1992
Field of study

This paper discusses algorithmic techniques for measuring the degree of similarity between pairs of threedimensional (3-D) chemical molecules represented by interatomic distance matrices. A comparison of four methods for the calculation of 3-D structural similarity suggests that the most effective one is a procedure that identifies pairs of atoms, one from each of the molecules that are being compared, that lie at the center of geometrically-related volumes of 3-D space. This atom mapping method enables the calculation of a wide range of types of intermolecular similarity coefficient, including measures that are based on physicochemical data. Massively-parallel implementations of the method are discussed, using the AMT Distributed Array Processor, that achieve a substantial increase in performance when compared with a sequential implementation on a UNIX workstation. Current work involves the use of angular information and the extension of the method to field-based similarity searching. Similarity searching in 3-D macromolecules is effected by the use of a maximal common subgraph (MCS) isomorphism algorithm with a novel, graph-based representation of the tertiary structures of proteins. This algorithm is being used to identify similarities between the 3-D structures of proteins in the Brookhaven Protein Data Bank; its use is exemplified by searches involving the NAD-binding fold motif

White Rose Research Online

Lessons learned from exploring the backtracking paradigm on the GPU

Author: B. Zhang
C. Bron
D. Chakrabarti
D.L. Tabb
H.M. Grindley
J. Moon
J.D. Owens
K. Zhou
M.C. Schmidt
P. Harish
V. Kumar
Publication venue: Springer
Publication date: 01/01/2011
Field of study

Abstract. We explore the backtracking paradigm with properties seen as sub-optimal for GPU architectures, using as a case study the maximal clique enumeration problem, and find that the presence of these properties limit GPU performance to approximately 1.4–2.25 times a single CPU core. The GPU performance “lessons ” we find critical to providing this performance include a coarse-and-fine-grain parallelization of the search space, a low-overhead load-balanced distribution of work, global memory latency hiding through coalescence, saturation, and shared memory utilization, and the use of GPU output buffering as a solution to irregular workloads and a large solution domain. We also find a strong reliance on an efficient global problem structure representation that bounds any efficiencies gained from these lessons, and discuss the meanings of these results to backtracking problems in general.

CiteSeerX

Crossref

eScholarship - University of California

Representation of protein secondary structure using bond-orientational order parameters

Author: A.G. Brevern de
A.G. Murzin
A.P. Joseph
A.R. Atilgan
B. Offmann
C. Atilgan
D. Frishman
G. Leban
H.M. Berman
H.M. Grindley
J. Demsar
J. Demšar
J. Martin
J.A. Hanley
M.N. Fodje
P.J. Steinhardt
S. Torquato
T.M. Truskett
W.J. Sternberg
Y. Zhang
Publication venue: Springer Berlin Heidelberg
Publication date: 01/01/2012
Field of study

Structural studies of proteins for motif mining and other pattern recognition techniques require the abstraction of the structure into simpler elements for robust matching. In this study, we propose the use of bond-orientational order parameters, a well-established metric usually employed to compare atom packing in crystals and liquids. Creating a vector of orientational order parameters of residue centers in a sliding window fashion provides us with a descriptor of local structure and connectivity around each residue that is easy to calculate and compare. To test whether this representation is feasible and applicable to protein structures, we tried to predict the secondary structure of protein segments from those descriptors, resulting in 0.99 AUC (area under the ROC curve). Clustering those descriptors to 6 clusters also yield 0.93 AUC, showing that these descriptors can be used to capture and distinguish local structural information

Crossref

Sabanci University Research Database