Search CORE

Edinburgh Research Explorer

Caltech Authors

Finite Element Algorithms and Data Structures on Graphical Processing Units

Author: C Cecka
C Johnson
D Komatitsch
EL Poole
G Alefeld
I. Z. Reguly
J Bolz
KJ Fidkowski
M. B. Giles
O Axelsson
WmW Hwu
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 04/12/2013
Field of study

The finite element method (FEM) is one of the most commonly used techniques for the solution of partial differential equations on unstructured meshes. This paper discusses both the assembly and the solution phases of the FEM with special attention to the balance of computation and data movement. We present a GPU assembly algorithm that scales to arbitrary degree polynomials used as basis functions, at the expense of redundant computations. We show how the storage of the stiffness matrix affects the performance of both the assembly and the solution. We investigate two approaches: global assembly into the CSR and ELLPACK matrix formats and matrix-free algorithms, and show the trade-off between the amount of indexing data and stiffness data. We discuss the performance of different approaches in light of the implicit caches on Fermi GPUs and show a speedup over a two-socket 12-core CPU of up to 10 times in the assembly and up to 6 times in the solution phase. We present our sparse matrix-vector multiplication algorithms that are part of a conjugate gradient iteration and show that a matrix-free approach may be up to two times faster than global assembly approaches and up to 4 times faster than NVIDIA’s cuSPARSE library, depending on the preconditioner used

Oxford University Research Archive

A human MAP kinase interactome.

Author: A Friedman
A Karydis
A Lunardi
AA Reszka
AC Gavin
AJ Whitmarsh
BP Kelley
C Widmann
Chih-yuan Chiang
Christopher H Martin
Cornelia Kurschner
D Wang
Diane L Barber
DJ LaCount
GD Bader
GL Johnson
J Kim
J Ptacek
JV Olsen
Jyoti Srivastava
K Hayashi
K Venkatesan
KC Gunsalus
L Chang
L Collin
M Baumgartner
M Karin
M Qi
M Rothe
Merril Gersten
Mike Smoot
P Uetz
PJ Cullen
R Hooley
R Konig
R Konig
R Konig
Russell Bell
S Peri
SA Johnson
SK Chanda
Sourav Bandyopadhyay
SP Denker
Sudhir Sahasrabudhe
Suhaila White
Sumit K Chanda
T Ito
T Reguly
Trey Ideker
W Kolch
W Kolch
Y Ho
Publication venue: eScholarship, University of California
Publication date: 01/10/2010
Field of study

Mitogen-activated protein kinase (MAPK) pathways form the backbone of signal transduction in the mammalian cell. Here we applied a systematic experimental and computational approach to map 2,269 interactions between human MAPK-related proteins and other cellular machinery and to assemble these data into functional modules. Multiple lines of evidence including conservation with yeast supported a core network of 641 interactions. Using small interfering RNA knockdowns, we observed that approximately one-third of MAPK-interacting proteins modulated MAPK-mediated signaling. We uncovered the Na-H exchanger NHE1 as a potential MAPK scaffold, found links between HSP90 chaperones and MAPK pathways and identified MUC12 as the human analog to the yeast signaling mucin Msb2. This study makes available a large resource of MAPK interactions and clone libraries, and it illustrates a methodology for probing signaling networks based on functional refinement of experimentally derived protein-interaction maps

eScholarship - University of California

Partitioning optimization of proteins from Zea mays malt in ATPS PEG 6000/CaCl2

Author: Albertsson P. -Å.
Alex Ferreira Evangelista
Barros Neto B.
Biazus J. P. M.
Bradford M. M.
Cabral J. M. S.
Cleland J.
Diamond A. D.
Elias Basile Tambourgi
Elizabete Jordão
Graziela Batista Ferreira
Gunduz U.
Higuti I. H.
José Carlos Curvelo Santana
João Baptista Severo Junior
Malavasi U. C.
Matiasson B.
Nirmala M.
Reguly J. C.
Roberto Rodrigues de Souza
Santana J. C. C.
Silva M. E.
Zaslasvsky B. Y.
Zhi W.
Publication venue: 'FapUNIFESP (SciELO)'
Publication date
Field of study

Reuse of structural domain–domain interactions in protein networks

Author: A Grigoriev
AC Gavin
Alex Bateman
Benjamin Schuster-Böckler
C von Mering
H Hermjakob
H Lee
J Bravo
J Park
K Peng
L Giot
ME Cusick
P Aloy
P Aloy
P Pagel
P Uetz
R Jothi
R Riley
RD Finn
S Peri
S Wuchty
SJ Littler
SK Ng
T Ito
T Reguly
TKB Gandhi
TMW Nye
Z Itzhaki
Publication venue: BioMed Central
Publication date: 01/01/2007
Field of study

Abstract Background Protein interactions are thought to be largely mediated by interactions between structural domains. Databases such as <it>i</it>Pfam relate interactions in protein structures to known domain families. Here, we investigate how the domain interactions from the <it>i</it>Pfam database are distributed in protein interactions taken from the HPRD, MPact, BioGRID, DIP and IntAct databases. Results We find that known structural domain interactions can only explain a subset of 4–19% of the available protein interactions, nevertheless this fraction is still significantly bigger than expected by chance. There is a correlation between the frequency of a domain interaction and the connectivity of the proteins it occurs in. Furthermore, a large proportion of protein interactions can be attributed to a small number of domain interactions. We conclude that many, but not all, domain interactions constitute reusable modules of molecular recognition. A substantial proportion of domain interactions are conserved between <it>E. coli</it>, <it>S. cerevisiae </it>and <it>H. sapiens</it>. These domains are related to essential cellular functions, suggesting that many domain interactions were already present in the last universal common ancestor. Conclusion Our results support the concept of domain interactions as reusable, conserved building blocks of protein interactions, but also highlight the limitations currently imposed by the small number of available protein structures.</p

Oxford University Research Archive

Generating confidence intervals on biological networks

Author: A Wagner
B Lemos
BD Ripley
C Robert
C Tucker
D Drummond
E de Silva
F Picard
G Arfken
H Hermjakob
H Yu
HB Fraser
I Agrafioti
I Xenarios
IK Jordan
J Berg
JS Bader
M Gavin
M Newman
M Stumpf
Michael PH Stumpf
MW Hahn
N Luscombe
N Metropolis
P Bork
R Cho
R Milo
R Milo
T Reguly
Thomas Thorne
Publication venue: BioMed Central
Publication date: 01/01/2007
Field of study

Abstract Background In the analysis of networks we frequently require the statistical significance of some network statistic, such as measures of similarity for the properties of interacting nodes. The structure of the network may introduce dependencies among the nodes and it will in general be necessary to account for these dependencies in the statistical analysis. To this end we require some form of Null model of the network: generally rewired replicates of the network are generated which preserve only the degree (number of interactions) of each node. We show that this can fail to capture important features of network structure, and may result in unrealistic significance levels, when potentially confounding additional information is available. Methods We present a new network resampling Null model which takes into account the degree sequence as well as available biological annotations. Using gene ontology information as an illustration we show how this information can be accounted for in the resampling approach, and the impact such information has on the assessment of statistical significance of correlations and motif-abundances in the <it>Saccharomyces cerevisiae </it>protein interaction network. An algorithm, GOcardShuffle, is introduced to allow for the efficient construction of an improved Null model for network data. Results We use the protein interaction network of <it>S. cerevisiae</it>; correlations between the evolutionary rates and expression levels of interacting proteins and their statistical significance were assessed for Null models which condition on different aspects of the available data. The novel GOcardShuffle approach results in a Null model for annotated network data which appears better to describe the properties of real biological networks. Conclusion An improved statistical approach for the statistical analysis of biological network data, which conditions on the available biological information, leads to qualitatively different results compared to approaches which ignore such annotations. In particular we demonstrate the effects of the biological organization of the network can be sufficient to explain the observed similarity of interacting proteins.</p

University of Melbourne Institutional Repository

Network-based functional enrichment

Author: A Subramanian
AA Margolin
C Harbison
C Lefebvre
C Stark
Christopher L Poirel
Clifford C Owens
D Beisser
D Szklarczyk
ES Welf
F arkowetz
FF Mohammed
G Joshi-Tope
GF Berriz
GK Smyth
I Ulitsky
J Rual
K Wang
L Soucek
LM Heltemes-Harris
M Ashburner
M Bansal
M Kanehisa
MT Dittrich
NJ Hewitt
P Shannon
P Uetz
R Milo
S Bauer
S Grossmann
S Malin
SR Collins
T Ideker
T M Murali
T Reguly
VD Longo
Y Kim
Y Lu
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Abstract Background Many methods have been developed to infer and reason about molecular interaction networks. These approaches often yield networks with hundreds or thousands of nodes and up to an order of magnitude more edges. It is often desirable to summarize the biological information in such networks. A very common approach is to use gene function enrichment analysis for this task. A major drawback of this method is that it ignores information about the edges in the network being analyzed, i.e., it treats the network simply as a set of genes. In this paper, we introduce a novel method for functional enrichment that explicitly takes network interactions into account. Results Our approach naturally generalizes Fisher’s exact test, a gene set-based technique. Given a function of interest, we compute the subgraph of the network induced by genes annotated to this function. We use the sequence of sizes of the connected components of this sub-network to estimate its connectivity. We estimate the statistical significance of the connectivity empirically by a permutation test. We present three applications of our method: i) determine which functions are enriched in a given network, ii) given a network and an interesting sub-network of genes within that network, determine which functions are enriched in the sub-network, and iii) given two networks, determine the functions for which the connectivity improves when we merge the second network into the first. Through these applications, we show that our approach is a natural alternative to network clustering algorithms. Conclusions We presented a novel approach to functional enrichment that takes into account the pairwise relationships among genes annotated by a particular function. Each of the three applications discovers highly relevant functions. We used our methods to study biological data from three different organisms. Our results demonstrate the wide applicability of our methods. Our algorithms are implemented in C++ and are freely available under the GNU General Public License at our supplementary website. Additionally, all our input data and results are available at <url>http://bioinformatics.cs.vt.edu/~murali/supplements/2011-incob-nbe/</url>.</p

Identifying protein complexes directly from high-throughput TAP data with Markov random fields

Author: AC Gavin
AC Gavin
AD King
Alexander Schliep
Arno Schödl
C von Mering
E Segal
G Bader
G Bader
G Rigaut
I Lee
J Pereira-Leal
J Zhang
M Deng
M Deng
MA Gilchrist
NJ Krogan
NJ Krogan
P Kemmeren
P Uetz
R Kinderman
R Krause
R Krause
RO Duda
Roland Krause
S Brohée
S van Dongen
SZ Li
T Ito
T Reguly
V Spirin
Wasinee Rungsarityotin
Y Ho
Publication venue: BioMed Central
Publication date: 01/01/2007
Field of study

Abstract Background Predicting protein complexes from experimental data remains a challenge due to limited resolution and stochastic errors of high-throughput methods. Current algorithms to reconstruct the complexes typically rely on a two-step process. First, they construct an interaction graph from the data, predominantly using heuristics, and subsequently cluster its vertices to identify protein complexes. Results We propose a model-based identification of protein complexes directly from the experimental observations. Our model of protein complexes based on Markov random fields explicitly incorporates false negative and false positive errors and exhibits a high robustness to noise. A model-based quality score for the resulting clusters allows us to identify reliable predictions in the complete data set. Comparisons with prior work on reference data sets shows favorable results, particularly for larger unfiltered data sets. Additional information on predictions, including the source code under the GNU Public License can be found at http://algorithmics.molgen.mpg.de/Static/Supplements/ProteinComplexes. Conclusion We can identify complexes in the data obtained from high-throughput experiments without prior elimination of proteins or weak interactions. The few parameters of our model, which does not rely on heuristics, can be estimated using maximum likelihood without a reference data set. This is particularly important for protein complex studies in organisms that do not have an established reference frame of known protein complexes.</p

Open Repository and Bibliography - Luxembourg

MPG.PuRe

Facile whole mitochondrial genome resequencing from nipple aspirate fluid using MitoChip v2.0

Author: A Maitra
Alioune Ngom
Andrea Maggrah
BL King
Brian Reguly
C Isaacs
C Jeronimo
ER Sauter
F Legros
G Mitchell
Gabriel D Dakubo
GC Buehring
GD Dakubo
J Felsenstein
J He
J Maki
JB Jones
Jennifer Maki
John P Jakupciak
JP Jakupciak
JP Jakupciak
JS Penta
K Polyak
K Visvanathan
Katrina Maki
Ken Gehman
Kerry Robinson
L Jacobs
LM Tseng
M Wrensch
MR Wrensch
MR Wrensch
MS Fliss
P Sharma
Paul D Wagner
RG van Eijsden
Robert E Thayer
Roy Wittock
Ryan L Parr
S Krishnamurthy
S Zhou
SA Higgins
SA Khan
Samantha Maragh
SM Richard
Sudhir Srivastava
Teresa Gehman
W Zhu
WC Dooley
Publication venue: BioMed Central
Publication date: 01/01/2008
Field of study

Abstract Background Mutations in the mitochondrial genome (mtgenome) have been associated with many disorders, including breast cancer. Nipple aspirate fluid (NAF) from symptomatic women could potentially serve as a minimally invasive sample for breast cancer screening by detecting somatic mutations in this biofluid. This study is aimed at 1) demonstrating the feasibility of NAF recovery from symptomatic women, 2) examining the feasibility of sequencing the entire mitochondrial genome from NAF samples, 3) cross validation of the Human mitochondrial resequencing array 2.0 (MCv2), and 4) assessing the somatic mtDNA mutation rate in benign breast diseases as a potential tool for monitoring early somatic mutations associated with breast cancer. Methods NAF and blood were obtained from women with symptomatic benign breast conditions, and we successfully assessed the mutation load in the entire mitochondrial genome of 19 of these women. DNA extracts from NAF were sequenced using the mitochondrial resequencing array MCv2 and by capillary electrophoresis (CE) methods as a quality comparison. Sequencing was performed independently at two institutions and the results compared. The germline mtDNA sequence determined using DNA isolated from the patient's blood (control) was compared to the mutations present in cellular mtDNA recovered from patient's NAF. Results From the cohort of 28 women recruited for this study, NAF was successfully recovered from 23 participants (82%). Twenty two (96%) of the women produced fluids from both breasts. Twenty NAF samples and corresponding blood were chosen for this study. Except for one NAF sample, the whole mtgenome was successfully amplified using a single primer pair, or three pairs of overlapping primers. Comparison of MCv2 data from the two institutions demonstrates 99.200% concordance. Moreover, MCv2 data was 99.999% identical to CE sequencing, indicating that MCv2 is a reliable method to rapidly sequence the entire mtgenome. Four NAF samples contained somatic mutations. Conclusion We have demonstrated that NAF is a suitable material for mtDNA sequence analysis using the rapid and reliable MCv2. Somatic mtDNA mutations present in NAF of women with benign breast diseases could potentially be used as risk factors for progression to breast cancer, but this will require a much larger study with clinical follow up.</p

Trees on networks: resolving statistical patterns of phylogenetic similarities among interacting proteins

Author: A Valencia
A Wagner
AC Gavin
AK Ramani
AK Ramani
B Lemos
BL Drees
C Brun
CM Deane
CS Goh
CS Goh
D Fitzpatrick
D Juan
D Juan
DA Drummond
DR Strong
E Alm
E Bender
E de Silva
E de Silva
F Pazos
F Pazos
F Tajima
HB Fraser
I Agrafioti
I Jordan
I Xenarios
J Berg
J Felsenstein
J Felsenstein
J Gertz
J Yu
JD Thompson
JS Bader
K Wolfe
L Hakes
L Salwinski
L Skrabanek
M Ashburner
M Pellegrini
Michael PH Stumpf
N Bhardwaj
P Erdös
P Harvey
P Pamilo
P Sharp
R Cho
R Jothi
R Milo
RM May
T Reguly
T Schlitt
T Thorne
William P Kelly
WP Kelly
Z Yang
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Abstract Background Phylogenies capture the evolutionary ancestry linking extant species. Correlations and similarities among a set of species are mediated by and need to be understood in terms of the phylogenic tree. In a similar way it has been argued that biological networks also induce correlations among sets of interacting genes or their protein products. Results We develop suitable statistical resampling schemes that can incorporate these two potential sources of correlation into a single inferential framework. To illustrate our approach we apply it to protein interaction data in yeast and investigate whether the phylogenetic trees of interacting proteins in a panel of yeast species are more similar than would be expected by chance. Conclusions While we find only negligible evidence for such increased levels of similarities, our statistical approach allows us to resolve the previously reported contradictory results on the levels of co-evolution induced by protein-protein interactions. We conclude with a discussion as to how we may employ the statistical framework developed here in further functional and evolutionary analyses of biological networks and systems.</p