71 research outputs found

Batch solution of small PDEs with the OPS DSL

Author: E László
GR Mudalige
H Carter Edwards
H Wang
IZ Reguly
JE Stone
JG Verwer
K In’t Hout
K In’t Hout
M Wyns
P MacNeice
R Chandra
R Nath
S Kronawitter
SP Jammy
T Deakin
W Gropp
W Hundsdorfer
Publication venue: Springer Science and Business Media LLC
Publication date: 01/01/2019

In this paper we discuss the challenges and optimisation opportunities when solving a large number of small, equally sized discretised PDEs on regular grids. We present an extension of the OPS (Oxford Parallel library for Structured meshes) embedded Domain Specific Language, and show how support can be added for solving multiple systems, and how OPS makes it easy to deploy a variety of transformations and optimisations. The new capabilities in OPS allow data structure transformations, as well as execution schedule transformations, to be applied automatically to deliver high performance on a variety of hardware platforms. We evaluate our work on an industrially representative finance simulation on Intel CPUs, as well as NVIDIA GPUs.

Crossref

Warwick Research Archives Portal Repository

Repository of the Academy's Library

The pseudo-mitochondrial genome influences mistakes in heteroplasmy interpretation

Author: Aguirre Andrea
Dakubo Gabriel D
Jakupciak John P
Maki Jennifer
Parr Ryan L
Reguly Brian
Robinson Kerry
Thayer Robert E
Wittock Roy
Publication venue: BioMed Central
Publication date: 01/01/2006

BACKGROUND: Nuclear mitochondrial pseudogenes (numts) are a potential source of contamination during mitochondrial DNA PCR amplification. This possibility warrants careful experimental design and cautious interpretation of heteroplasmic results. RESULTS: Here we report the cloning and sequencing of numts loci, amplified from human tissue and rho-zero (ρ⁰) cells (control) with primers known to amplify the mitochondrial genome. This paper is the first to fully sequence 46 paralogous nuclear DNA fragments that represent the entire mitochondrial genome. This is a surprisingly small number, due primarily to the primer sets used in this study, given that earlier BLAST searches suggested that nuclear DNA harbors between 400 and 1,500 paralogous mitochondrial DNA fragments. Our results indicate that multiple numts were amplified simultaneously with the mitochondrial genome and increased the load of pseudogene signal in PCR reactions. Further, the entire mitochondrial genome was represented by multiple copies of paralogous nuclear sequences. CONCLUSION: These findings suggest that mitochondrial genome disease-associated biomarkers must be rigorously authenticated to preclude any affiliation with paralogous nuclear pseudogenes. Importantly, the common perception that mitochondrial template "swamps" numts loci, precluding detectable amplification, depends on the region of the mitochondrial genome targeted by the PCR reaction and the number of pseudogene loci that may co-amplify. Cloning and relevant sequencing data will facilitate the correct interpretation. This is the first complete, wet-lab characterization of numts that represent the entire mitochondrial genome.

Springer - Publisher Connector

PubMed Central

Improved Network Performance via Antagonism: From Synthetic Rescues to Multi-drug Combinations

Author: Adilson E. Motter
Baba
Blagosklonny
Bollenbach
Bourgeois
Burgard
Burgard
Cagatay
Chait
Cun
Dancey
Duarte
Edwards
Feist
Fischer
Fong
Fong
Fong
Fong
Fraenkel
Gerdes
Giaever
Goebl
Harrison
Hashimoto
Hegreness
Henry
Herring
Hilgetag
Hilgetag
Hillenmeyer
Hopkins
Huttenlocher
Huttenlocher
Isalan
Kauffman
Keiser
Kim
Kitano
Kupiec
Ma
Miller
Motter
Motter
Nishikawa
Olson
Palsson
Papp
Pharkya
Reguly
Segre
Segre
Shlomi
Timmermans
Wagner
Yeh
Yildirim
Yngvadottir
Zamboni
Publication venue: Wiley
Publication date: 17/03/2010

Recent research shows that a faulty or sub-optimally operating metabolic network can often be rescued by the targeted removal of enzyme-coding genes, the exact opposite of what traditional gene therapy would suggest. Predictions go as far as to assert that certain gene knockouts can restore the growth of otherwise nonviable gene-deficient cells. Many questions follow from this discovery: What are the underlying mechanisms? How generalizable is this effect? What are the potential applications? Here, I will approach these questions from the perspective of compensatory perturbations on networks. Relations will be drawn between such synthetic rescues and naturally occurring cascades of reaction inactivation, as well as their analogues in physical and other biological networks. I will discuss in particular how rescue interactions can lead to the rational design of antagonistic drug combinations that select against resistance and how they can illuminate medical research on cancer, antibiotics, and metabolic diseases. Comment: Online Open "Problems and Paradigms" article.

arXiv.org e-Print Archive

Crossref

PubMed Central

Generating confidence intervals on biological networks

Author: A Wagner
B Lemos
BD Ripley
C Robert
C Tucker
D Drummond
E de Silva
F Picard
G Arfken
H Hermjakob
H Yu
HB Fraser
I Agrafioti
I Xenarios
IK Jordan
J Berg
JS Bader
M Gavin
M Newman
M Stumpf
Michael PH Stumpf
MW Hahn
N Luscombe
N Metropolis
P Bork
R Cho
R Milo
R Milo
T Reguly
Thomas Thorne
Publication venue: BioMed Central
Publication date: 01/01/2007

Background: In the analysis of networks we frequently require the statistical significance of some network statistic, such as measures of similarity for the properties of interacting nodes. The structure of the network may introduce dependencies among the nodes and it will in general be necessary to account for these dependencies in the statistical analysis. To this end we require some form of Null model of the network: generally rewired replicates of the network are generated which preserve only the degree (number of interactions) of each node. We show that this can fail to capture important features of network structure, and may result in unrealistic significance levels, when potentially confounding additional information is available. Methods: We present a new network resampling Null model which takes into account the degree sequence as well as available biological annotations. Using gene ontology information as an illustration we show how this information can be accounted for in the resampling approach, and the impact such information has on the assessment of statistical significance of correlations and motif-abundances in the Saccharomyces cerevisiae protein interaction network. An algorithm, GOcardShuffle, is introduced to allow for the efficient construction of an improved Null model for network data. Results: We use the protein interaction network of S. cerevisiae; correlations between the evolutionary rates and expression levels of interacting proteins and their statistical significance were assessed for Null models which condition on different aspects of the available data. The novel GOcardShuffle approach results in a Null model for annotated network data which appears to better describe the properties of real biological networks. Conclusion: An improved statistical approach for the analysis of biological network data, which conditions on the available biological information, leads to qualitatively different results compared to approaches which ignore such annotations. In particular we demonstrate that the effects of the biological organization of the network can be sufficient to explain the observed similarity of interacting proteins.
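The baseline null model mentioned in this abstract, rewired replicates that preserve only each node's degree, can be illustrated with a short sketch. The code below is a minimal illustration using the standard double-edge-swap technique, not the authors' GOcardShuffle algorithm, whose details the abstract does not give. The function names, the toy six-node network, and the retry limit are all assumptions made for this example.

```python
import random
from collections import Counter


def degree_preserving_rewire(edges, n_swaps, seed=0):
    """Return a rewired copy of an undirected simple graph.

    Repeatedly picks two edges (a, b) and (c, d) and replaces them with
    (a, d) and (c, b). Swaps that would create a self-loop or a duplicate
    edge are skipped, so every node keeps its original degree.
    """
    rng = random.Random(seed)
    edge_list = [tuple(e) for e in edges]
    if len(edge_list) < 2:
        return edge_list  # nothing to swap
    present = {frozenset(e) for e in edge_list}
    done = 0
    attempts = 0
    max_attempts = 100 * n_swaps  # hypothetical cap to avoid endless retries
    while done < n_swaps and attempts < max_attempts:
        attempts += 1
        i, j = rng.sample(range(len(edge_list)), 2)
        a, b = edge_list[i]
        c, d = edge_list[j]
        if len({a, b, c, d}) < 4:
            continue  # shared endpoint: swap could create a self-loop
        new1, new2 = frozenset((a, d)), frozenset((c, b))
        if new1 in present or new2 in present:
            continue  # swap would duplicate an existing edge
        present.discard(frozenset((a, b)))
        present.discard(frozenset((c, d)))
        present.add(new1)
        present.add(new2)
        edge_list[i] = (a, d)
        edge_list[j] = (c, b)
        done += 1
    return edge_list


def degrees(edges):
    """Count the number of interactions of each node."""
    deg = Counter()
    for u, v in edges:
        deg[u] += 1
        deg[v] += 1
    return deg


# Toy network (hypothetical): a ring of six proteins.
toy_edges = [("A", "B"), ("B", "C"), ("C", "D"),
             ("D", "E"), ("E", "F"), ("F", "A")]
replicate = degree_preserving_rewire(toy_edges, n_swaps=20, seed=1)
```

A statistic computed on many such replicates gives a reference distribution for the observed network. As the abstract argues, this baseline ignores annotations such as gene ontology terms, which is the limitation the annotation-aware null model is meant to address.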

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

University of Melbourne Institutional Repository

Identifying protein complexes directly from high-throughput TAP data with Markov random fields

Author: AC Gavin
AC Gavin
AD King
Alexander Schliep
Arno Schödl
C von Mering
E Segal
G Bader
G Bader
G Rigaut
I Lee
J Pereira-Leal
J Zhang
M Deng
M Deng
MA Gilchrist
NJ Krogan
NJ Krogan
P Kemmeren
P Uetz
R Kinderman
R Krause
R Krause
RO Duda
Roland Krause
S Brohée
S van Dongen
SZ Li
T Ito
T Reguly
V Spirin
Wasinee Rungsarityotin
Y Ho
Publication venue: BioMed Central
Publication date: 01/01/2007

Background: Predicting protein complexes from experimental data remains a challenge due to limited resolution and stochastic errors of high-throughput methods. Current algorithms to reconstruct the complexes typically rely on a two-step process. First, they construct an interaction graph from the data, predominantly using heuristics, and subsequently cluster its vertices to identify protein complexes. Results: We propose a model-based identification of protein complexes directly from the experimental observations. Our model of protein complexes based on Markov random fields explicitly incorporates false negative and false positive errors and exhibits a high robustness to noise. A model-based quality score for the resulting clusters allows us to identify reliable predictions in the complete data set. Comparisons with prior work on reference data sets show favorable results, particularly for larger unfiltered data sets. Additional information on predictions, including the source code under the GNU Public License, can be found at http://algorithmics.molgen.mpg.de/Static/Supplements/ProteinComplexes. Conclusion: We can identify complexes in the data obtained from high-throughput experiments without prior elimination of proteins or weak interactions. The few parameters of our model, which does not rely on heuristics, can be estimated using maximum likelihood without a reference data set. This is particularly important for protein complex studies in organisms that do not have an established reference frame of known protein complexes.

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Open Repository and Bibliography - Luxembourg

MPG.PuRe

New insights into protein-protein interaction data lead to increased estimates of the S. cerevisiae interactome size

Author: A Grigoriev
A Vinayagam
AC Gavin
AS Schwartz
C Stark
C von Mering
E Sprinzak
GT Hart
H Huang
H Huang
H Jeong
H Sahai
H Yu
I Lee
I Xenarios
JDJ Han
JM Cherry
K Tarassov
K Venkatesan
L Salwinski
Laure Sambourg
M Costanzo
ME Cusick
ME Cusick
MJA Aryee
MPH Stumpf
Nicolas Thierry-Mieg
NJ Krogan
P Braun
P D'haeseleer
P Uetz
S Fields
SW Michnick
T Ito
T Reguly
X Xin
Publication venue: BioMed Central
Publication date: 01/01/2010

Background: As protein interactions mediate most cellular mechanisms, protein-protein interaction networks are essential in the study of cellular processes. Consequently, several large-scale interactome mapping projects have been undertaken, and protein-protein interactions are being distilled into databases through literature curation; yet protein-protein interaction data are still far from comprehensive, even in the model organism Saccharomyces cerevisiae. Estimating the interactome size is important for evaluating the completeness of current datasets, in order to measure the remaining efforts that are required. Results: We examined the yeast interactome from a new perspective, by taking into account how thoroughly proteins have been studied. We discovered that the set of literature-curated protein-protein interactions is qualitatively different when restricted to proteins that have received extensive attention from the scientific community. In particular, these interactions are less often supported by yeast two-hybrid, and more often by more complex experiments such as biochemical activity assays. Our analysis showed that high-throughput and literature-curated interactome datasets are more correlated than commonly assumed, but that this bias can be corrected for by focusing on well-studied proteins. We thus propose a simple and reliable method to estimate the size of an interactome, combining literature-curated data involving well-studied proteins with high-throughput data. It yields an estimate of at least 37,600 direct physical protein-protein interactions in S. cerevisiae. Conclusions: Our method leads to higher and more accurate estimates of the interactome size, as it accounts for interactions that are genuine yet difficult to detect with commonly-used experimental assays. This shows that we are even further from completing the yeast interactome map than previously expected.

Crossref

Hal - Université Grenoble Alpes

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

HAL Descartes

A genome-wide screen for essential yeast genes that affect telomere length maintenance

Author: Barrowman
Ben-Aroya
Blasco
Breslow
Bressan
Chan
Chapon
Dahan
E. Ruppin
Fourel
Garvik
Gatbonton
Gavin
Grandin
Greenwell
Greider
Gross
Harrington
Harrington
Hirano
Houser-Scott
Krogan
L. Ungar
Lendvay
Levy
Luke
Lundblad
Lydall
M. Kupiec
Machesky
Mitchell
Monahan
N. Yosef
Niida
Onge
Ozkaynak
Pleiss
R. Sharan
Reguly
Rog
Schuldiner
Segr
Seto
Shachar
Sharan
Teter
van Steensel
van Vugt
Watson
Wotton
Y. Sela
Zakian
Zakian
Publication venue: Oxford University Press
Publication date

Telomeres are structures composed of repetitive DNA and proteins that protect the chromosomal ends in eukaryotic cells from fusion or degradation, thus contributing to genomic stability. Although telomere length varies between species, in all organisms studied telomere length appears to be controlled by a dynamic equilibrium between elongating mechanisms (mainly addition of repeats by the enzyme telomerase) and nucleases that shorten the telomeric sequences. Two previous studies have analyzed a collection of yeast deletion strains (deleted for nonessential genes) and found over 270 genes that affect telomere length (Telomere Length Maintenance or TLM genes). Here we complete the list of TLM genes by analyzing a collection of strains carrying hypomorphic alleles of most essential genes (DAmP collection). We identify 87 essential genes that affect telomere length in yeast. These genes interact with the nonessential TLM genes in a significant manner, and provide new insights into the mechanisms involved in telomere length maintenance. The newly identified genes span a variety of cellular processes, including protein degradation, pre-mRNA splicing and DNA replication.

Crossref

PubMed Central

Trees on networks: resolving statistical patterns of phylogenetic similarities among interacting proteins

Author: A Valencia
A Wagner
AC Gavin
AK Ramani
AK Ramani
B Lemos
BL Drees
C Brun
CM Deane
CS Goh
CS Goh
D Fitzpatrick
D Juan
D Juan
DA Drummond
DR Strong
E Alm
E Bender
E de Silva
E de Silva
F Pazos
F Pazos
F Tajima
HB Fraser
I Agrafioti
I Jordan
I Xenarios
J Berg
J Felsenstein
J Felsenstein
J Gertz
J Yu
JD Thompson
JS Bader
K Wolfe
L Hakes
L Salwinski
L Skrabanek
M Ashburner
M Pellegrini
Michael PH Stumpf
N Bhardwaj
P Erdös
P Harvey
P Pamilo
P Sharp
R Cho
R Jothi
R Milo
RM May
T Reguly
T Schlitt
T Thorne
William P Kelly
WP Kelly
Z Yang
Publication venue: BioMed Central
Publication date: 01/01/2010

Background: Phylogenies capture the evolutionary ancestry linking extant species. Correlations and similarities among a set of species are mediated by and need to be understood in terms of the phylogenetic tree. In a similar way it has been argued that biological networks also induce correlations among sets of interacting genes or their protein products. Results: We develop suitable statistical resampling schemes that can incorporate these two potential sources of correlation into a single inferential framework. To illustrate our approach we apply it to protein interaction data in yeast and investigate whether the phylogenetic trees of interacting proteins in a panel of yeast species are more similar than would be expected by chance. Conclusions: While we find only negligible evidence for such increased levels of similarities, our statistical approach allows us to resolve the previously reported contradictory results on the levels of co-evolution induced by protein-protein interactions. We conclude with a discussion as to how we may employ the statistical framework developed here in further functional and evolutionary analyses of biological networks and systems.

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

University of Melbourne Institutional Repository

DroID: the Drosophila Interactions Database, a comprehensive resource for annotated gene and protein interactions

Author: AC Gavin
AC Gingras
B Deplancke
B Lehner
BA Shoemaker
C Boone
C von Mering
CA Stanyon
CA Stanyon
CY Lin
E Formstecher
Guozhen Liu
H Yu
I Lee
I Vastrik
Jingkai Yu
JR Parrish
JR Parrish
JS Bader
L Giot
M Deng
M Persico
M Vidal
MN Arbeitman
P Shannon
P Tomancak
Russell L Finley
S Fields
S Fields
S Mathivanan
S Mukherjee
S Pacifico
S Suthram
SD Hooper
Svetlana Pacifico
T Beuming
T Reguly
T Sandmann
TI Lee
Publication venue: BioMed Central
Publication date: 01/01/2008

Background: Charting the interactions among genes and among their protein products is essential for understanding biological systems. A flood of interaction data is emerging from high throughput technologies, computational approaches, and literature mining methods. Quick and efficient access to this data has become a critical issue for biologists. Several excellent multi-organism databases for gene and protein interactions are available, yet most of these have understandable difficulty maintaining comprehensive information for any one organism. No single database, for example, includes all available interactions, integrated gene expression data, and comprehensive and searchable gene information for the important model organism, Drosophila melanogaster. Description: DroID, the Drosophila Interactions Database, is a comprehensive interactions database designed specifically for Drosophila. DroID houses published physical protein interactions, genetic interactions, and computationally predicted interactions, including interologs based on data for other model organisms and humans. All interactions are annotated with original experimental data and source information. DroID can be searched and filtered based on interaction information or a comprehensive set of gene attributes from FlyBase. DroID also contains gene expression and expression correlation data that can be searched and used to filter datasets, for example, to focus a study on sub-networks of co-expressed genes. To address the inherent noise in interaction data, DroID employs an updatable confidence scoring system that assigns a score to each physical interaction based on the likelihood that it represents a biologically significant link. Conclusion: DroID is the most comprehensive interactions database available for Drosophila. To facilitate downstream analyses, interactions are annotated with original experimental information, gene expression data, and confidence scores. All data in DroID are freely available and can be searched, explored, and downloaded through three different interfaces, including a text based web site, a Java applet with dynamic graphing capabilities (IM Browser), and a Cytoscape plug-in. DroID is available at http://www.droidb.org.

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Digital Commons@Wayne State University

Inferring the role of transcription factors in regulatory networks

Author: A Siegel
A Zanzoni
AA Margolin
Anne Siegel
AP Gasch
AP Gasch
AR Joyce
B S
B Xing
C Constantinidou
C Guziolowski
Carito Guziolowski
CH Yeang
CH Yeang
CT Harbison
D Di Bernardo
E Hong
E Segal
F Ferrazzi
GG Roberts
H de Jong
H Hermjakob
H Salgado
Ingenuity-Systems
JJ Faith
JL DeRisi
KD MacIsaac
M Bansal
M Bansal
M Gebser
M Kanehisa
Michel Le Borgne
N Friedman
N Nabil Guelzim
N Nariai
N Ogawa
O Radulescu
Ovidiu Radulescu
P Shannon
P Sudarsanam
P Veber
Philippe Veber
PJ Cullen
R Bryan
RM Gutierrez-Rios
S Chu
S Kauffman
S Mnaimneh
S Peri
T Hughes
T Ideker
T Reguly
TI Lee
Publication venue: BioMed Central
Publication date: 01/01/2007

Background: Expression profiles obtained from multiple perturbation experiments are increasingly used to reconstruct transcriptional regulatory networks, from well studied, simple organisms up to higher eukaryotes. Admittedly, a key ingredient in developing a reconstruction method is its ability to integrate heterogeneous sources of information, as well as to comply with practical observability issues: measurements can be scarce or noisy. In this work, we show how to combine a network of genetic regulations with a set of expression profiles, in order to infer the functional effect of the regulations, as inducer or repressor. Our approach is based on a consistency rule between a network and the signs of variation given by expression arrays. Results: We evaluate our approach in several settings of increasing complexity. First, we generate artificial expression data on a transcriptional network of E. coli extracted from the literature (1529 nodes and 3802 edges), and we estimate that 30% of the regulations can be annotated with about 30 profiles. We additionally prove that at most 40.8% of the network can be inferred using our approach. Second, we use this network in order to validate the predictions obtained with a compendium of real expression profiles. We describe a filtering algorithm that generates particularly reliable predictions. Finally, we apply our inference approach to the S. cerevisiae transcriptional network (2419 nodes and 4344 interactions), by combining ChIP-chip data and 15 expression profiles. We are able to detect and isolate inconsistencies between the expression profiles and a significant portion of the model (15% of all the interactions). In addition, we report predictions for 14.5% of all interactions. Conclusion: Our approach requires neither accurate expression levels nor time series. Nevertheless, we show on both real and artificial data that a relatively small number of perturbation experiments are enough to determine a significant portion of regulatory effects. This is a key practical asset compared to statistical methods for network reconstruction. We demonstrate that our approach is able to provide accurate predictions, even when the network is incomplete and the data is noisy.

HAL-CentraleSupelec

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

INRIA a CCSD electronic archive server

PubMed Central

HAL-Rennes 1