
72 research outputs found

Alignment-free local structural search by writhe decomposition

Author: Baiocco
Berman
Birzele
Chandonia
D. Zhi
Dietmann
Dror
Eidhammer
Harrison
Hasegawa
Holm
Ko
Koch
Kolodny
Levitt
Lindorff-Larsen
M. Shatsky
Madej
McGuffin
Mizuguchi
Murzin
Orengo
Petrey
Rufino
S. E. Brenner
Shindyalov
Shu
Teichert
Zhi
Zotenko
Publication venue: Oxford University Press

Motivation: Rapid methods for protein structure search enable biological discoveries based on flexibly defined structural similarity, unleashing the power of the ever greater number of solved protein structures. Projection methods show promise for the development of fast structural database search solutions. Projection methods map a structure to a point in a high-dimensional space and compare two structures by measuring distance between their projected points. These methods offer a tremendous increase in speed over residue-level structural alignment methods. However, current projection methods are not practical, partly because they are unable to identify local similarities
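The projection idea described in this abstract can be sketched in a few lines. This is a minimal illustration only: it assumes each structure has already been mapped to a fixed-length numeric vector (the writhe-based descriptors of the paper are not reproduced here), it assumes Euclidean distance as the comparison, and the names `projection_distance` and `rank_by_distance` are hypothetical.

```python
import math


def projection_distance(p, q):
    """Euclidean distance between two projected points (assumed metric)."""
    if len(p) != len(q):
        raise ValueError("projected points must have the same dimension")
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(p, q)))


def rank_by_distance(query, database):
    """Return database keys ordered from nearest to farthest from the query.

    `database` maps a structure name to its projected point; both the names
    and the vectors below are made-up toy values.
    """
    return sorted(database, key=lambda name: projection_distance(query, database[name]))


toy_database = {"s1": [3.0, 4.0], "s2": [1.0, 0.0]}
ranking = rank_by_distance([0.0, 0.0], toy_database)
```

Because each comparison is a single vector distance rather than a residue-level alignment, a database scan of this kind costs time proportional to the number of stored points, which is the speed advantage the abstract refers to.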

Crossref

PubMed Central

Revisiting Date and Party Hubs: Novel Approaches to Role Assignment in Protein Interaction Networks

Author: A Gursoy
A Rapoport
AC Gavin
AC Gavin
ACF Lewis
AI Su
AP Gasch
AS Schwartz
AW Rives
B Adamcsek
C von Mering
Charlotte M. Deane
CM Deane
E Hubbell
E Zotenko
G Kar
GD Bader
H Jeong
H Yu
H Yu
HW Mewes
I Maraziotis
IW Taylor
J Chen
J Reichardt
JA Hartigan
JDJ Han
JF Rual
JS Bader
K Komurov
K Tarassov
K Venkatesan
KR Brown
L Hakes
L Hakes
LC Freeman
M Fromont-Racine
M Fromont-Racine
M Girvan
MA Porter
Mason A. Porter
MEJ Newman
MR Wilkins
MS Granovetter
N Bertin
Nick S. Jones
NN Batada
NN Batada
P Braun
P Jaccard
P Kemmeren
P Resnik
P Uetz
Philip E. Bourne
PM Hartigan
PM Kim
PV Missiuro
R Dunn
R Guimerà
R Saeed
RR Vallabhajosyula
S Fortunato
SH Yook
Sumeet Agarwal
T Ito
T Obayashi
T Yamada
V Spirin
WK Lim
Y Ho
Z Wu
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2010

The idea of 'date' and 'party' hubs has been influential in the study of protein-protein interaction networks. Date hubs display low co-expression with their partners, whilst party hubs have high co-expression. It was proposed that party hubs are local coordinators whereas date hubs are global connectors. Here we show that the reported importance of date hubs to network connectivity can in fact be attributed to a tiny subset of them. Crucially, these few, extremely central, hubs do not display particularly low expression correlation, undermining the idea of a link between this quantity and hub function. The date/party distinction was originally motivated by an approximately bimodal distribution of hub co-expression; we show that this feature is not always robust to methodological changes. Additionally, topological properties of hubs do not in general correlate with co-expression. Thus, we suggest that a date/party dichotomy is not meaningful and it might be more useful to conceive of roles for protein-protein interactions rather than individual proteins. We find significant correlations between interaction centrality and the functional similarity of the interacting proteins. Comment: 27 pages, 5 main figures, 4 supplementary figures.

arXiv.org e-Print Archive

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Oxford University Research Archive

Spiral - Imperial College Digital Repository

DNA methylation is required to maintain both DNA replication timing precision and 3D genome organization integrity

Author: Achinger-Kawecka J.
Armstrong N.J.
Barton K.
Buckley M.
Caldon C.E.
Campbell E.M.
Chan C-L
Chia K-M
Clark S.J.
Deveson I.W.
Du Q.
Ferguson J.M.
Gould C.M.
Kaczorowski D.
Lim E.
Luu P.L.
Nair S.S.
Portman N.
Powell J.E.
Skvortsova K.
Smith G.C.
Smith M.A.
Stirzaker C.
Zotenko E.
Publication venue: Elsevier Inc.
Publication date: 01/01/2021

DNA replication timing and three-dimensional (3D) genome organization are associated with distinct epigenome patterns across large domains. However, whether alterations in the epigenome, in particular cancer-related DNA hypomethylation, affect higher-order levels of genome architecture is still unclear. Here, using Repli-Seq, single-cell Repli-Seq, and Hi-C, we show that genome-wide methylation loss is associated with both concordant loss of replication timing precision and deregulation of 3D genome organization. Notably, we find distinct disruption in 3D genome compartmentalization, striking gains in cell-to-cell replication timing heterogeneity and loss of allelic replication timing in cancer hypomethylation models, potentially through the gene deregulation of DNA replication and genome organization pathways. Finally, we identify ectopic H3K4me3-H3K9me3 domains from across large hypomethylated domains, where late replication is maintained, which we purport serves to protect against catastrophic genome reorganization and aberrant gene transcription. Our results highlight a potential role for the methylome in the maintenance of 3D genome regulation.

Research Repository

MCL-CAw: A refinement of MCL for detecting yeast complexes from weighted PPI networks by incorporating core-attachment structure

Author: A Eberharter
A Mitrofanova
AC Gavin
AC Gavin
AD King
AJ Enright
B Breitkreutz
B Zhang
C Fridel
C Friedel
C von Mering
DF Seals
E Zotenko
EA Winzeler
G Giaever
G Hart
G Liu
G Liu
G Rigaut
GD Bader
H Cheng
H Chua
H Jeong
H Leung
H Wang
Hon Wai Leong
HW Mewes
J Hurwitz
J Zhao
JC Mellor
JD Han
JM Cherry
JS Luz
K Voevodski
Kang Ning
M Ashburner
M Wu
N Batada
NJ Krogan
P Aloy
P Carvalho
P Shannon
P Uetz
PA Grant
PA Grant
S Brohee
S Dongen
S Pu
S Pu
S Srihari
S Srihari
SR Collins
Sriganesh Srihari
T Ito
T Miller
X Zhou
Y Araki
Y Ho
Y Ozawa
Y Tao
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2010

Background: The reconstruction of protein complexes from the physical interactome of organisms serves as a building block towards understanding the higher level organization of the cell. Over the past few years, several independent high-throughput experiments have helped to catalogue an enormous amount of physical protein interaction data from organisms such as yeast. However, these individual datasets show a lack of correlation with each other and also contain a substantial number of false positives (noise). Over the same period, several affinity scoring schemes have also been devised to improve the quality of these datasets. Therefore, the challenge now is to detect meaningful as well as novel complexes from protein interaction (PPI) networks derived by combining datasets from multiple sources and by making use of these affinity scoring schemes. In tackling this challenge, the Markov Clustering algorithm (MCL) has proved to be a popular and reasonably successful method, mainly due to its scalability, robustness, and ability to work on scored (weighted) networks. However, MCL produces many noisy clusters, which either do not match known complexes or have additional proteins that reduce the accuracy of correctly predicted complexes. Results: Inspired by recent experimental observations by Gavin and colleagues on the modularity structure in yeast complexes and the distinctive properties of "core" and "attachment" proteins, we develop a core-attachment based refinement method coupled to MCL for reconstruction of yeast complexes from scored (weighted) PPI networks. We combine physical interactions from two recent "pull-down" experiments to generate an unscored PPI network. We then score this network using available affinity scoring schemes to generate multiple scored PPI networks. The evaluation of our method (called MCL-CAw) on these networks shows that: (i) MCL-CAw derives a larger number of yeast complexes, and with better accuracy, than MCL, particularly in the presence of natural noise; (ii) affinity scoring can effectively reduce the impact of noise on MCL-CAw and thereby improve the quality (precision and recall) of its predicted complexes; (iii) MCL-CAw responds well to most available scoring schemes. We discuss several instances where MCL-CAw was successful in deriving meaningful complexes, and where it missed a few proteins or whole complexes due to affinity scoring of the networks. We compare MCL-CAw with several recent complex detection algorithms on unscored and scored networks, and assess the relative performance of the algorithms on these networks. Further, we study the impact of augmenting physical datasets with computationally inferred interactions for complex detection. Finally, we analyse the essentiality of proteins within predicted complexes to understand a possible correlation between protein essentiality and their ability to form complexes. Conclusions: We demonstrate that core-attachment based refinement in MCL-CAw improves the predictions of MCL on yeast PPI networks. We show that affinity scoring improves the performance of MCL-CAw. http://deepblue.lib.umich.edu/bitstream/2027.42/78256/1/1471-2105-11-504.xml http://deepblue.lib.umich.edu/bitstream/2027.42/78256/2/1471-2105-11-504-S1.PDF http://deepblue.lib.umich.edu/bitstream/2027.42/78256/3/1471-2105-11-504-S2.ZIP http://deepblue.lib.umich.edu/bitstream/2027.42/78256/4/1471-2105-11-504.pdf Peer Reviewed

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Deep Blue Documents at the University of Michigan

ScholarBank@NUS

Qingdao Institute of Bioenergy and Bioprocess Technology, Chinese Academy of Sciences

How to identify essential genes from molecular networks?

Author: A Wagner
B Papp
B Thibert
BH Junker
CL Barret
D Koschützki
Dirk Koschützki
E Estrada
E Estrada
E Segal
E Zotenko
EA Winzeler
Gabriel del Rio
Gerardo Coello
H Jeong
H Yu
HW Ma
HW Ma
I Familii
I Lee
J Förster
JA Hanley
KM Kuhn
L Kuepfer
M Ashburner
M Kanehisa
M Vilela
MP Joy
MW Hahn
NC Duarte
R Mahadevan
S Gerdes
S Wachi
SA Becker
T Ideker
X He
Z Wunderlich
Publication venue: BioMed Central
Publication date: 01/01/2009

Background: The prediction of essential genes from molecular networks is a way to test the understanding of essentiality in the context of what is known about the network. However, current knowledge of molecular network structures is still incomplete, and consequently the strategies aimed at predicting essential genes are prone to uncertain predictions. We propose that simultaneously evaluating different network structures and different algorithms representing gene essentiality (centrality measures) may identify essential genes in networks in a reliable fashion. Results: By simultaneously analyzing 16 different centrality measures on 18 different reconstructed metabolic networks for Saccharomyces cerevisiae, we show that no single centrality measure identifies essential genes from these networks in a statistically significant way; however, the combination of at least 2 centrality measures achieves a reliable prediction of most but not all of the essential genes. No improvement is achieved in the prediction of essential genes when 3 or 4 centrality measures are combined. Conclusion: The method reported here describes a reliable procedure to predict essential genes from molecular networks. Our results show that essential genes may be predicted only by combining centrality measures, revealing the complex nature of the function of essential genes.

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

The whole-organism heavy chain B cell repertoire from Zebrafish self-organizes into distinct network features

Author: A Richard
AL Barabasi
C Berek
CE Willett
CJ Jolly
E Andersson
E Andersson
E Andersson
E Zotenko
EA Kabat
FN Papavasiliou
H Jeong
H Jeong
IR Cohen
JA Weinstein
JA Yoder
JD Hansen
JH Postlethwait
JP Rast
JY Park
LA Clark
M Girvan
M Muramatsu
N Danilova
NS Trede
NY Zheng
O Ozier
P Parham
Rotem Ben-Hamo
S Tavazoie
SH Kleinstein
SH Yook
SJ Foster
Sol Efroni
T Mora
TB Kepler
TX Liu
U Hershberg
VMA Batagelj
WB Barbazuk
XL He
Z Li
Publication venue: BioMed Central
Publication date: 01/01/2011

Crossref

Springer - Publisher Connector

PubMed Central

Why Do Hubs in the Yeast Protein Interaction Network Tend To Be Essential: Reexamining the Connection between the Network Topology and Essentiality

Author: A Fatica
AC Gavin
AC Gavin
Burkhard Rost
C von Mering
CL Myers
CM Deane
Dianne P. O'Leary
E Estrada
EJ Chesler
Elena Zotenko
G Giaever
G Palla
GT Hart
H Jeong
H Jeong
H Yu
H Yu
HB Fraser
JD Gibbons
JD Han
Julian Mestre
LC Freeman
M Hampsey
MEJ Newman
MW Hahn
NJ Krogan
NN Batada
NN Batada
P Bonacich
P Uetz
PM Kim
R Albert
R Jansen
S Coulomb
SR Collins
T Ito
T Ito
T Reguly
Teresa M. Przytycka
X He
X He
Y Ho
Publication venue: Public Library of Science
Publication date: 01/01/2008

The centrality-lethality rule, which notes that high-degree nodes in a protein interaction network tend to correspond to proteins that are essential, suggests that the topological prominence of a protein in a protein interaction network may be a good predictor of its biological importance. Even though the correlation between degree and essentiality was confirmed by many independent studies, the reason for this correlation remains elusive. Several hypotheses about putative connections between essentiality of hubs and the topology of protein–protein interaction networks have been proposed, but as we demonstrate, these explanations are not supported by the properties of protein interaction networks. To identify the main topological determinant of essentiality and to provide a biological explanation for the connection between the network topology and essentiality, we performed a rigorous analysis of six variants of the genomewide protein interaction network for Saccharomyces cerevisiae obtained using different techniques. We demonstrated that the majority of hubs are essential due to their involvement in Essential Complex Biological Modules, a group of densely connected proteins with shared biological function that are enriched in essential proteins. Moreover, we rejected two previously proposed explanations for the centrality-lethality rule, one relating the essentiality of hubs to their role in the overall network connectivity and another relying on the recently published essential protein interactions model.

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

MPG.PuRe

Identifying Causal Genes and Dysregulated Pathways in Complex Diseases

Author: A Bruna
A Chatr-aryamontri
A Keller
A Li
AA Alizadeh
BE Stranger
C Jiang
C Li
CM Perou
D Hagerstrand
D Thierry-Mieg
E Iorns
E Lee
E Yeger-Lotem
E Zotenko
EE Schadt
EE Schadt
F Arslan
F Diella
G Dennis Jr
H Mao
HY Chuang
I Ulitsky
J Fridlyand
JF Rual
K Nagasaki
L Matthews
LI Lin
LJ van 't Veer
LL Parker
M Lund-Johansen
M Newman
M Sibilia
M Sibilia
M Thompson
Markus W. Covert
O Vanunu
P Smits
PV Missiuro
R Linding
R Linding
RC Gentleman
RM Ewing
S Kerrien
S Kohler
S Peri
S Ramaswamy
S Suthram
SA Chowdhury
SK Sieberts
Stefan Wuchty
SW Doniger
Teresa M. Przytycka
TR Golub
U Stelzl
W Huang da
W Wick
X Wu
XY Wang
Y Huang
Y Kotliarov
YA Kim
Yoo-Ah Kim
Z Tu
Publication venue: Public Library of Science
Publication date: 01/01/2010

In complex diseases, various combinations of genomic perturbations often lead to the same phenotype. On a molecular level, combinations of genomic perturbations are assumed to dys-regulate the same cellular pathways. Such a pathway-centric perspective is fundamental to understanding the mechanisms of complex diseases and the identification of potential drug targets. In order to provide an integrated perspective on complex disease mechanisms, we developed a novel computational method to simultaneously identify causal genes and dys-regulated pathways. First, we identified a representative set of genes that are differentially expressed in cancer compared to non-tumor control cases. Assuming that disease-associated gene expression changes are caused by genomic alterations, we determined potential paths from such genomic causes to target genes through a network of molecular interactions. Applying our method to sets of genomic alterations and gene expression profiles of 158 Glioblastoma multiforme (GBM) patients we uncovered candidate causal genes and causal paths that are potentially responsible for the altered expression of disease genes. We discovered a set of putative causal genes that potentially play a role in the disease. Combining an expression Quantitative Trait Loci (eQTL) analysis with pathway information, our approach allowed us not only to identify potential causal genes but also to find intermediate nodes and pathways mediating the information flow between causal and target genes. Our results indicate that different genomic perturbations indeed dys-regulate the same functional pathways, supporting a pathway-centric perspective of cancer. While copy number alterations and gene expression data of glioblastoma patients provided opportunities to test our approach, our method can be applied to any disease system where genetic variations play a fundamental causal role

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

University of Miami: Scholarship Miami

Neighbor Overlap Is Enriched in the Yeast Interaction Network: Analysis and Implications

Author: A Wagner
AC Gavin
AH Tong
Anna Tramontano
Ariel Feiglin
B Errede
Byungkook Lee
C Lin
CJ Roberts
DE Levin
E Ravasz
E Zotenko
F Radicchi
G Giaever
H Frohlich
H Jeong
I Ulitsky
I Wapinski
J Xiang
JG Cook
John Moult
K Irie
K Ohkuni
K Tedford
KA Olson
L Bardwell
L Grassi
M Jimenez-Sanchez
M Kellis
M Kupiec
M Schuldiner
M Soler
MC Gustin
MF Manolson
MP Samanta
NJ Krogan
P Rice
R Albert
R Kafri
R Kelley
R Milo
R Milo
Ron Unger
S Maslov
S Pu
S Sun
SR Collins
SR Collins
T Roemer
V Cherkasova
V Spirin
X He
Y Artzy-Randrup
Y Guan
Yanay Ofran
Publication venue: Public Library of Science
Publication date: 26/06/2012

The yeast protein-protein interaction network has been shown to have distinct topological features such as a scale free degree distribution and a high level of clustering. Here we analyze an additional feature which is called Neighbor Overlap. This feature reflects the number of shared neighbors between a pair of proteins. We show that Neighbor Overlap is enriched in the yeast protein-protein interaction network compared with control networks carefully designed to match the characteristics of the yeast network in terms of degree distribution and clustering coefficient. Our analysis also reveals that pairs of proteins with high Neighbor Overlap have higher sequence similarity, more similar GO annotations and stronger genetic interactions than pairs with low ones. Finally, we demonstrate that pairs of proteins with redundant functions tend to have high Neighbor Overlap. We suggest that a combination of three mechanisms is the basis for this feature: The abundance of protein complexes, selection for backup of function, and the need to allow functional variation
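As a rough illustration of the quantity this abstract describes, the sketch below counts shared neighbors for every pair of proteins in an undirected interaction list. It assumes the raw count of common neighbors as the definition; any normalization or statistical control used in the paper is not reproduced, and the function name `neighbor_overlap` and the toy edge list are hypothetical.

```python
from itertools import combinations


def neighbor_overlap(edges):
    """Map each unordered protein pair to its number of shared neighbors."""
    adjacency = {}
    for a, b in edges:
        # Undirected network: record the interaction in both directions.
        adjacency.setdefault(a, set()).add(b)
        adjacency.setdefault(b, set()).add(a)

    overlap = {}
    for u, v in combinations(sorted(adjacency), 2):
        # The pair itself is excluded so a direct interaction is not counted.
        shared = (adjacency[u] & adjacency[v]) - {u, v}
        overlap[(u, v)] = len(shared)
    return overlap


# Toy network, not real yeast data.
toy_edges = [("A", "B"), ("A", "C"), ("B", "C"), ("B", "D"), ("C", "D")]
overlap = neighbor_overlap(toy_edges)
```

In this toy network, A and D do not interact directly yet share both B and C as neighbors, the kind of pair the abstract associates with similar annotations and redundant function.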

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

FigShare

Towards the prediction of essential genes by integration of network topology, cellular localization and biological process information

Author: A Kumar
A Saleh
AM Deutschbauer
AM Dudley
AM Gustafson
B Schwikowski
BL Ray
C Gonzalez-Aguilera
C Kingsford
C Tsurumi
D Imbeault
DE Febres
DJ Hand
E Estrada
E Zotenko
ER DeLong
G Giaever
H Jeong
H Shi
H Yu
IA Vergara
IH Witten
J Huang
J Kittler
J mei Li
J Salgado-Garrido
JA Lillo
JP Girard
JP Kastenmayer
JP Muller da Silva
JR Quinlan
JW Hyle
K Kobayashi
K Lee
L Breiman
L Breiman
LM Cullen
M Huss
M Itaya
M Seringhaus
Marcio L Acencio
MC Daugeron
MC Palumbo
N Guelzim
N Judson
N Landwehr
NC Duarte
Ney Lemke
R Kohavi
S Coulomb
S Grava
S Jager
S Visa
S Walke
S Wuchty
SM Roberts
SP Jackson
T Roemer
WK Huh
Y Freund
Publication venue: BioMed Central
Publication date: 01/01/2009

Background: The identification of essential genes is important for the understanding of the minimal requirements for cellular life and for practical purposes, such as drug design. However, the experimental techniques for essential gene discovery are labor-intensive and time-consuming. Considering these experimental constraints, a computational approach capable of accurately predicting essential genes would be of great value. We therefore present here a machine learning-based computational approach relying on network topological features, cellular localization and biological process information for prediction of essential genes. Results: We constructed a decision tree-based meta-classifier and trained it on datasets with individual and grouped attributes (network topological features, cellular compartments and biological processes) to generate various predictors of essential genes. We showed that the predictors with better performances are those generated by datasets with integrated attributes. Using the predictor with all attributes, i.e., network topological features, cellular compartments and biological processes, we obtained the best predictor of essential genes that was then used to classify yeast genes with unknown essentiality status. Finally, we generated decision trees by training the J48 algorithm on datasets with all network topological features, cellular localization and biological process information to discover cellular rules for essentiality. We found that the number of protein physical interactions, the nuclear localization of proteins and the number of regulating transcription factors are the most important factors determining gene essentiality. Conclusion: We were able to demonstrate that network topological features, cellular localization and biological process information are reliable predictors of essential genes. Moreover, by constructing decision trees based on these data, we could discover cellular rules governing essentiality.

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central