Search CORE

75 research outputs found

Fluctuations in Mass-Action Equilibrium of Protein Binding Networks

Author: B. Widom
Dylan Walker
Koon-Kiu Yan
Sergei Maslov
Publication venue: 'American Physical Society (APS)'
Publication date: 10/03/2008
Field of study

We consider two types of fluctuations in the mass-action equilibrium in protein binding networks. The first type is driven by relatively slow changes in total concentrations (copy numbers) of interacting proteins. The second type, to which we refer to as spontaneous, is caused by quickly decaying thermodynamic deviations away from the equilibrium of the system. As such they are amenable to methods of equilibrium statistical mechanics used in our study. We investigate the effects of network connectivity on these fluctuations and compare them to their upper and lower bounds. The collective effects are shown to sometimes lead to large power-law distributed amplification of spontaneous fluctuations as compared to the expectation for isolated dimers. As a consequence of this, the strength of both types of fluctuations is positively correlated with the overall network connectivity of proteins forming the complex. On the other hand, the relative amplitude of fluctuations is negatively correlated with the abundance of the complex. Our general findings are illustrated using a real network of protein-protein interactions in baker's yeast with experimentally determined protein concentrations.Comment: 4 pages, 3 figure

arXiv.org e-Print Archive

Crossref

Parameters of proteome evolution from histograms of amino-acid sequence identities of paralogous proteins

Author: Axelsen Jacob Bock
Maslov Sergei
Yan Koon-Kiu
Publication venue: BioMed Central
Publication date: 01/01/2007
Field of study

Background: The evolution of the full repertoire of proteins encoded in a given genome is mostly driven by gene duplications, deletions, and sequence modifications of existing proteins. Indirect information about relative rates and other intrinsic parameters of these three basic processes is contained in the proteome-wide distribution of sequence identities of pairs of paralogous proteins. Results: We introduce a simple mathematical framework based on a stochastic birth-and-death model that allows one to extract some of this information and apply it to the set of all pairs of paralogous proteins in H. pylori, E. coli, S. cerevisiae, C. elegans, D. melanogaster, and H. sapiens. It was found that the histogram of sequence identities p generated by an all-to-all alignment of all protein sequences encoded in a genome is well fitted with a power-law form ∼ p−γ with the value of the exponent γ around 4 for the majority of organisms used in this study. This implies that the intra-protein variability of substitution rates is best described by the Gamma-distribution with the exponent α ≈ 0.33. Different features of the shape of such histograms allow us to quantify the ratio between the genome-wide average deletion/duplication rates and the amino-acid substitution rate. 1 Conclusions: We separately measure the short-term (“raw”) duplication and deletion rates r ∗ dup, r ∗ del whic

arXiv.org e-Print Archive

CiteSeerX

Crossref

Directory of Open Access Journals

PubMed Central

Copenhagen University Research Information System

Upstream plasticity and downstream robustness in evolution of molecular networks

Author: Eriksen Kasper Astrup
Maslov Sergei
Sneppen Kim
Yan Koon-Kiu
Publication venue: BioMed Central
Publication date: 01/01/2004
Field of study

BACKGROUND: Gene duplication followed by the functional divergence of the resulting pair of paralogous proteins is a major force shaping molecular networks in living organisms. Recent species-wide data for protein-protein interactions and transcriptional regulations allow us to assess the effect of gene duplication on robustness and plasticity of these molecular networks. RESULTS: We demonstrate that the transcriptional regulation of duplicated genes in baker's yeast Saccharomyces cerevisiae diverges fast so that on average they lose 3% of common transcription factors for every 1% divergence of their amino acid sequences. The set of protein-protein interaction partners of their protein products changes at a slower rate exhibiting a broad plateau for amino acid sequence similarity above 70%. The stability of functional roles of duplicated genes at such relatively low sequence similarity is further corroborated by their ability to substitute for each other in single gene knockout experiments in yeast and RNAi experiments in a nematode worm Caenorhabditis elegans. We also quantified the divergence rate of physical interaction neighborhoods of paralogous proteins in a bacterium Helicobacter pylori and a fly Drosophila melanogaster. However, in the absence of system-wide data on transcription factors' binding in these organisms we could not compare this rate to that of transcriptional regulation of duplicated genes. CONCLUSIONS: For all molecular networks studied in this work we found that even the most distantly related paralogous proteins with amino acid sequence identities around 20% on average have more similar positions within a network than a randomly selected pair of proteins. For yeast we also found that the upstream regulation of genes evolves more rapidly than downstream functions of their protein products. This is in accordance with a view which puts regulatory changes as one of the main driving forces of the evolution. In this context a very important open question is to what extent our results obtained for homologous genes within a single species (paralogs) carries over to homologous proteins in different species (orthologs)

Lund University Publications

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Copenhagen University Research Information System

Improved Reconstruction of In Silico Gene Regulatory Networks by Integrating Knockout and Perturbation Data

Author: Kevin Y. Yip
Koon-Kiu Yan
Magnus Rattray
Mark Gerstein
Roger P. Alexander
Publication venue: Public Library of Science
Publication date: 01/01/2010
Field of study

We performed computational reconstruction of the in silico gene regulatory networks in the DREAM3 Challenges. Our task was to learn the networks from two types of data, namely gene expression profiles in deletion strains (the ‘deletion data’) and time series trajectories of gene expression after some initial perturbation (the ‘perturbation data’). In the course of developing the prediction method, we observed that the two types of data contained different and complementary information about the underlying network. In particular, deletion data allow for the detection of direct regulatory activities with strong responses upon the deletion of the regulator while perturbation data provide richer information for the identification of weaker and more complex types of regulation. We applied different techniques to learn the regulation from the two types of data. For deletion data, we learned a noise model to distinguish real signals from random fluctuations using an iterative method. For perturbation data, we used differential equations to model the change of expression levels of a gene along the trajectories due to the regulation of other genes. We tried different models, and combined their predictions. The final predictions were obtained by merging the results from the two types of data. A comparison with the actual regulatory networks suggests that our approach is effective for networks with a range of different sizes. The success of the approach demonstrates the importance of integrating heterogeneous data in network reconstruction

CiteSeerX

Public Library of Science (PLOS)

Crossref

PubMed Central

An Approach for Determining and Measuring Network Hierarchy Applied to Comparing the Phosphorylome and the Regulome

Author: Andrews Erik
Cheng Chao
Gerstein Mark
Ung Matthew
Wang Daifeng
Yan Koon-Kiu
Publication venue: Dartmouth Digital Commons
Publication date: 01/01/2015
Field of study

Many biological networks naturally form a hierarchy with a preponderance of downward information flow. In this study, we define a score to quantify the degree of hierarchy in a network and develop a simulated-annealing algorithm to maximize the hierarchical score globally over a network. We apply our algorithm to determine the hierarchical structure of the phosphorylome in detail and investigate the correlation between its hierarchy and kinase properties. We also compare it to the regulatory network, finding that the phosphorylome is more hierarchical than the regulome

PubMed Central

Dartmouth Digital Commons (Dartmouth College)

Detecting modules in multiplex networks – an application for integrating expression profiles across multiple species

Author: Gerstein Mark
Pei Baikang
Rozowsky Joel
Wang Daifeng
Yan Koon-Kiu
Zheng Henry
Publication venue: EliScholar – A Digital Platform for Scholarly Publishing at Yale
Publication date: 19/09/2013
Field of study

Multiplex network, a set of networks linked through interconnected layers, is a useful mathematical framework for data integration. Here, we present a general method to detect modules in multiplex networks and apply it in a specific biological context: to simultaneously cluster the genome-wide expression profiles of C. elegans and D. melanogaster generated by the ENOCDE and modENCODE consortia. The method revealed modules that are fundamentally cross-species and can either be conserved or species-specific. In general, the method could be applied in various contexts like the integration of different social networks

Yale University