643 research outputs found
Infinite-Order Percolation and Giant Fluctuations in a Protein Interaction Network
We investigate a model protein interaction network whose links represent
interactions between individual proteins. This network evolves by the
functional duplication of proteins, supplemented by random link addition to
account for mutations. When link addition is dominant, an infinite-order
percolation transition arises as a function of the addition rate. In the
opposite limit of high duplication rate, the network exhibits giant structural
fluctuations in different realizations. For biologically-relevant growth rates,
the node degree distribution has an algebraic tail with a peculiar rate
dependence for the associated exponent.Comment: 4 pages, 2 figures, 2 column revtex format, to be submitted to PRL 1;
reference added and minor rewording of the first paragraph; Title change and
major reorganization (but no result changes) in response to referee comments;
to be published in PR
A Re-Annotation of the Saccharomyces Cerevisiae Genome
Discrepancies in gene and orphan number indicated by previous analyses suggest that
S. cerevisiae would benefit from a consistent re-annotation. In this analysis three new genes
are identified and 46 alterations to gene coordinates are described. 370 ORFs are defined
as totally spurious ORFs which should be disregarded. At least a further 193 genes could
be described as very hypothetical, based on a number of criteria.
It was found that disparate genes with sequence overlaps over ten amino acids (especially
at the N-terminus) are rare in both S. cerevisiae and Sz. pombe. A new S. cerevisiae gene
number estimate with an upper limit of 5804 is proposed, but after the removal of very
hypothetical genes and pseudogenes this is reduced to 5570. Although this is likely to be
closer to the true upper limit, it is still predicted to be an overestimate of gene number. A
complete list of revised gene coordinates is available from the Sanger Centre (S. cerevisiae
reannotation: ftp://ftp/pub/yeast/SCreannotation)
Population genomics of domestic and wild yeasts
The natural genetics of an organism is determined by the distribution of sequences of its genome. Here we present one- to four-fold, with some deeper, coverage of the genome sequences of over seventy isolates of the domesticated baker's yeast, _Saccharomyces cerevisiae_, and its closest relative, the wild _S. paradoxus_, which has never been associated with human activity. These were collected from numerous geographic locations and sources (including wild, clinical, baking, wine, laboratory and food spoilage). These sequences provide an unprecedented view of the population structure, natural (and artificial) selection and genome evolution in these species. Variation in gene content, SNPs, indels, copy numbers and transposable elements provide insights into the evolution of different lineages. Phenotypic variation broadly correlates with global genome-wide phylogenetic relationships however there is no correlation with source. _S. paradoxus_ populations are well delineated along geographic boundaries while the variation among worldwide _S. cerevisiae_ isolates show less differentiation and is comparable to a single _S. paradoxus_ population. Rather than one or two domestication events leading to the extant baker's yeasts, the population structure of _S. cerevisiae_ shows a few well defined geographically isolated lineages and many different mosaics of these lineages, supporting the notion that human influence provided the opportunity for outbreeding and production of new combinations of pre-existing variation
NEAT: An efficient network enrichment analysis test
Background: Network enrichment analysis is a powerful method, which allows to integrate gene enrichment analysis with the information on relationships between genes that is provided by gene networks. Existing tests for network enrichment analysis deal only with undirected networks, they can be computationally slow and are based on normality assumptions. Results: We propose NEAT, a test for network enrichment analysis. The test is based on the hypergeometric distribution, which naturally arises as the null distribution in this context. NEAT can be applied not only to undirected, but to directed and partially directed networks as well. Our simulations indicate that NEAT is considerably faster than alternative resampling-based methods, and that its capacity to detect enrichments is at least as good as the one of alternative tests. We discuss applications of NEAT to network analyses in yeast by testing for enrichment of the Environmental Stress Response target gene set with GO Slim and KEGG functional gene sets, and also by inspecting associations between functional sets themselves. Conclusions: NEAT is a flexible and efficient test for network enrichment analysis that aims to overcome some limitations of existing resampling-based tests. The method is implemented in the R package neat, which can be freely downloaded from CRAN ( https://cran.r-project.org/package=neat )
Control of glycolytic dynamics by hexose transport in Saccharomyces cerevisiae
AbstractIt is becoming accepted that steady-state fluxes are not necessarily controlled by single rate-limiting steps. This leaves open the issue whether cellular dynamics are controlled by single pacemaker enzymes, as has often been proposed. This paper shows that yeast sugar transport has substantial but not complete control of the frequency of glycolytic oscillations. Addition of maltose, a competitive inhibitor of glucose transport, reduced both average glucose consumption flux and frequency of glycolytic oscillations. Assuming a single kinetic component and a symmetrical carrier, a frequency control coefficient of between 0.4 and 0.6 and an average-flux control coefficient of between 0.6 and 0.9 were calculated for hexose transport activity. In a second approach, mannose was used as the carbon and free-energy source, and the dependencies on the extracellular mannose concentration of the transport activity, of the frequency of oscillations, and of the average flux were compared. In this case the frequency control coefficient and the average-flux control coefficient of hexose transport activity amounted to 0.7 and 0.9, respectively. From these results, we conclude that 1) transport is highly important for the dynamics of glycolysis, 2) most but not all control resides in glucose transport, and 3) there should at least be one step other than transport with substantial control
Global investigation of protein–protein interactions in yeast Saccharomyces cerevisiae using re-occurring short polypeptide sequences
Protein–protein interaction (PPI) maps provide insight into cellular biology and have received considerable attention in the post-genomic era. While large-scale experimental approaches have generated large collections of experimentally determined PPIs, technical limitations preclude certain PPIs from detection. Recently, we demonstrated that yeast PPIs can be computationally predicted using re-occurring short polypeptide sequences between known interacting protein pairs. However, the computational requirements and low specificity made this method unsuitable for large-scale investigations. Here, we report an improved approach, which exhibits a specificity of ∼99.95% and executes 16 000 times faster. Importantly, we report the first all-to-all sequence-based computational screen of PPIs in yeast, Saccharomyces cerevisiae in which we identify 29 589 high confidence interactions of ∼2 × 107 possible pairs. Of these, 14 438 PPIs have not been previously reported and may represent novel interactions. In particular, these results reveal a richer set of membrane protein interactions, not readily amenable to experimental investigations. From the novel PPIs, a novel putative protein complex comprised largely of membrane proteins was revealed. In addition, two novel gene functions were predicted and experimentally confirmed to affect the efficiency of non-homologous end-joining, providing further support for the usefulness of the identified PPIs in biological investigations
Inferring topology from clustering coefficients in protein-protein interaction networks
BACKGROUND: Although protein-protein interaction networks determined with high-throughput methods are incomplete, they are commonly used to infer the topology of the complete interactome. These partial networks often show a scale-free behavior with only a few proteins having many and the majority having only a few connections. Recently, the possibility was suggested that this scale-free nature may not actually reflect the topology of the complete interactome but could also be due to the error proneness and incompleteness of large-scale experiments. RESULTS: In this paper, we investigate the effect of limited sampling on average clustering coefficients and how this can help to more confidently exclude possible topology models for the complete interactome. Both analytical and simulation results for different network topologies indicate that partial sampling alone lowers the clustering coefficient of all networks tremendously. Furthermore, we extend the original sampling model by also including spurious interactions via a preferential attachment process. Simulations of this extended model show that the effect of wrong interactions on clustering coefficients depends strongly on the skewness of the original topology and on the degree of randomness of clustering coefficients in the corresponding networks. CONCLUSION: Our findings suggest that the complete interactome is either highly skewed such as e.g. in scale-free networks or is at least highly clustered. Although the correct topology of the interactome may not be inferred beyond any reasonable doubt from the interaction networks available, a number of topologies can nevertheless be excluded with high confidence
Draft genome sequence of Wickerhamomyces anomalus LBCM1105, isolated from cachaça fermentation
Wickerhamomyces anomalus LBCM1105 is a yeast isolated from cachaça distillery fermentation vats, notable for exceptional glycerol consumption ability. We report its draft genome with 20.5x in-depth coverage and around 90% extension and completeness. It harbors the sequences of proteins involved in glycerol transport and metabolism.The authors gratefully acknowledge Laboratorio Nacional de Ciencia e Tecnologia do Bioetanol (CTBE) and the Centro Nacional de Pesquisa em Energia e Materiais (CNPEM) for support with the sequencing of LBCM1105. This work was supported by CAPES/Brazil (PNPD 2755/2011; PCF-PVE 021/2012), by CNPq (Brazil), processes 304815/2012 (research grant) and 305135/2015-5, and by AUXPE-PVES 1801/2012 (Process 23038.015294/2016-18) from Brazilian Government and by UFOP. C.L. is supported by the strategic program UID/BIA/04050/2013 [POCI-01-0145-FEDER-007569] funded by national funds through the FCT I.P. and by the ERDF through the COMPETE2020 - Programa Operacional de Competitividade e Internacionalizacao (POCI). DMRP is a fellow from the CNPq (Conselho Nacional de Desenvolvimento Cientifico e Tecnologico) - Brazil (310080/2018-5)
- …