    Molecular Clock on a Neutral Network

    The number of fixed mutations accumulated in an evolving population often displays a variance that is significantly larger than the mean (the overdispersed molecular clock). By examining a generic evolutionary process on a neutral network of high-fitness genotypes, we establish a formalism for computing all cumulants of the full probability distribution of accumulated mutations in terms of graph properties of the neutral network, and use the formalism to prove overdispersion of the molecular clock. We further show that significant overdispersion arises naturally in evolution when the neutral network is highly sparse, exhibits large global fluctuations in neutrality, and small local fluctuations in neutrality. The results are also relevant for elucidating the topological structure of a neutral network from empirical measurements of the substitution process. Comment: 10 pages
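
    Overdispersion of this kind is commonly summarized by the index of dispersion, the ratio of the variance to the mean of the substitution counts, which equals 1 for a Poisson clock. The following is a minimal sketch of that ratio only, not the cumulant formalism of the paper; the function name and the counts are hypothetical.

```python
from statistics import mean, variance


def dispersion_index(counts):
    """Return the sample variance divided by the mean of substitution counts.

    A value near 1 is what a Poisson clock would give; values well above 1
    indicate an overdispersed clock.
    """
    m = mean(counts)
    if m == 0:
        raise ValueError("mean substitution count is zero")
    return variance(counts) / m


# Hypothetical substitution counts per lineage (illustrative, not real data).
regular_counts = [4, 5, 6, 5, 4, 6]
bursty_counts = [0, 1, 12, 0, 14, 3]

print(dispersion_index(regular_counts))
print(dispersion_index(bursty_counts))
```

    Both hypothetical samples have the same mean, so the difference in the printed ratios comes entirely from the spread of the counts.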

    Lack of self-averaging in neutral evolution of proteins

    We simulate neutral evolution of proteins imposing conservation of the thermodynamic stability of the native state in the framework of an effective model of folding thermodynamics. This procedure generates evolutionary trajectories in sequence space which share two universal features for all of the examined proteins. First, the number of neutral mutations fluctuates broadly from one sequence to another, leading to a non-Poissonian substitution process. Second, the number of neutral mutations displays strong correlations along the trajectory, thus causing the breakdown of self-averaging of the resulting evolutionary substitution process. Comment: 4 pages, 2 figures

    Genome landscapes and bacteriophage codon usage

    Across all kingdoms of biological life, protein-coding genes exhibit unequal usage of synonymous codons. Although alternative theories abound, translational selection has been accepted as an important mechanism that shapes the patterns of codon usage in prokaryotes and simple eukaryotes. Here we analyze patterns of codon usage across 74 diverse bacteriophages that infect E. coli, P. aeruginosa and L. lactis as their primary host. We introduce the concept of a 'genome landscape,' which helps reveal non-trivial, long-range patterns in codon usage across a genome. We develop a series of randomization tests that allow us to interrogate the significance of one aspect of codon usage, such as GC content, while controlling for another aspect, such as adaptation to host-preferred codons. We find that 33 phage genomes exhibit highly non-random patterns in their GC3-content, use of host-preferred codons, or both. We show that the head and tail proteins of these phages exhibit significant bias towards host-preferred codons, relative to the non-structural phage proteins. Our results support the hypothesis of translational selection on viral genes for host-preferred codons, over a broad range of bacteriophages. Comment: 9 Color Figures, 5 Tables, 53 References
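
    GC3 content, one of the quantities named above, is the fraction of codons whose third position is G or C. The sketch below computes it for a single coding sequence; it does not reproduce the genome landscapes or randomization tests of the paper, and the function name and example sequence are hypothetical.

```python
def gc3_content(cds):
    """Return the fraction of codons in `cds` whose third base is G or C.

    `cds` is assumed to be an in-frame coding sequence; trailing bases that
    do not complete a codon are ignored.
    """
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3   # drop an incomplete final codon
    third_bases = cds[2:usable:3]      # every third base, starting at index 2
    if not third_bases:
        raise ValueError("sequence contains no complete codon")
    return sum(base in "GC" for base in third_bases) / len(third_bases)


# Hypothetical three-codon sequence: ATG, GCC, AAA.
print(gc3_content("ATGGCCAAA"))
```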

    Evolution favors protein mutational robustness in sufficiently large populations

    BACKGROUND: An important question is whether evolution favors properties such as mutational robustness or evolvability that do not directly benefit any individual, but can influence the course of future evolution. Functionally similar proteins can differ substantially in their robustness to mutations and capacity to evolve new functions, but it has remained unclear whether any of these differences might be due to evolutionary selection for these properties. RESULTS: Here we use laboratory experiments to demonstrate that evolution favors protein mutational robustness if the evolving population is sufficiently large. We neutrally evolve cytochrome P450 proteins under identical selection pressures and mutation rates in populations of different sizes, and show that proteins from the larger and thus more polymorphic population tend towards higher mutational robustness. Proteins from the larger population also evolve greater stability, a biophysical property that is known to enhance both mutational robustness and evolvability. The excess mutational robustness and stability are well described by existing mathematical theories, and can be quantitatively related to the way that the proteins occupy their neutral network. CONCLUSIONS: Our work is the first experimental demonstration of the general tendency of evolution to favor mutational robustness and protein stability in highly polymorphic populations. We suggest that this phenomenon may contribute to the mutational robustness and evolvability of viruses and bacteria that exist in large populations.

    Natural History, Microbes and Sequences: Shouldn't We Look Back Again to Organisms?

    The discussion on the existence of prokaryotic species is reviewed. The demonstration that several different mechanisms of genetic exchange and recombination exist has led some to a radical rejection of the possibility of bacterial species and, in general, the applicability of traditional classification categories to the prokaryotic domains. However, in spite of intense gene traffic, prokaryotic groups are not continuously variable but form discrete clusters of phenotypically coherent, well-defined, diagnosable groups of individual organisms. Molecularization of life sciences has led to biased approaches to the issue of the origins of biodiversity, which has resulted in the increasingly extended tendency to emphasize genes and sequences and not give proper attention to organismal biology. As argued here, molecular and organismal approaches should be seen as complementary and not opposed views of biology.

    Genomic and SNP Analyses Demonstrate a Distant Separation of the Hospital and Community-Associated Clades of Enterococcus faecium

    Recent studies have pointed to the existence of two subpopulations of Enterococcus faecium, one containing primarily commensal/community-associated (CA) strains and one that contains most clinical or hospital-associated (HA) strains, including those classified by multi-locus sequence typing (MLST) as belonging to the CC17 group. The HA subpopulation more frequently has IS16, pathogenicity island(s), and plasmids or genes associated with antibiotic resistance, colonization, and/or virulence. Supporting the two clades concept, we previously found a 3–10% difference between four genes from HA-clade strains vs. CA-clade strains, including 5% difference between pbp5-R of ampicillin-resistant, HA strains and pbp5-S of ampicillin-sensitive, CA strains. To further investigate the core genome of these subpopulations, we studied 100 genes from 21 E. faecium genome sequences; our analyses of concatenated sequences, SNPs, and individual genes all identified two distinct groups. With the concatenated sequence, HA-clade strains differed by 0–1% from one another while CA clade strains differed from each other by 0–1.1%, with 3.5–4.2% difference between the two clades. While many strains had a few genes that grouped in one clade with most of their genes in the other clade, one strain had 28% of its genes in the CA clade and 72% in the HA clade, consistent with the predicted role of recombination in the evolution of E. faecium. Using estimates for Escherichia coli, molecular clock calculations using sSNP analysis indicate that these two clades may have diverged ≥1 million years ago or, using the higher mutation rate for Bacillus anthracis, ∼300,000 years ago. These data confirm the existence of two clades of E. faecium and show that the differences between the HA and CA clades occur at the core genomic level and long preceded the modern antibiotic era.
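
    A strict molecular clock converts a pairwise divergence into a time through T = d / (2μ), where d is the divergence per site between two lineages and μ is the substitution rate per site per year along each lineage. The sketch below illustrates only this generic relation. The abstract does not state the rates or the synonymous divergence that were used, so the numbers here are placeholders rather than the authors' values, and the function name is hypothetical.

```python
def divergence_time_years(divergence_per_site, rate_per_site_per_year):
    """Return the strict-clock divergence time T = d / (2 * mu).

    The factor of 2 accounts for substitutions accumulating independently
    along both lineages since their common ancestor.
    """
    if rate_per_site_per_year <= 0:
        raise ValueError("substitution rate must be positive")
    return divergence_per_site / (2 * rate_per_site_per_year)


# Placeholder inputs (assumptions for illustration, not values from the study).
example_divergence = 0.02   # substitutions per site between two lineages
example_rate = 1e-8         # substitutions per site per year

print(divergence_time_years(example_divergence, example_rate))
```

    Because T scales inversely with μ, a higher assumed rate gives a proportionally more recent divergence, which is why the two rate choices mentioned in the abstract lead to different dates.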

    The Evolution of Respiratory Chain Complex I from a Smaller Last Common Ancestor Consisting of 11 Protein Subunits

    The NADH:quinone oxidoreductase (complex I) has evolved from a combination of smaller functional building blocks. Chloroplasts and cyanobacteria contain a complex I-like enzyme having only 11 subunits. This enzyme lacks the N-module which harbors the NADH binding site and the flavin and iron–sulfur cluster prosthetic groups. A complex I-homologous enzyme found in some archaea contains an F420 dehydrogenase subunit denoted as FpoF rather than the N-module. In the present study, all currently available whole genome sequences were used to survey the occurrence of the different types of complex I in the different kingdoms of life. Notably, the 11-subunit version of complex I was found to be widely distributed, both in the archaeal and in the eubacterial kingdoms, whereas the 14-subunit classical complex I was found only in certain eubacterial phyla. The FpoF-containing complex I was present in Euryarchaeota but not in Crenarchaeota, which contained the 11-subunit complex I. The 11-subunit enzymes showed a primary sequence variability as great or greater than the full-size 14-subunit complex I, but differed distinctly from the membrane-bound hydrogenases. We conclude that this type of compact 11-subunit complex I is ancestral to all present-day complex I enzymes. No designated partner protein, acting as an electron delivery device, could be found for the compact version of complex I. We propose that the primordial complex I, and many of the present-day 11-subunit versions of it, operate without a designated partner protein but are capable of interaction with several different electron donor or acceptor proteins.

    Dating Phylogenies with Hybrid Local Molecular Clocks

    BACKGROUND: Because rates of evolution and species divergence times cannot be estimated directly from molecular data, all current dating methods require that specific assumptions be made before inferring any divergence time. These assumptions typically bear either on rates of molecular evolution (molecular clock hypothesis, local clock models) or on both rates and times (penalized likelihood, Bayesian methods). However, most of these assumptions can affect estimated dates, oftentimes because they underestimate large amounts of rate change. PRINCIPAL FINDINGS: A significant modification to a recently proposed ad hoc rate-smoothing algorithm is described, in which local molecular clocks are automatically placed on a phylogeny. This modification makes use of hybrid approaches that borrow from recent theoretical developments in microarray data analysis. An ad hoc integration of phylogenetic uncertainty under these local clock models is also described. The performance and accuracy of the new methods are evaluated by reanalyzing three published data sets. CONCLUSIONS: It is shown that the new maximum likelihood hybrid methods can perform better than penalized likelihood and almost as well as uncorrelated Bayesian models. However, the new methods still tend to underestimate the actual amount of rate change. This work demonstrates the difficulty of estimating divergence times using local molecular clocks.