276 research outputs found
Lack of self-averaging in neutral evolution of proteins
We simulate neutral evolution of proteins imposing conservation of the
thermodynamic stability of the native state in the framework of an effective
model of folding thermodynamics. This procedure generates evolutionary
trajectories in sequence space which share two universal features for all of
the examined proteins. First, the number of neutral mutations fluctuates
broadly from one sequence to another, leading to a non-Poissonian substitution
process. Second, the number of neutral mutations displays strong correlations
along the trajectory, thus causing the breakdown of self-averaging of the
resulting evolutionary substitution process.Comment: 4 pages, 2 figure
Relative Contributions of Intrinsic StructuralβFunctional Constraints and Translation Rate to the Evolution of Protein-Coding Genes
A long-standing assumption in evolutionary biology is that the evolution rate of protein-coding genes depends, largely, on specific constraints that affect the function of the given protein. However, recent research in evolutionary systems biology revealed unexpected, significant correlations between evolution rate and characteristics of genes or proteins that are not directly related to specific protein functions, such as expression level and proteinβprotein interactions. The strongest connections were consistently detected between protein sequence evolution rate and the expression level of the respective gene. A recent genome-wide proteomic study revealed an extremely strong correlation between the abundances of orthologous proteins in distantly related animals, the nematode Caenorhabditis elegans and the fruit fly Drosophila melanogaster. We used the extensive protein abundance data from this study along with short-term evolutionary rates (ERs) of orthologous genes in nematodes and flies to estimate the relative contributions of structuralβfunctional constraints and the translation rate to the evolution rate of protein-coding genes. Together the intrinsic constraints and translation rate account for approximately 50% of the variance of the ERs. The contribution of constraints is estimated to be 3- to 5-fold greater than the contribution of translation rate
H2r: Identification of evolutionary important residues by means of an entropy based analysis of multiple sequence alignments
BACKGROUND: A multiple sequence alignment (MSA) generated for a protein can be used to characterise residues by means of a statistical analysis of single columns. In addition to the examination of individual positions, the investigation of co-variation of amino acid frequencies offers insights into function and evolution of the protein and residues. RESULTS: We introduce conn(k), a novel parameter for the characterisation of individual residues. For each residue k, conn(k) is the number of most extreme signals of co-evolution. These signals were deduced from a normalised mutual information (MI) value U(k, l) computed for all pairs of residues k, l. We demonstrate that conn(k) is a more robust indicator than an individual MI-value for the prediction of residues most plausibly important for the evolution of a protein. This proposition was inferred by means of statistical methods. It was further confirmed by the analysis of several proteins. A server, which computes conn(k)-values is available at http://www-bioinf.uni-regensburg.de. CONCLUSION: The algorithms H2r, which analyses MSAs and computes conn(k)-values, characterises a specific class of residues. In contrast to strictly conserved ones, these residues possess some flexibility in the composition of side chains. However, their allocation is sensibly balanced with several other positions, as indicated by conn(k)
Energetic Selection of Topology in Ferredoxins
Models of early protein evolution posit the existence of short peptides that bound metals and ions and served as transporters, membranes or catalysts. The Cys-X-X-Cys-X-X-Cys heptapeptide located within bacterial ferredoxins, enclosing an Fe4S4 metal center, is an attractive candidate for such an early peptide. Ferredoxins are ancient proteins and the simple Ξ±+Ξ² fold is found alone or as a domain in larger proteins throughout all three kingdoms of life. Previous analyses of the heptapeptide conformation in experimentally determined ferredoxin structures revealed a pervasive right-handed topology, despite the fact that the Fe4S4 cluster is achiral. Conformational enumeration of a model CGGCGGC heptapeptide bound to a cubane iron-sulfur cluster indicates both left-handed and right-handed folds could exist and have comparable stabilities. However, only the natural ferredoxin topology provides a significant network of backbone-to-cluster hydrogen bonds that would stabilize the metal-peptide complex. The optimal peptide configuration (alternating Ξ±L,Ξ±R) is that of an Ξ±-sheet, providing an additional mechanism where oligomerization could stabilize the peptide and facilitate iron-sulfur cluster binding
Two Novel Parvoviruses in Frugivorous New and Old World Bats
Bats, a globally distributed group of mammals with high ecological importance, are increasingly recognized as natural reservoir hosts for viral agents of significance to human and animal health. In the present study, we evaluated pools of blood samples obtained from two phylogenetically distant bat families, in particular from flying foxes (Pteropodidae), Eidolon helvum in West Africa, and from two species of New World leaf-nosed fruit bats (Phyllostomidae), Artibeus jamaicensis and Artibeus lituratus in Central America. A sequence-independent virus discovery technique (VIDISCA) was used in combination with high throughput sequencing to detect two novel parvoviruses: a PARV4-like virus named Eh-BtPV-1 in Eidolon helvum from Ghana and the first member of a putative new genus in Artibeus jamaicensis from Panama (Aj-BtPV-1). Those viruses were circulating in the corresponding bat colony at rates of 7β8%. Aj-BtPV-1 was also found in Artibeus lituratus (5.5%). Both viruses were detected in the blood of infected animals at high concentrations: up to 10E8 and to 10E10 copies/ml for Aj-BtPV-1 and Eh-BtPV-1 respectively. Eh-BtPV-1 was additionally detected in all organs collected from bats (brain, lungs, liver, spleen, kidneys and intestine) and spleen and kidneys were identified as the most likely sites where viral replication takes place. Our study shows that bat parvoviruses share common ancestors with known parvoviruses of humans and livestock. We also provide evidence that a variety of Parvovirinae are able to cause active infection in bats and that they are widely distributed in these animals with different geographic origin, ecologies and climatic ranges
Medicago truncatula contains a second gene encoding a plastid located glutamine synthetase exclusively expressed in developing seeds
<p>Abstract</p> <p>Background</p> <p>Nitrogen is a crucial nutrient that is both essential and rate limiting for plant growth and seed production. Glutamine synthetase (GS), occupies a central position in nitrogen assimilation and recycling, justifying the extensive number of studies that have been dedicated to this enzyme from several plant sources. All plants species studied to date have been reported as containing a single, nuclear gene encoding a plastid located GS isoenzyme per haploid genome. This study reports the existence of a second nuclear gene encoding a plastid located GS in <it>Medicago truncatula</it>.</p> <p>Results</p> <p>This study characterizes a new, second gene encoding a plastid located glutamine synthetase (GS2) in <it>M. truncatula</it>. The gene encodes a functional GS isoenzyme with unique kinetic properties, which is exclusively expressed in developing seeds. Based on molecular data and the assumption of a molecular clock, it is estimated that the gene arose from a duplication event that occurred about 10 My ago, after legume speciation and that duplicated sequences are also present in closely related species of the Vicioide subclade. Expression analysis by RT-PCR and western blot indicate that the gene is exclusively expressed in developing seeds and its expression is related to seed filling, suggesting a specific function of the enzyme associated to legume seed metabolism. Interestingly, the gene was found to be subjected to alternative splicing over the first intron, leading to the formation of two transcripts with similar open reading frames but varying 5' UTR lengths, due to retention of the first intron. To our knowledge, this is the first report of alternative splicing on a plant GS gene.</p> <p>Conclusions</p> <p>This study shows that <it>Medicago truncatula </it>contains an additional GS gene encoding a plastid located isoenzyme, which is functional and exclusively expressed during seed development. Legumes produce protein-rich seeds requiring high amounts of nitrogen, we postulate that this gene duplication represents a functional innovation of plastid located GS related to storage protein accumulation exclusive to legume seed metabolism.</p
Integration of Evolutionary Features for the Identification of Functionally Important Residues in Major Facilitator Superfamily Transporters
The identification of functionally important residues is an important challenge for understanding the molecular mechanisms of proteins. Membrane protein transporters operate two-state allosteric conformational changes using functionally important cooperative residues that mediate long-range communication from the substrate binding site to the translocation pathway. In this study, we identified functionally important cooperative residues of membrane protein transporters by integrating sequence conservation and co-evolutionary information. A newly derived evolutionary feature, the co-evolutionary coupling number, was introduced to measure the connectivity of co-evolving residue pairs and was integrated with the sequence conservation score. We tested this method on three Major Facilitator Superfamily (MFS) transporters, LacY, GlpT, and EmrD. MFS transporters are an important family of membrane protein transporters, which utilize diverse substrates, catalyze different modes of transport using unique combinations of functional residues, and have enough characterized functional residues to validate the performance of our method. We found that the conserved cores of evolutionarily coupled residues are involved in specific substrate recognition and translocation of MFS transporters. Furthermore, a subset of the residues forms an interaction network connecting functional sites in the protein structure. We also confirmed that our method is effective on other membrane protein transporters. Our results provide insight into the location of functional residues important for the molecular mechanisms of membrane protein transporters
Advantages of a Mechanistic Codon Substitution Model for Evolutionary Analysis of Protein-Coding Sequences
A mechanistic codon substitution model, in which each codon substitution rate is proportional to the product of a codon mutation rate and the average fixation probability depending on the type of amino acid replacement, has advantages over nucleotide, amino acid, and empirical codon substitution models in evolutionary analysis of protein-coding sequences. It can approximate a wide range of codon substitution processes. If no selection pressure on amino acids is taken into account, it will become equivalent to a nucleotide substitution model. If mutation rates are assumed not to depend on the codon type, then it will become essentially equivalent to an amino acid substitution model. Mutation at the nucleotide level and selection at the amino acid level can be separately evaluated.The present scheme for single nucleotide mutations is equivalent to the general time-reversible model, but multiple nucleotide changes in infinitesimal time are allowed. Selective constraints on the respective types of amino acid replacements are tailored to each gene in a linear function of a given estimate of selective constraints. Their good estimates are those calculated by maximizing the respective likelihoods of empirical amino acid or codon substitution frequency matrices. Akaike and Bayesian information criteria indicate that the present model performs far better than the other substitution models for all five phylogenetic trees of highly-divergent to highly-homologous sequences of chloroplast, mitochondrial, and nuclear genes. It is also shown that multiple nucleotide changes in infinitesimal time are significant in long branches, although they may be caused by compensatory substitutions or other mechanisms. The variation of selective constraint over sites fits the datasets significantly better than variable mutation rates, except for 10 slow-evolving nuclear genes of 10 mammals. An critical finding for phylogenetic analysis is that assuming variable mutation rates over sites lead to the overestimation of branch lengths
- β¦