Search CORE

MPG.PuRe

Relative Contributions of Intrinsic Structural–Functional Constraints and Translation Rate to the Evolution of Protein-Coding Genes

Author: Altschul
Belle
David J. Lipman
Drummond
Drummond
Drummond
Drummond
Edgar
Eugene V. Koonin
Felsenstein
Grishin
Herbeck
Hirsh
Hurst
Irina V. Gopich
Jensen
Jones
Jordan
Jordan
Khaitovich
Koonin
Krylov
Lemos
Lobkovsky
Pal
Pal
Schrimpf
Tatusov
Tatusov
Vitkup
Wall
Wheeler
Wilke
Wilson
Wolf
Wolf
Wolf
Wolf
Yuri I. Wolf
Zhou
Zuckerkandl
Publication venue: Oxford University Press
Publication date
Field of study

A long-standing assumption in evolutionary biology is that the evolution rate of protein-coding genes depends, largely, on specific constraints that affect the function of the given protein. However, recent research in evolutionary systems biology revealed unexpected, significant correlations between evolution rate and characteristics of genes or proteins that are not directly related to specific protein functions, such as expression level and protein–protein interactions. The strongest connections were consistently detected between protein sequence evolution rate and the expression level of the respective gene. A recent genome-wide proteomic study revealed an extremely strong correlation between the abundances of orthologous proteins in distantly related animals, the nematode Caenorhabditis elegans and the fruit fly Drosophila melanogaster. We used the extensive protein abundance data from this study along with short-term evolutionary rates (ERs) of orthologous genes in nematodes and flies to estimate the relative contributions of structural–functional constraints and the translation rate to the evolution rate of protein-coding genes. Together the intrinsic constraints and translation rate account for approximately 50% of the variance of the ERs. The contribution of constraints is estimated to be 3- to 5-fold greater than the contribution of translation rate

The Roots of Bioinformatics in Protein Evolution

Author: AJP Martin
AP Ryle
CA Ouzounis
CB Anfinsen
CB Anfinsen
CB Bridges
CH Li
David B. Searls
E Abderhalden
E Margoliash
E Zuckerkandl
EB Lewis
F Sanger
G Braunitzer
GA Mross
HA Itano
Ingram
JB Hagen
K Brew
KA Walsh
L Pauling
MO Dayhoff
MO Dayhoff
MO Dayhoff
MO Dayhoff
MW Nirenberg
P Edman
P Edman
R Eck
RF Doolittle
RF Doolittle
RF Doolittle
RF Doolittle
RL Hill
Russell F. Doolittle
S Henikoff
S Moore
SB Needleman
SG Stephens
SJ Singer
V du Vigneuad
V Ingram
WA Fitch
WM Fitch
Publication venue: Public Library of Science
Publication date: 01/07/2010
Field of study

eScholarship - University of California

H2r: Identification of evolutionary important residues by means of an entropy based analysis of multiple sequence alignments

Author: A del Sol Mesa
AL Barabási
B Rost
C Notredame
C Ouzounis
C Sander
C Steegborn
CC Hyde
CE Shannon
D Altschuh
DR Caffrey
E Eyal
E Neher
E Weber-Ban
E Zuckerkandl
ER Tillier
F Pearl
GB Gloor
GM Süel
HO Villar
I Kass
IM Wallace
J Tsai
JA Capra
JP Dekker
K Katoh
K Wang
LA Kelley
LC Martin
M Landau
Matthias Zwick
MC Saraf
ME Noble
O Noivirt
O Olmea
OV Kalinina
OV Kalinina
R Merkl
RA Estabrook
RA Laskowski
Rainer Merkl
RD Finn
RI Dima
S Henikoff
SJ Fleishman
SM Larson
SW Lockless
T Lassmann
T Sato
TD Schneider
U Göbel
V Kulik
V Kulik
WH Press
WR Atchley
Publication venue: BioMed Central
Publication date: 01/01/2007
Field of study

BACKGROUND: A multiple sequence alignment (MSA) generated for a protein can be used to characterise residues by means of a statistical analysis of single columns. In addition to the examination of individual positions, the investigation of co-variation of amino acid frequencies offers insights into function and evolution of the protein and residues. RESULTS: We introduce conn(k), a novel parameter for the characterisation of individual residues. For each residue k, conn(k) is the number of most extreme signals of co-evolution. These signals were deduced from a normalised mutual information (MI) value U(k, l) computed for all pairs of residues k, l. We demonstrate that conn(k) is a more robust indicator than an individual MI-value for the prediction of residues most plausibly important for the evolution of a protein. This proposition was inferred by means of statistical methods. It was further confirmed by the analysis of several proteins. A server, which computes conn(k)-values is available at http://www-bioinf.uni-regensburg.de. CONCLUSION: The algorithms H2r, which analyses MSAs and computes conn(k)-values, characterises a specific class of residues. In contrast to strictly conserved ones, these residues possess some flexibility in the composition of side chains. However, their allocation is sensibly balanced with several other positions, as indicated by conn(k)

University of Regensburg Publication Server

Springer - Publisher Connector

Energetic Selection of Topology in Ferredoxins

Models of early protein evolution posit the existence of short peptides that bound metals and ions and served as transporters, membranes or catalysts. The Cys-X-X-Cys-X-X-Cys heptapeptide located within bacterial ferredoxins, enclosing an Fe4S4 metal center, is an attractive candidate for such an early peptide. Ferredoxins are ancient proteins and the simple α+β fold is found alone or as a domain in larger proteins throughout all three kingdoms of life. Previous analyses of the heptapeptide conformation in experimentally determined ferredoxin structures revealed a pervasive right-handed topology, despite the fact that the Fe4S4 cluster is achiral. Conformational enumeration of a model CGGCGGC heptapeptide bound to a cubane iron-sulfur cluster indicates both left-handed and right-handed folds could exist and have comparable stabilities. However, only the natural ferredoxin topology provides a significant network of backbone-to-cluster hydrogen bonds that would stabilize the metal-peptide complex. The optimal peptide configuration (alternating αL,αR) is that of an α-sheet, providing an additional mechanism where oligomerization could stabilize the peptide and facilitate iron-sulfur cluster binding

CiteSeerX

The Francis Crick Institute

Stabilization against Hyperthermal Denaturation through Increased CG Content Can Explain the Discrepancy between Whole Genome and 16S rRNA Analyses

Author: Altschul S. F.
Bansal A. K.
Bansal A. K.
Bansal A. K.
Cole J. R.
Doolittle W. F.
Fitz-Gibbon S. T.
Fleischmann R. D.
Galtier N.
Garrity G. M.
Gogarten J. P.
Graham D. E.
Gupta R. S.
Hurst L. D.
Huynen M. A.
Kampmann M.
Koonin E. V.
Margoliash E.
Mayr E.
Meyer T. E.
Nakashima H.
Tekaia F.
Wang H.
Waterman M. S.
Weisburg W. G.
Woese C. R.
Woese C. R.
Woese C. R.
Woese C. R.
Zuckerkandl E.
Publication venue: 'American Chemical Society (ACS)'
Publication date
Field of study

Two Novel Parvoviruses in Frugivorous New and Old World Bats

Bats, a globally distributed group of mammals with high ecological importance, are increasingly recognized as natural reservoir hosts for viral agents of significance to human and animal health. In the present study, we evaluated pools of blood samples obtained from two phylogenetically distant bat families, in particular from flying foxes (Pteropodidae), Eidolon helvum in West Africa, and from two species of New World leaf-nosed fruit bats (Phyllostomidae), Artibeus jamaicensis and Artibeus lituratus in Central America. A sequence-independent virus discovery technique (VIDISCA) was used in combination with high throughput sequencing to detect two novel parvoviruses: a PARV4-like virus named Eh-BtPV-1 in Eidolon helvum from Ghana and the first member of a putative new genus in Artibeus jamaicensis from Panama (Aj-BtPV-1). Those viruses were circulating in the corresponding bat colony at rates of 7–8%. Aj-BtPV-1 was also found in Artibeus lituratus (5.5%). Both viruses were detected in the blood of infected animals at high concentrations: up to 10E8 and to 10E10 copies/ml for Aj-BtPV-1 and Eh-BtPV-1 respectively. Eh-BtPV-1 was additionally detected in all organs collected from bats (brain, lungs, liver, spleen, kidneys and intestine) and spleen and kidneys were identified as the most likely sites where viral replication takes place. Our study shows that bat parvoviruses share common ancestors with known parvoviruses of humans and livestock. We also provide evidence that a variety of Parvovirinae are able to cause active infection in bats and that they are widely distributed in these animals with different geographic origin, ecologies and climatic ranges

Medicago truncatula contains a second gene encoding a plastid located glutamine synthetase exclusively expressed in developing seeds

Abstract Background Nitrogen is a crucial nutrient that is both essential and rate limiting for plant growth and seed production. Glutamine synthetase (GS), occupies a central position in nitrogen assimilation and recycling, justifying the extensive number of studies that have been dedicated to this enzyme from several plant sources. All plants species studied to date have been reported as containing a single, nuclear gene encoding a plastid located GS isoenzyme per haploid genome. This study reports the existence of a second nuclear gene encoding a plastid located GS in <it>Medicago truncatula</it>. Results This study characterizes a new, second gene encoding a plastid located glutamine synthetase (GS2) in <it>M. truncatula</it>. The gene encodes a functional GS isoenzyme with unique kinetic properties, which is exclusively expressed in developing seeds. Based on molecular data and the assumption of a molecular clock, it is estimated that the gene arose from a duplication event that occurred about 10 My ago, after legume speciation and that duplicated sequences are also present in closely related species of the Vicioide subclade. Expression analysis by RT-PCR and western blot indicate that the gene is exclusively expressed in developing seeds and its expression is related to seed filling, suggesting a specific function of the enzyme associated to legume seed metabolism. Interestingly, the gene was found to be subjected to alternative splicing over the first intron, leading to the formation of two transcripts with similar open reading frames but varying 5' UTR lengths, due to retention of the first intron. To our knowledge, this is the first report of alternative splicing on a plant GS gene. Conclusions This study shows that <it>Medicago truncatula </it>contains an additional GS gene encoding a plastid located isoenzyme, which is functional and exclusively expressed during seed development. Legumes produce protein-rich seeds requiring high amounts of nitrogen, we postulate that this gene duplication represents a functional innovation of plastid located GS related to storage protein accumulation exclusive to legume seed metabolism.</p

ProdInra

Integration of Evolutionary Features for the Identification of Functionally Important Residues in Major Facilitator Superfamily Transporters

The identification of functionally important residues is an important challenge for understanding the molecular mechanisms of proteins. Membrane protein transporters operate two-state allosteric conformational changes using functionally important cooperative residues that mediate long-range communication from the substrate binding site to the translocation pathway. In this study, we identified functionally important cooperative residues of membrane protein transporters by integrating sequence conservation and co-evolutionary information. A newly derived evolutionary feature, the co-evolutionary coupling number, was introduced to measure the connectivity of co-evolving residue pairs and was integrated with the sequence conservation score. We tested this method on three Major Facilitator Superfamily (MFS) transporters, LacY, GlpT, and EmrD. MFS transporters are an important family of membrane protein transporters, which utilize diverse substrates, catalyze different modes of transport using unique combinations of functional residues, and have enough characterized functional residues to validate the performance of our method. We found that the conserved cores of evolutionarily coupled residues are involved in specific substrate recognition and translocation of MFS transporters. Furthermore, a subset of the residues forms an interaction network connecting functional sites in the protein structure. We also confirmed that our method is effective on other membrane protein transporters. Our results provide insight into the location of functional residues important for the molecular mechanisms of membrane protein transporters

Advantages of a Mechanistic Codon Substitution Model for Evolutionary Analysis of Protein-Coding Sequences

Author: A Doron-Faigenboim
A Schneider
A Stuart
AL Halpern
B Shapiro
B Zhong
C Kosiol
D Posada
Darren P. Martin
DT Jones
E Zuckerkandl
G Bazykin
G Schwarz
H Akaike
H Nishihara
J Adachi
J Adachi
J Adachi
J Wakeley
JP Huelsenbeck
K Tamura
M Averof
M Go
M Hasegawa
M Ingman
M Kimura
M Nikaido
MA Larkin
MA Suchard
MO Dayhoff
MW Dimmic
N Galtier
N Goldman
P Lopez
RK Jansen
S Guindon
S Miyazawa
S Miyazawa
S Whelan
S Whelan
Sanzo Miyazawa
SQ Le
SV Muse
T Gojobori
T Miyata
TK Seo
TK Seo
V Minin
W Delport
W Delport
WM Fitch
WW Brown
Z Abdo
Z Yang
Z Yang
Z Yang
Z Yang
Z Yang
Z Yang
Z Yang
Z Yang
Publication venue: Public Library of Science
Publication date: 29/12/2011
Field of study

A mechanistic codon substitution model, in which each codon substitution rate is proportional to the product of a codon mutation rate and the average fixation probability depending on the type of amino acid replacement, has advantages over nucleotide, amino acid, and empirical codon substitution models in evolutionary analysis of protein-coding sequences. It can approximate a wide range of codon substitution processes. If no selection pressure on amino acids is taken into account, it will become equivalent to a nucleotide substitution model. If mutation rates are assumed not to depend on the codon type, then it will become essentially equivalent to an amino acid substitution model. Mutation at the nucleotide level and selection at the amino acid level can be separately evaluated.The present scheme for single nucleotide mutations is equivalent to the general time-reversible model, but multiple nucleotide changes in infinitesimal time are allowed. Selective constraints on the respective types of amino acid replacements are tailored to each gene in a linear function of a given estimate of selective constraints. Their good estimates are those calculated by maximizing the respective likelihoods of empirical amino acid or codon substitution frequency matrices. Akaike and Bayesian information criteria indicate that the present model performs far better than the other substitution models for all five phylogenetic trees of highly-divergent to highly-homologous sequences of chloroplast, mitochondrial, and nuclear genes. It is also shown that multiple nucleotide changes in infinitesimal time are significant in long branches, although they may be caused by compensatory substitutions or other mechanisms. The variation of selective constraint over sites fits the datasets significantly better than variable mutation rates, except for 10 slow-evolving nuclear genes of 10 mammals. An critical finding for phylogenetic analysis is that assuming variable mutation rates over sites lead to the overestimation of branch lengths