47,598 research outputs found
The Mystery of Two Straight Lines in Bacterial Genome Statistics. Release 2007
In special coordinates (codon position--specific nucleotide frequencies)
bacterial genomes form two straight lines in 9-dimensional space: one line for
eubacterial genomes, another for archaeal genomes. All the 348 distinct
bacterial genomes available in Genbank in April 2007, belong to these lines
with high accuracy. The main challenge now is to explain the observed high
accuracy. The new phenomenon of complementary symmetry for codon
position--specific nucleotide frequencies is observed. The results of analysis
of several codon usage models are presented. We demonstrate that the
mean--field approximation, which is also known as context--free, or complete
independence model, or Segre variety, can serve as a reasonable approximation
to the real codon usage. The first two principal components of codon usage
correlate strongly with genomic G+C content and the optimal growth temperature
respectively. The variation of codon usage along the third component is related
to the curvature of the mean-field approximation. First three eigenvalues in
codon usage PCA explain 59.1%, 7.8% and 4.7% of variation. The eubacterial and
archaeal genomes codon usage is clearly distributed along two third order
curves with genomic G+C content as a parameter.Comment: Significantly extended version with new data for all the 348 distinct
bacterial genomes available in Genbank in April 200
Translation elongation can control translation initiation on eukaryotic mRNAs
Synonymous codons encode the same amino acid, but differ in other biophysical properties. The evolutionary selection of codons whose properties are optimal for a cell generates the phenomenon of codon bias. Although recent studies have shown strong effects of codon usage changes on protein expression levels and cellular physiology, no translational control mechanism is known that links codon usage to protein expression levels. Here, we demonstrate a novel translational control mechanism that responds to the speed of ribosome movement immediately after the start codon. High initiation rates are only possible if start codons are liberated sufficiently fast, thus accounting for the observation that fast codons are overrepresented in highly expressed proteins. In contrast, slow codons lead to slow liberation of the start codon by initiating ribosomes, thereby interfering with efficient translation initiation. Codon usage thus evolved as a means to optimise translation on individual mRNAs, as well as global optimisation of ribosome availability
Sum rules of codon usage probabilities
In the crystal basis model of the genetic code, it is deduced that the sum of
usage probabilities of the codons with C and A in the third position for the
quartets and/or sextets is independent of the biological species for
vertebrates. A comparison with experimental data shows that the prediction is
satisfied within about 5 %.Comment: 7 page
Genome landscapes and bacteriophage codon usage
Across all kingdoms of biological life, protein-coding genes exhibit unequal
usage of synonmous codons. Although alternative theories abound, translational
selection has been accepted as an important mechanism that shapes the patterns
of codon usage in prokaryotes and simple eukaryotes. Here we analyze patterns
of codon usage across 74 diverse bacteriophages that infect E. coli, P.
aeruginosa and L. lactis as their primary host. We introduce the concept of a
`genome landscape,' which helps reveal non-trivial, long-range patterns in
codon usage across a genome. We develop a series of randomization tests that
allow us to interrogate the significance of one aspect of codon usage, such a
GC content, while controlling for another aspect, such as adaptation to
host-preferred codons. We find that 33 phage genomes exhibit highly non-random
patterns in their GC3-content, use of host-preferred codons, or both. We show
that the head and tail proteins of these phages exhibit significant bias
towards host-preferred codons, relative to the non-structural phage proteins.
Our results support the hypothesis of translational selection on viral genes
for host-preferred codons, over a broad range of bacteriophages.Comment: 9 Color Figures, 5 Tables, 53 Reference
Codon Bias Patterns of 's Interacting Proteins
Synonymous codons, i.e., DNA nucleotide triplets coding for the same amino
acid, are used differently across the variety of living organisms. The
biological meaning of this phenomenon, known as codon usage bias, is still
controversial. In order to shed light on this point, we propose a new codon
bias index, , that is based on the competition between cognate and
near-cognate tRNAs during translation, without being tuned to the usage bias of
highly expressed genes. We perform a genome-wide evaluation of codon bias for
, comparing with other widely used indices: , , and
. We show that and capture similar information by being
positively correlated with gene conservation, measured by ERI, and
essentiality, whereas, and appear to be less sensitive to
evolutionary-functional parameters. Notably, the rate of variation of and
with ERI allows to obtain sets of genes that consistently belong to
specific clusters of orthologous genes (COGs). We also investigate the
correlation of codon bias at the genomic level with the network features of
protein-protein interactions in . We find that the most densely
connected communities of the network share a similar level of codon bias (as
measured by and ). Conversely, a small difference in codon bias
between two genes is, statistically, a prerequisite for the corresponding
proteins to interact. Importantly, among all codon bias indices, turns
out to have the most coherent distribution over the communities of the
interactome, pointing to the significance of competition among cognate and
near-cognate tRNAs for explaining codon usage adaptation
Estimating translational selection in Eukaryotic Genomes
Natural selection on codon usage is a pervasive force that acts on a large variety of prokaryotic and eukaryotic genomes. Despite this, obtaining reliable estimates of selection on codon usage has proved complicated, perhaps due to the fact that the selection coefficients involved are very small. In this work, a population genetics model is used to measure the strength of selected codon usage bias, S, in 10 eukaryotic genomes. It is shown that the strength of selection is closely linked to expression and that reliable estimates of selection coefficients can only be obtained for genes with very similar expression levels. We compare the strength of selected codon usage for orthologous genes across all 10 genomes classified according to expression categories. Fungi genomes present the largest S values (2.24–2.56), whereas multicellular invertebrate and plant genomes present more moderate values (0.61–1.91). The large mammalian genomes (human and mouse) show low S values (0.22–0.51) for the most highly expressed genes. This might not be evidence for selection in these organisms as the technique used here to estimate S does not properly account for nucleotide composition heterogeneity along such genomes. The relationship between estimated S values and empirical estimates of population size is presented here for the first time. It is shown, as theoretically expected, that population size has an important role in the operativity of translational selection
Genetic Evolution and Molecular Selection of the HE Gene of Influenza C Virus
Influenza C virus (ICV) was first identified in humans and swine, but recently also in cattle, indicating a wider host range and potential threat to both the livestock industry and public health than was originally anticipated. The ICV hemagglutinin-esterase (HE) glycoprotein has multiple functions in the viral replication cycle and is the major determinant of antigenicity. Here, we developed a comparative approach integrating genetics, molecular selection analysis, and structural biology to identify the codon usage and adaptive evolution of ICV. We show that ICV can be classified into six lineages, consistent with previous studies. The HE gene has a low codon usage bias, which may facilitate ICV replication by reducing competition during evolution. Natural selection, dinucleotide composition, and mutation pressure shape the codon usage patterns of the ICV HE gene, with natural selection being the most important factor. Codon adaptation index (CAI) and relative codon deoptimization index (RCDI) analysis revealed that the greatest adaption of ICV was to humans, followed by cattle and swine. Additionally, similarity index (SiD) analysis revealed that swine exerted a stronger evolutionary pressure on ICV than humans, which is considered the primary reservoir. Furthermore, a similar tendency was also observed in the M gene. Of note, we found HE residues 176, 194, and 198 to be under positive selection, which may be the result of escape from antibody responses. Our study provides useful information on the genetic evolution of ICV from a new perspective that can help devise prevention and control strategies
- …
