
Unexpected correlations between gene expression and codon usage bias from microarray data for the whole Escherichia coli K-12 genome

By M. dos Reis, Lorenz Wernisch and Renos Savva


Escherichia coli has long been regarded as a model organism in the study of codon usage bias (CUB). However, most studies of CUB in this organism have been computational or, when experimental, restricted to small datasets; particularly little attention has been given to genes with low CUB. In this work, correspondence analysis on codon usage is used to classify E. coli genes into three groups, and the relationship between these groups and expression levels from microarray experiments is studied. The groups are: group 1, highly biased genes; group 2, moderately biased genes; and group 3, AT-rich genes with low CUB. It is shown that, surprisingly, there is a negative correlation between codon bias and expression levels for group 3 genes, i.e. genes with extremely low codon adaptation index (CAI) values are highly expressed, while group 2 genes show the lowest average expression levels and group 1 genes show the expected positive correlation between CAI and expression. This trend is maintained over all functional gene groups and seems to contradict the E. coli–yeast paradigm on CUB. It is argued that these findings are still compatible with the mutation–selection balance hypothesis of codon usage and that E. coli genes form a dynamic system shaped by these factors.
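For readers unfamiliar with the codon adaptation index mentioned above, the following is a minimal sketch of the standard definition: each codon receives a relative adaptiveness weight (its count in a reference set of highly expressed genes divided by the count of the most frequent synonymous codon), and a gene's CAI is the geometric mean of the weights of its codons. This is not the authors' code. The function names, the three-family `SYNONYMS` table, and the 0.5 pseudocount for unseen codons are assumptions made for illustration; a real analysis would use the full genetic code and a curated reference set.

```python
import math
from collections import Counter

# Hypothetical toy table: only three synonymous families are listed here.
# A real analysis would cover every degenerate amino acid.
SYNONYMS = {
    "Lys": ["AAA", "AAG"],
    "Glu": ["GAA", "GAG"],
    "Leu": ["CTG", "CTT", "CTC", "CTA", "TTA", "TTG"],
}

def codons(seq):
    """Split a coding sequence into codons, ignoring a trailing partial codon."""
    usable = len(seq) - len(seq) % 3
    return [seq[i:i + 3] for i in range(0, usable, 3)]

def relative_adaptiveness(reference_seqs):
    """Weight of each codon relative to the most used synonym in the reference set."""
    counts = Counter(c for s in reference_seqs for c in codons(s))
    weights = {}
    for family in SYNONYMS.values():
        top = max(counts[c] for c in family)
        if top == 0:
            continue  # family never seen in the reference set; leave it unscored
        for c in family:
            # Assumed pseudocount of 0.5 for unseen codons, to avoid log(0).
            weights[c] = max(counts[c], 0.5) / top
    return weights

def cai(seq, weights):
    """Geometric mean of the weights of the scorable codons in seq."""
    logs = [math.log(weights[c]) for c in codons(seq) if c in weights]
    if not logs:
        raise ValueError("sequence contains no scorable codons")
    return math.exp(sum(logs) / len(logs))
```

With weights built from a reference set, `cai(gene, weights)` returns a value in (0, 1]; values near 1 indicate a gene that mostly uses the codons favoured in the reference set, which is the sense in which group 3 genes have "extremely low" CAI.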

Topics: bcs
Publisher: Oxford Journals
Year: 2003
OAI identifier:

