Article thumbnail

Gene Space Dynamics During the Evolution of Aegilops tauschii, Brachypodium distachyon, Oryza sativa, and Sorghum bicolor Genomes

By A. N. Massa, H. Wanjugi, K. R. Deal, K. O'Brien, F. M. You, R. Maiti, A. P. Chan, Y. Q. Gu, M. C. Luo, O. D. Anderson, P. D. Rabinowicz, J. Dvorak and K. M. Devos


Nine different regions totaling 9.7 Mb of the 4.02 Gb Aegilops tauschii genome were sequenced using the Sanger sequencing technology and compared with orthologous Brachypodium distachyon, Oryza sativa (rice), and Sorghum bicolor (sorghum) genomic sequences. The ancestral gene content in these regions was inferred and used to estimate gene deletion and gene duplication rates along each branch of the phylogenetic tree relating the four species. The total gene number in the extant Ae. tauschii genome was estimated to be 36,371. The gene deletion and gene duplication rates and total gene numbers in the four genomes were used to estimate the total gene number in each node of the phylogenetic tree. The common ancestor of the Brachypodieae and Triticeae lineages was estimated to have had 28,558 genes, and the common ancestor of the Panicoideae, Ehrhartoideae, and Pooideae subfamilies was estimated to have had 27,152 or 28,350 genes, depending on the ancestral gene scenario. Relative to the Brachypodieae and Triticeae common ancestor, the gene number was reduced in B. distachyon by 3,026 genes and increased in Ae. tauschii by 7,813 genes. The sum of gene deletion and gene duplication rates, which reflects the rate of gene synteny loss, was correlated with the rate of structural chromosome rearrangements and was highest in the Ae. tauschii lineage and lowest in the rice lineage. The high rate of gene space evolution in the Ae. tauschii lineage accounts for the fact that, contrary to the expectations, the level of synteny between the phylogenetically more related Ae. tauschii and B. distachyon genomes is similar to the level of synteny between the Ae. tauschii genome and the genomes of the less related rice and sorghum. The ratio of gene duplication to gene deletion rates in these four grass species closely parallels both the total number of genes in a species and the overall genome size. Because the overall genome size is to a large extent a function of the repeated sequence content in a genome, we suggest that the amount and activity of repeated sequences are important factors determining the number of genes in a genome

Topics: Research Articles
Publisher: Oxford University Press
OAI identifier:
Provided by: PubMed Central

To submit an update or takedown request for this paper, please submit an Update/Correction/Removal Request.

Suggested articles


  1. (2003). (31 co-authors).
  2. (2009). (32 co-authors).
  3. (2003). A complex history of rearrangement in an orthologous region of the maize, sorghum, and rice genomes.
  4. (2000). A whole-genome assembly of Drosophila.
  5. (2004). Analyses of LTR-retrotransposon structures reveal recent and rapid genomic DNA loss in rice. Genome Res.
  6. (2001). Analysis of a contiguous 211 kb sequence in diploid wheat (Triticum monococcum L.) reveals multiple mechanisms of genome evolution.
  7. (2002). Apollo: a sequence annotation editor. Genome Biol.
  8. (2001). Comparative sequence analysis of colinear barley and rice bacterial artificial chromosomes. Plant Physiol.
  9. (2007). Comparison of orthologous loci from small grass genomes Brachypodium and rice: implications for wheat genomics and grass genome annotation.
  10. (1997). Detailed comparative mapping of cereal chromosome regions corresponding to the Ph1 locus in wheat.
  11. (1944). Discovery of the DD-analyser, one of the ancestors of Triticum vulgare (Japanese). Agric Hort
  12. (1997). DNA sequence evidence for the segmental allotetraploid origin of maize.
  13. (1997). Do plants have a one-way ticket to genomic obesity? Plant Cell.
  14. (1998). Evidence that a recent increase in maize genome size was caused by the massive amplification of intergene retrotransposons.
  15. (2005). Gene duplication and exon shuffling by helitron-like transposons generate intraspecies diversity in maize.
  16. (2005). Gene movement by Helitron transposons contributes to the haplotype variability of maize.
  17. (2007). Genome plasticity a key factor in the success of polyploid wheat under domestication.
  18. (2010). Genome sequencing and analysis of the model grass Brachypodium distachyon.
  19. (2005). Genome Sequencing Project.
  20. (2002). Genome size reduction through illegitimate recombination counteracts genome expansion in Arabidopsis. Genome Res.
  21. genomes of wheat.
  22. (2003). High-throughput fingerprinting of bacterial artificial chromosomes using the SNaPshot labeling kit and sizing of restriction fragments by capillary electrophoresis.
  23. (2009). Identification and characterization of pseudogenes in the rice gene complement.
  24. (2007). Mechanisms and rates of birth and death of dispersed duplicated genes during the evolution of a multigene family in diploid and tetraploid wheats. Mol Biol Evol.
  25. (2010). Megabase level sequencing reveals contrasted organization and evolution patterns of the wheat gene and transposable element spaces. Plant Cell 22:1686–1701.
  26. (1997). Microcolinearity in sh2-homologous regions of the maize, rice and sorghum genomes.
  27. (1996). Nested retrotransposons in the intergenic regions of the maize genome.
  28. (1999). Nuclear DNA content of perennial grasses of the Triticeae. Crop Sci.
  29. (1991). Nuclear DNA content of some important plant species. Plant Mol Biol Rep.
  30. (2004). On the tetraploid origin of the maize genome. Comp Funct Genomics 5:281–284.
  31. (2004). Pack-MULE transposable elements mediate gene evolution in plants.
  32. (2010). Patching gaps in plant genomes results in gene movement and erosion of colinearity.
  33. (2001). Phylogeny and subfamilial classification of the grasses (Poaceae). Ann Mo Bot Gard.
  34. (2006). Plant genome organisation and diversity: the year of the junk! Curr Opin Biotechnol.
  35. (2003). Rapid genome divergence at orthologous low molecular weight glutenin loci of the A and A m
  36. (2004). Rapid recent growth and divergence of rice nuclear genomes.
  37. (2001). Rolling-circle transposons in eukaryotes.
  38. (2004). Sequence composition, organization and evolution of the core Triticeae genomes.
  39. (2005). Tempos of gene locus deletions and duplications and their relationship to recombination rate during diploid and polyploid evolution in the Aegilops-Triticum alliance.
  40. (2009). The B73 maize genome: complexity, diversity and dynamics.
  41. (2004). The considerable genome size variation of Hordeum species (Poaceae) is linked to phylogeny, life form, ecology, and speciation rates. Mol Biol Evol.
  42. (2009). The DAWGPAWS pipeline for the annotation of genes and transposable elements in plant genomes. Plant Methods 5:8–18.
  43. (2005). The evolutionary fate of MULE-mediated duplications of host gene fragments in rice. Genome Res.
  44. (2003). The genetic colinearity of rice and other cereals on the basis of genomic sequence analysis.
  45. (1946). The origin of Triticum spelta and its free-threshing hexaploid relatives.
  46. (2009). The Sorghum bicolor genome and the diversification of grasses.
  47. (2005). Transposable elements, gene creation and genome rearrangement in flowering plants.
  48. (2009). Triticeae genome structure and evolution.