Location of Repository

Distance-based analysis to reveal vertebrate phylogeny without sequence alignment using complete mitochondrial genomes

By Zu-Guo Yu, Li-Qian Zhou, Vo V. Anh, Ka-Hou Chu, C. P. Li and Y. J. Chen


There have been a number of recent attempts to\ud develop methodologies that do not require sequence\ud alignment for deriving species phylogeny based on\ud overall similarities of the complete genomes. The\ud mitochondrial genomes have provided much information on the evolution of this organelle and have been used for phylogenetic reconstruction by various methods with or without sequence alignment.\ud In this paper we introduce three fast algorithms, namely, dynamical language model with correlation distance, Fourier transform with Kullback-Leibler divergence distance, log-correlation distance, for deriving vertebrate phylogeny based on mitochondrial genomes. The distance-based analyses show that the mitochondrial genomes are separated into three major clusters corresponding to mammals, fish, and\ud Archosauria (including birds and reptiles)respectively. The interrelationships among the mitochondrial genomes are roughly in agreement with the current understanding on the phylogeny of vertebrates revealed by the traditional approaches

Topics: 010202 Biological Mathematics, 060409 Molecular Evolution, vertebrate phylogeny, mitochondrial genome, dynamical language model, Fourier transform, correlation distance, Kullback, Leibler divergence distance
Publisher: The International Institute of Informatics and Systemics (IIIS)
Year: 2007
OAI identifier: oai:eprints.qut.edu.au:15654

Suggested articles



  1. (2000). A case for evolutionary genomics and the comprehensive examination of sequence biodiversity.
  2. A comprehensive vertebrate phylogeny using vector representations of protein sequences from whole genomes.
  3. (2002). A genomic schism in birds revealed by phylogenetic analysis of DNA strings.
  4. (2001). An information-based sequence distance and its application to whole mitochondrial genome phylogeny.
  5. (2003). Branching out.
  6. (2004). Chaos game representation, and multifractal and correlation analysis of protein sequences from complete genome based on detailed HP model,
  7. (1998). Complete mitochondrial DNA sequence of the fat dormouse, Glis glis: further evidence of rodent parahyly.
  8. (2004). CVTree: a phylogenetic tree reconstruction tool based on whole genomes.
  9. (2001). Distance, correlation and mutual information among portraits of organisms based on complete genomes.
  10. (1991). Elements of Information Theory.
  11. (2000). Information content of protein sequences.
  12. (2002). Integrated gene species phylogenies from unaligned whole genome protein sequences.
  13. Molecular phylogenetics and the origins of placental mammals.
  14. (1999). Molecular studies suggest that cartilaginous fishes have a terminal position in the piscine tree.
  15. (2000). Monophyletic origin of the order chiroptera and its phylogenetic position among mammalia, as inferred from the complete sequence of the mitochondrial DNA of a japanese megabat, the ryukyu flying fox Pteropus dasymallus.
  16. (2003). Multifractal and correlation analysis of protein sequences from complete genome,
  17. (2004). Origin and phylogeny of chloroplasts: A simple correlation analysis of complete genomes.
  18. (1993). PHYLIP (phylogeny Inference package) version 3.5c. Distributed by the author at http://evolution.genetics.washington.edu/phylip. html,
  19. (2003). Phylogenomics: intersection of evolution and genomics.
  20. (2005). Phylogeny of prokaryotes and chloroplasts revealed by a simple composition approach on all protein sequences from whole genome without sequence alignment,
  21. Resolution of the early placental mammal radiation using Bayesian phylogenetics.
  22. (2006). Ribosomal RNA as molecular barcodes: a simple correlation analysis without sequence alignment.
  23. (1996). The complete mitochondrial DNA sequence of the greater indian rhinoceros, Rhinoceros unicornis, and the phylogenetic relationship among Carnivora, Perissodactyla, and Artiodactyla.
  24. (2003). The genomic tree of living organisms based on a fractal model.
  25. (2000). The mitochondrial genome of the sperm whale and a new molecular reference for estimating eutherian divergence dates.
  26. (1987). The neighbor-joining method: a new method for reconstructing phylogenetic trees.
  27. (2000). The phylogenetic position of the Talpidae within eutheria based on analysis of complete mitochondrial sequences.
  28. (1990). Towards a natural system of organisms: Proposal for the domains Archaea, Bacteria, and Eucarya,
  29. Where do rodents fit? Evidence from the complete mitochondrial genome of Sciurus vulgaris.
  30. (2004). Whole proteome prokaryote phylogeny without sequence alignment: a K-string composition approach.

To submit an update or takedown request for this paper, please submit an Update/Correction/Removal Request.