Skip to main content
Article thumbnail
Location of Repository

Multidimensional Scaling Applied to Histogram-Based DNA Analysis

By António C. Costa, J. A. Tenreiro Machado and Maria Dulce Quelhas


This paper aims to study the relationships between chromosomal DNA sequences of twenty species. We propose a methodology combining DNA-based word frequency histograms, correlation methods, and an MDS technique to visualize structural information underlying chromosomes (CRs) and species. Four statistical measures are tested (Minkowski, Cosine, Pearson product-moment, and Kendall τ rank correlations) to analyze the information content of 421 nuclear CRs from twenty species. The proposed methodology is built on mathematical tools and allows the analysis and visualization of very large amounts of stream data, like DNA sequences, with almost no assumptions other than the predefined DNA “word length.” This methodology is able to produce comprehensible three-dimensional visualizations of CR clustering and related spatial and structural patterns. The results of the four test correlation scenarios show that the high-level information clusterings produced by the MDS tool are qualitatively similar, with small variations due to each correlation method characteristics, and that the clusterings are a consequence of the input data and not method’s artifacts

Topics: Genetics, QH426-470, Biology (General), QH301-705.5, Science, Q, DOAJ:Genetics, DOAJ:Biology, DOAJ:Biology and Life Sciences
Publisher: Hindawi Publishing Corporation
Year: 2012
DOI identifier: 10.1155/2012
OAI identifier:
Download PDF:
Sorry, we are unable to provide the full text but you may find it at the following location(s):
  • (external link)
  • (external link)
  • (external link)
  • (external link)
  • Suggested articles

    To submit an update or takedown request for this paper, please submit an Update/Correction/Removal Request.