Search CORE

1 research outputs found

P-value based visualization of codon usage data

Author: B Lafay
B Lafay
C Médigue
DC Shields
F Kunst
F Supek
G Perrière
G Perrière
H Romero
HC Wang
HE Wood
I Moszer
JG Lawrence
JO Mclnerney
JO Mclnerney
K Takemaru
KV Mardia
L Holm
MK Waldor
MO Hill
Peter Meinicke
R Merkl
S Casjens
S Kanaya
S Karlin
S Waack
SA Zahler
SK Gupta
Stephan Waack
T Kohonen
Thomas Brodag
WH Press
Wolfgang Florian Fricke
Publication venue: BioMed Central
Publication date: 01/01/2006
Field of study

Two important and not yet solved problems in bacterial genome research are the identification of horizontally transferred genes and the prediction of gene expression levels. Both problems can be addressed by multivariate analysis of codon usage data. In particular dimensionality reduction methods for visualization of multivariate data have shown to be effective tools for codon usage analysis. We here propose a multidimensional scaling approach using a novel similarity measure for codon usage tables. Our probabilistic similarity measure is based on P-values derived from the well-known chi-square test for comparison of two distributions. Experimental results on four microbial genomes indicate that the new method is well-suited for the analysis of horizontal gene transfer and translational selection. As compared with the widely-used correspondence analysis, our method did not suffer from outlier sensitivity and showed a better clustering of putative alien genes in most cases

CiteSeerX

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central