The expected number of total gene clusters and core gene clusters identified at the addition of each genome to the clustering dataset

Abstract

Modeling predictions are based on the eight strain training set (see 'Mathematical development of a finite supragenome model'). The number of genes observed in all strains levels off to an asymptote that corresponds to a core set of genes. The rate of increase in total genes decreases, but does not level off due to the discovery of rare genes.<p><b>Copyright information:</b></p><p>Taken from "Characterization and modeling of the core and supragenomes based on the complete genomic sequences of Rd and 12 clinical nontypeable strains"</p><p>http://genomebiology.com/2007/8/6/R103</p><p>Genome Biology 2007;8(6):R103-R103.</p><p>Published online 5 Jun 2007</p><p>PMCID:PMC2394751.</p><p></p

    Similar works

    Full text

    thumbnail-image

    Available Versions