A theoretical plot of the number of new genes expected to be found in the Nth genome for future sequencing projects

Abstract

The plot was generated using strains isolated in North America, and the extrapolation may not hold for isolates from other geographic locales if some distributed genes are geographically isolated. The model predicts that the number of new genes found in a strain will diminish 20 after sequencing 30 strains, and the number will trend toward 0 as the number of sequences becomes large.<p><b>Copyright information:</b></p><p>Taken from "Characterization and modeling of the core and supragenomes based on the complete genomic sequences of Rd and 12 clinical nontypeable strains"</p><p>http://genomebiology.com/2007/8/6/R103</p><p>Genome Biology 2007;8(6):R103-R103.</p><p>Published online 5 Jun 2007</p><p>PMCID:PMC2394751.</p><p></p

    Similar works

    Full text

    thumbnail-image

    Available Versions