The expected number of total gene clusters and core gene clusters identified at the addition of each genome to the clustering dataset

Benjamin Janto (34106); Fen Z Hu (34099); Garth D Ehrlich (34110); J Christopher Post (48105); Jay Hayes (48103); Justin S Hogg (48102); Randy Keefe (48104); Robert Boissy (34109)

The expected number of total gene clusters and core gene clusters identified at the addition of each genome to the clustering dataset

Authors: Benjamin Janto (34106)
Fen Z Hu (34099)
Garth D Ehrlich (34110)
J Christopher Post (48105)
Jay Hayes (48103)
Justin S Hogg (48102)
Randy Keefe (48104)
Robert Boissy (34109)
Publication date
Publisher
Doi

Abstract

Modeling predictions are based on the eight strain training set (see 'Mathematical development of a finite supragenome model'). The number of genes observed in all strains levels off to an asymptote that corresponds to a core set of genes. The rate of increase in total genes decreases, but does not level off due to the discovery of rare genes.Copyright information:Taken from "Characterization and modeling of the core and supragenomes based on the complete genomic sequences of Rd and 12 clinical nontypeable strains"http://genomebiology.com/2007/8/6/R103Genome Biology 2007;8(6):R103-R103.Published online 5 Jun 2007PMCID:PMC2394751.</p

Similar works

Full text

Available Versions

FigShare

oai:figshare.com:article/81140

Last time updated on 16/03/2018