Abstract

The amino acid composition calculated from a gene assembly coding more than 3,000-7,000 amino acid residues represents the species specific amino acid composition based on the complete genome. In the present mathematical study, the 17 amino acid composition based on the sample size, 3,000-7,000, represents an amino acid composition with 95% level simultaneous confidence intervals for all amino acid probabilities in the sample. A genomic structure is constructed homogeneously with putative small units coding similar amino acid compositions under a mathematical rule

    Similar works