Skip to main content
Article thumbnail
Location of Repository

A Re-Annotation of the Saccharomyces Cerevisiae Genome

By V. Wood, K. M. Rutherford, A Ivens, M-A Rajandream and B. Barrell


Discrepancies in gene and orphan number indicated by previous analyses suggest that S. cerevisiae would benefit from a consistent re-annotation. In this analysis three new genes are identified and 46 alterations to gene coordinates are described. 370 ORFs are defined as totally spurious ORFs which should be disregarded. At least a further 193 genes could be described as very hypothetical, based on a number of criteria. It was found that disparate genes with sequence overlaps over ten amino acids (especially at the N-terminus) are rare in both S. cerevisiae and Sz. pombe. A new S. cerevisiae gene number estimate with an upper limit of 5804 is proposed, but after the removal of very hypothetical genes and pseudogenes this is reduced to 5570. Although this is likely to be closer to the true upper limit, it is still predicted to be an overestimate of gene number. A complete list of revised gene coordinates is available from the Sanger Centre (S. cerevisiae reannotation: ftp://ftp/pub/yeast/SCreannotation)

Topics: Research Article
Publisher: Hindawi Publishing Corporation
OAI identifier:
Provided by: PubMed Central
Download PDF:
Sorry, we are unable to provide the full text but you may find it at the following location(s):
  • http://www.pubmedcentral.nih.g... (external link)
  • Suggested articles


    1. (1994). A generalized profile syntax for biomolecular sequences motifs and its function in automatic sequence interpretation.
    2. (1994). A workbench for large scale sequence homology analysis.
    3. (2000). Analysis of 114 kb of DNA sequence from fission yeast chromosome 2 immediately centromere-distal to his 5.
    4. (2000). Artemis: sequence visualization and annotation.
    5. (1990). Basic Local Alignment Search Tool.
    6. (1993). Dating the evolutionary radiations of the true fungi.
    7. (1997). Exploring the metabolic and genetic control of gene expression on a genomic scale.
    8. (1996). From DNA sequence to biological function.
    9. (1997). Functional genomics: it’s all how you read it.
    10. (2000). Genomic exploration of the hemiascomycetous yeasts: 19. Ascomycetesspecific genes.
    11. (2000). Genomic exploration of the hemiascomycetous yeasts: 21. Comparative functional classification of genes.
    12. (1988). Improved tools for biological sequence comparison.
    13. (1996). Life with 6000 genes.
    14. (1999). Origin and properties of non-coding ORFs in the yeast genome.
    15. (1997). Overview of the yeast genome.
    16. (1996). PairWise and SearchWise: finding the optimal alignment in a simultaneous comparison of a protein profile against all DNA translation frames.
    17. (1999). Pfam 3.1: 1313 multiple alignments and profiles HMMs match the majority of proteins.
    18. (2000). Recognition of protein coding genes in the yeast genome at better than 95% accuracy based on the Z curve.
    19. (1991). Synonymous codon usage in Saccharomyces cerevisiae.
    20. (1992). The complete DNA sequence of yeast chromosome III.
    21. (2000). The genome of Saccharomyces cerevisiae revisited.
    22. (1999). The SWISS-PROT protein sequence data bank and its supplement TrEMBL in 1999.
    23. (1996). The yeast genome project: what did we learn?
    24. (1997). tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence.

    To submit an update or takedown request for this paper, please submit an Update/Correction/Removal Request.