3,005 research outputs found

    Genome signatures, self-organizing maps and higher order phylogenies: a parametric analysis

    Get PDF
    Genome signatures are data vectors derived from the compositional statistics of DNA. The self-organizing map (SOM) is a neural network method for the conceptualisation of relationships within complex data, such as genome signatures. The various parameters of the SOM training phase are investigated for their effect on the accuracy of the resulting output map. It is concluded that larger SOMs, as well as taking longer to train, are less sensitive in phylogenetic classification of unknown DNA sequences. However, where a classification can be made, a larger SOM is more accurate. Increasing the number of iterations in the training phase of the SOM only slightly increases accuracy, without improving sensitivity. The optimal length of the DNA sequence k-mer from which the genome signature should be derived is 4 or 5, but shorter values are almost as effective. In general, these results indicate that small, rapidly trained SOMs are generally as good as larger, longer trained ones for the analysis of genome signatures. These results may also be more generally applicable to the use of SOMs for other complex data sets, such as microarray data

    Food-chain competition influences gene's size

    Full text link
    We have analysed an effect of the Bak-Sneppen predator-prey food-chain self-organization on nucleotide content of evolving species. In our model, genomes of the species under consideration have been represented by their nucleotide genomic fraction and we have applied two-parameter Kimura model of substitutions to include the changes of the fraction in time. The initial nucleotide fraction and substitution rates were decided with the help of random number generator. Deviation of the genomic nucleotide fraction from its equilibrium value was playing the role of the fitness parameter, BB, in Bak-Sneppen model. Our finding is, that the higher is the value of the threshold fitness, during the evolution course, the more frequent are large fluctuations in number of species with strongly differentiated nucleotide content; and it is more often the case that the oldest species, which survive the food-chain competition, might have specific nucleotide fraction making possible generating long genesComment: 11 pages including 7 figure

    Genome Snapshot: a new resource at the Saccharomyces Genome Database (SGD) presenting an overview of the Saccharomyces cerevisiae genome

    Get PDF
    Sequencing and annotation of the entire Saccharomyces cerevisiae genome has made it possible to gain a genome-wide perspective on yeast genes and gene products. To make this information available on an ongoing basis, the Saccharomyces Genome Database (SGD) () has created the Genome Snapshot (). The Genome Snapshot summarizes the current state of knowledge about the genes and chromosomal features of S.cerevisiae. The information is organized into two categories: (i) number of each type of chromosomal feature annotated in the genome and (ii) number and distribution of genes annotated to Gene Ontology terms. Detailed lists are accessible through SGD's Advanced Search tool (), and all the data presented on this page are available from the SGD ftp site ()

    Nucleosomes affect local transformation efficiency

    Get PDF
    Genetic transformation is a natural process during which foreign DNA enters a cell and integrates into the genome. Apart from its relevance for horizontal gene transfer in nature, transformation is also the cornerstone of today's recombinant gene technology. Despite its importance, relatively little is known about the factors that determine transformation efficiency. We hypothesize that differences in DNA accessibility associated with nucleosome positioning may affect local transformation efficiency. We investigated the landscape of transformation efficiency at various positions in the Saccharomyces cerevisiae genome and correlated these measurements with nucleosome positioning. We find that transformation efficiency shows a highly significant inverse correlation with relative nucleosome density. This correlation was lost when the nucleosome pattern, but not the underlying sequence was changed. Together, our results demonstrate a novel role for nucleosomes and also allow researchers to predict transformation efficiency of a target region and select spots in the genome that are likely to yield higher transformation efficiency

    Evidence for abundant transcription of non-coding regions in the Saccharomyces cerevisiae genome

    Get PDF
    Background: Recent studies in a growing number of organisms have yielded accumulating evidence that a significant portion of the non-coding region in the genome is transcribed. We address this issue in the yeast Saccharomyces cerevisiae. Results: Taking into account the absence of a significantly large yeast EST database, we use microarray expression data collected for genomic regions erroneously believed to be coding to study the expression pattern of non-coding regions in the Saccharomyces cerevisiae genome. We find that at least 164 out of 589 (28%) such regions are expressed under specific biological conditions. In particular, looking at the probes that are located opposing other known genes at the same genomic locus, we find that 88 out of 341 (26%) of these genes support antisense transcription. The expression patterns of these antisense genes are positively correlated. We validate these results using RT-PCR on a sample of 6 non-coding transcripts. Conclusions: 1. The yeast genome is transcribed on a scale larger than previously assumed. 2. Correlated transcription of antisense genes is abundant in the yeast genome. 3. Antisense genes in yeast are non-coding.Comment: Journal version available at http://www.biomedcentral.com/1471-2164/6/93/abstrac

    A Re-Annotation of the Saccharomyces Cerevisiae Genome

    Get PDF
    Discrepancies in gene and orphan number indicated by previous analyses suggest that S. cerevisiae would benefit from a consistent re-annotation. In this analysis three new genes are identified and 46 alterations to gene coordinates are described. 370 ORFs are defined as totally spurious ORFs which should be disregarded. At least a further 193 genes could be described as very hypothetical, based on a number of criteria. It was found that disparate genes with sequence overlaps over ten amino acids (especially at the N-terminus) are rare in both S. cerevisiae and Sz. pombe. A new S. cerevisiae gene number estimate with an upper limit of 5804 is proposed, but after the removal of very hypothetical genes and pseudogenes this is reduced to 5570. Although this is likely to be closer to the true upper limit, it is still predicted to be an overestimate of gene number. A complete list of revised gene coordinates is available from the Sanger Centre (S. cerevisiae reannotation: ftp://ftp/pub/yeast/SCreannotation)
    corecore