Search CORE

3,005 research outputs found

Genome signatures, self-organizing maps and higher order phylogenies: a parametric analysis

Author: Gatherer Derek
Publication venue: 'SAGE Publications'
Publication date: 01/01/2007
Field of study

Genome signatures are data vectors derived from the compositional statistics of DNA. The self-organizing map (SOM) is a neural network method for the conceptualisation of relationships within complex data, such as genome signatures. The various parameters of the SOM training phase are investigated for their effect on the accuracy of the resulting output map. It is concluded that larger SOMs, as well as taking longer to train, are less sensitive in phylogenetic classification of unknown DNA sequences. However, where a classification can be made, a larger SOM is more accurate. Increasing the number of iterations in the training phase of the SOM only slightly increases accuracy, without improving sensitivity. The optimal length of the DNA sequence k-mer from which the genome signature should be derived is 4 or 5, but shorter values are almost as effective. In general, these results indicate that small, rapidly trained SOMs are generally as good as larger, longer trained ones for the analysis of genome signatures. These results may also be more generally applicable to the use of SOMs for other complex data sets, such as microarray data

Crossref

Directory of Open Access Journals

PubMed Central

Enlighten

Lancaster E-Prints

Food-chain competition influences gene's size

Author: Bak
Chowdhury
Dietrich Stauffer
Gierlik
Harrison
Ito
Kimura
Kimura
Lobry
Lobry
Marta Dembska
Mirosław R. Dudek
Rzhetsky
Stauffer
Wen-Hsiung Li
Wen-Hsiung Li
Wentian Li
Publication venue: 'Elsevier BV'
Publication date: 23/01/2006
Field of study

We have analysed an effect of the Bak-Sneppen predator-prey food-chain self-organization on nucleotide content of evolving species. In our model, genomes of the species under consideration have been represented by their nucleotide genomic fraction and we have applied two-parameter Kimura model of substitutions to include the changes of the fraction in time. The initial nucleotide fraction and substitution rates were decided with the help of random number generator. Deviation of the genomic nucleotide fraction from its equilibrium value was playing the role of the fitness parameter,

B

, in Bak-Sneppen model. Our finding is, that the higher is the value of the threshold fitness, during the evolution course, the more frequent are large fluctuations in number of species with strongly differentiated nucleotide content; and it is more often the case that the oldest species, which survive the food-chain competition, might have specific nucleotide fraction making possible generating long genesComment: 11 pages including 7 figure

arXiv.org e-Print Archive

Crossref

Genome Snapshot: a new resource at the Saccharomyces Genome Database (SGD) presenting an overview of the Saccharomyces cerevisiae genome

Author: Andrada Rey
Balakrishnan Rama
Binkley Gail
Botstein David
Cherry J. Michael
Christie Karen R.
Costanzo Maria C.
Dolinski Kara
Dong Qing
Dwight Selina S.
Engel Stacia R.
Fisk Dianna G.
Hirschman Jodi E.
Hong Eurie L.
Lane Christopher
Livstone Michael S.
Miyasato Stuart
Nash Robert
Oughtred Rose
Park Julie
Schroeder Mark
Sethuraman Anand
Skrzypek Marek
Starr Barry
Thanawala Mayank K.
Theesfeld Chandra L.
Weng Shuai
Williams Jennifer
Publication venue: Oxford University Press
Publication date: 28/12/2005
Field of study

Sequencing and annotation of the entire Saccharomyces cerevisiae genome has made it possible to gain a genome-wide perspective on yeast genes and gene products. To make this information available on an ongoing basis, the Saccharomyces Genome Database (SGD) () has created the Genome Snapshot (). The Genome Snapshot summarizes the current state of knowledge about the genes and chromosomal features of S.cerevisiae. The information is organized into two categories: (i) number of each type of chromosomal feature annotated in the genome and (ii) number and distribution of genes annotated to Gene Ontology terms. Detailed lists are accessible through SGD's Advanced Search tool (), and all the data presented on this page are available from the SGD ftp site ()

CiteSeerX

Crossref

PubMed Central

Nucleosomes affect local transformation efficiency

Author: Aslankoohi Elham
Marchal Kathleen
Sanchez-Rodriguez Aminael
Sun Hong
van der Zande Elisa
Verstrepen Kevin J
Voordeckers Karin
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/01/2012
Field of study

Genetic transformation is a natural process during which foreign DNA enters a cell and integrates into the genome. Apart from its relevance for horizontal gene transfer in nature, transformation is also the cornerstone of today's recombinant gene technology. Despite its importance, relatively little is known about the factors that determine transformation efficiency. We hypothesize that differences in DNA accessibility associated with nucleosome positioning may affect local transformation efficiency. We investigated the landscape of transformation efficiency at various positions in the Saccharomyces cerevisiae genome and correlated these measurements with nucleosome positioning. We find that transformation efficiency shows a highly significant inverse correlation with relative nucleosome density. This correlation was lost when the nucleosome pattern, but not the underlying sequence was changed. Together, our results demonstrate a novel role for nucleosomes and also allow researchers to predict transformation efficiency of a target region and select spots in the genome that are likely to yield higher transformation efficiency

CiteSeerX

Ghent University Academic Bibliography

PubMed Central

Evidence for abundant transcription of non-coding regions in the Saccharomyces cerevisiae genome

Author: Eisenberg Eli
Havilio Moshe
Kupiec Martin
Lerman Galia
Levanon Erez Y.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2005
Field of study

Background: Recent studies in a growing number of organisms have yielded accumulating evidence that a significant portion of the non-coding region in the genome is transcribed. We address this issue in the yeast Saccharomyces cerevisiae. Results: Taking into account the absence of a significantly large yeast EST database, we use microarray expression data collected for genomic regions erroneously believed to be coding to study the expression pattern of non-coding regions in the Saccharomyces cerevisiae genome. We find that at least 164 out of 589 (28%) such regions are expressed under specific biological conditions. In particular, looking at the probes that are located opposing other known genes at the same genomic locus, we find that 88 out of 341 (26%) of these genes support antisense transcription. The expression patterns of these antisense genes are positively correlated. We validate these results using RT-PCR on a sample of 6 non-coding transcripts. Conclusions: 1. The yeast genome is transcribed on a scale larger than previously assumed. 2. Correlated transcription of antisense genes is abundant in the yeast genome. 3. Antisense genes in yeast are non-coding.Comment: Journal version available at http://www.biomedcentral.com/1471-2164/6/93/abstrac

arXiv.org e-Print Archive

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Reconstructing the History of Yeast Genomes

Author: A Bhutkar
AU Sinha
B Dutrillaux
B Llorente
C Soighe
C Zheng
D Sankoff
David Sankoff
E Tannier
FS Dietrich
Jianzhi Zhang
JL Gordon
KH Wolfe
KP Byrne
M Kellis
N Martin
P Pevzner
WJ Murphy
Publication venue: Public Library of Science
Publication date: 01/05/2009
Field of study

Crossref

Directory of Open Access Journals

PubMed Central

A Re-Annotation of the Saccharomyces Cerevisiae Genome

Author: A Ivens
Altschul
B. Barrell
Bairoch
Bairoch
Bateman
Berbee
Birney
Blandin
DeRisi
Dujon
Gaillardin
Goffeau
Hieter
K. M. Rutherford
Lowe
M-A Rajandream
Mackiewicz
Malpertuy
Mewes
Oliver
Oliver
Pearson
Rutherford
Sharp
Sonnhammer
Stoesser
V. Wood
Xiang
Zhang
Publication venue: Hindawi Publishing Corporation
Publication date: 01/01/2001
Field of study

Discrepancies in gene and orphan number indicated by previous analyses suggest that S. cerevisiae would benefit from a consistent re-annotation. In this analysis three new genes are identified and 46 alterations to gene coordinates are described. 370 ORFs are defined as totally spurious ORFs which should be disregarded. At least a further 193 genes could be described as very hypothetical, based on a number of criteria. It was found that disparate genes with sequence overlaps over ten amino acids (especially at the N-terminus) are rare in both S. cerevisiae and Sz. pombe. A new S. cerevisiae gene number estimate with an upper limit of 5804 is proposed, but after the removal of very hypothetical genes and pseudogenes this is reduced to 5570. Although this is likely to be closer to the true upper limit, it is still predicted to be an overestimate of gene number. A complete list of revised gene coordinates is available from the Sanger Centre (S. cerevisiae reannotation: ftp://ftp/pub/yeast/SCreannotation)

Crossref

Directory of Open Access Journals

PubMed Central