Search CORE

680 research outputs found

Fungal BLAST and Model Organism BLASTP Best Hits: new comparison resources at the Saccharomyces Genome Database (SGD)

Author: Balakrishnan Rama
Binkley Gail
Botstein David
Cherry J. Michael
Christie Karen R.
Costanzo Maria C.
Dolinski Kara
Dong Qing
Dwight Selina S.
Engel Stacia R.
Fisk Dianna G.
Hirschman Jodi E.
Hong Eurie L.
Lane Christopher
Nash Robert
Oughtred Rose
Sethuraman Anand
Skrzypek Marek
Theesfeld Chandra L.
Weng Shuai
Publication venue: Oxford University Press
Publication date: 17/12/2004
Field of study

The Saccharomyces Genome Database (SGD; http://www.yeastgenome.org/) is a scientific database of gene, protein and genomic information for the yeast Saccharomyces cerevisiae. SGD has recently developed two new resources that facilitate nucleotide and protein sequence comparisons between S.cerevisiae and other organisms. The Fungal BLAST tool provides directed searches against all fungal nucleotide and protein sequences available from GenBank, divided into categories according to organism, status of completeness and annotation, and source. The Model Organism BLASTP Best Hits resource displays, for each S.cerevisiae protein, the single most similar protein from several model organisms and presents links to the database pages of those proteins, facilitating access to curated information about potential orthologs of yeast proteins

Crossref

PubMed Central

YeastMine--an integrated data warehouse for Saccharomyces cerevisiae data as a multipurpose tool-kit.

Author: Balakrishnan Rama
Binkley Gail
Cherry J Michael
Hitz Benjamin C
Hong Eurie L
Karra Kalpana
Micklem Gos
Park Julie
Sullivan Julie
Publication venue: Database (Oxford)
Publication date: 01/01/2012
Field of study

The Saccharomyces Genome Database (SGD; http://www.yeastgenome.org/) provides high-quality curated genomic, genetic, and molecular information on the genes and their products of the budding yeast Saccharomyces cerevisiae. To accommodate the increasingly complex, diverse needs of researchers for searching and comparing data, SGD has implemented InterMine (http://www.InterMine.org), an open source data warehouse system with a sophisticated querying interface, to create YeastMine (http://yeastmine.yeastgenome.org). YeastMine is a multifaceted search and retrieval environment that provides access to diverse data types. Searches can be initiated with a list of genes, a list of Gene Ontology terms, or lists of many other data types. The results from queries can be combined for further analysis and saved or downloaded in customizable file formats. Queries themselves can be customized by modifying predefined templates or by creating a new template to access a combination of specific data types. YeastMine offers multiple scenarios in which it can be used such as a powerful search interface, a discovery tool, a curation aid and also a complex database presentation format. DATABASE URL: http://yeastmine.yeastgenome.org

PubMed Central

Apollo (Cambridge)

Sequence resources at the Candida Genome Database

Author: Arnaud Martha B.
Binkley Gail
Costanzo Maria C.
Lane Christopher
Miyasato Stuart R.
Shah Prachi
Sherlock Gavin
Skrzypek Marek S.
Publication venue: Oxford University Press
Publication date: 07/11/2006
Field of study

The Candida Genome Database (CGD, ) contains a curated collection of genomic information and community resources for researchers who are interested in the molecular biology of the opportunistic pathogen Candida albicans. With the recent release of a new assembly of the C.albicans genome, Assembly 20, C.albicans genomics has entered a new era. Although the C.albicans genome assembly continues to undergo refinement, multiple assemblies and gene nomenclatures will remain in widespread use by the research community. CGD has now taken on the responsibility of maintaining the most up-to-date version of the genome sequence by providing the data from this new assembly alongside the data from the previous assemblies, as well as any future corrections and refinements. In this database update, we describe the sequence information available for C.albicans, the sequence information contained in CGD, and the tools for sequence retrieval, analysis and comparison that CGD provides. CGD is freely accessible at and CGD curators may be contacted by email at [email protected]

CiteSeerX

Crossref

PubMed Central

Automation of gene assignments to metabolic pathways using high-throughput expression data

Author: Popescu Liviu
Yona Golan
Publication venue: BioMed Central
Publication date: 01/01/2005
Field of study

BACKGROUND: Accurate assignment of genes to pathways is essential in order to understand the functional role of genes and to map the existing pathways in a given genome. Existing algorithms predict pathways by extrapolating experimental data in one organism to other organisms for which this data is not available. However, current systems classify all genes that belong to a specific EC family to all the pathways that contain the corresponding enzymatic reaction, and thus introduce ambiguity. RESULTS: Here we describe an algorithm for assignment of genes to cellular pathways that addresses this problem by selectively assigning specific genes to pathways. Our algorithm uses the set of experimentally elucidated metabolic pathways from MetaCyc, together with statistical models of enzyme families and expression data to assign genes to enzyme families and pathways by optimizing correlated co-expression, while minimizing conflicts due to shared assignments among pathways. Our algorithm also identifies alternative ("backup") genes and addresses the multi-domain nature of proteins. We apply our model to assign genes to pathways in the Yeast genome and compare the results for genes that were assigned experimentally. Our assignments are consistent with the experimentally verified assignments and reflect characteristic properties of cellular pathways. CONCLUSION: We present an algorithm for automatic assignment of genes to metabolic pathways. The algorithm utilizes expression data and reduces the ambiguity that characterizes assignments that are based only on EC numbers

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Visualizing syntenic relationships among the hemiascomycetes with the Yeast Gene Order Browser

Author: Byrne Kevin P.
Wolfe Kenneth H.
Publication venue: Oxford University Press
Publication date: 28/12/2005
Field of study

The Yeast Gene Order Browser (YGOB) is an online tool designed to facilitate the comparative genomic visualization and appraisal of synteny within and between the genomes of seven hemiascomycete yeast species. Three of these genomes are polyploid, and hence contain intra-genomic syntenic regions, the correct assembly of which is a particular success of YGOB. Designed to accurately assemble, display and score gene order relationships, YGOB is both an interactive tool for browsing genomic data, and a software engine now being used for evolutionary analyses on a whole-genome scale. Underlying the online interface is the YGOB database, which consists of homology assignments across the species, extensively curated based on sequence similarity and novelly, an appraisal of genomic context (synteny) in multiple genomes. Currently the YGOB database incorporates genome data from Saccharomyces cerevisiae, Candida glabrata, Saccharomyces castellii, Ashbya gossypii, Kluyveromyces lactis, Kluyveromyces waltii and Saccharomyces kluyveri, but the system is scaleable to accommodate additional genomes. This paper discusses the usage and utility of version 1.0 of YGOB, which is publicly available at

CiteSeerX

Crossref

PubMed Central

dictyBase, the model organism database for Dictyostelium discoideum

Author: Chisholm Rex L.
Fey Petra
Gaudet Pascale
Just Eric M.
Kibbe Warren A.
Merchant Sohel N.
Pilcher Karen E.
Publication venue: Oxford University Press
Publication date: 01/01/2005
Field of study

dictyBase () is the model organism database (MOD) for the social amoeba Dictyostelium discoideum. The unique biology and phylogenetic position of Dictyostelium offer a great opportunity to gain knowledge of processes not characterized in other organisms. The recent completion of the 34 MB genome sequence, together with the sizable scientific literature using Dictyostelium as a research organism, provided the necessary tools to create a well-annotated genome. dictyBase has leveraged software developed by the Saccharomyces Genome Database and the Generic Model Organism Database project. This has reduced the time required to develop a full-featured MOD and greatly facilitated our ability to focus on annotation and providing new functionality. We hope that manual curation of the Dictyostelium genome will facilitate the annotation of other genomes

CiteSeerX

Crossref

PubMed Central

CandidaDB: a genome database for Candida albicans pathogenomics

Author: Albrecht Antje
Bader O.
Brown A. J. P.
Castillo L.
d'Enfert C.
de Groot P.
Dominguez A.
Ernst J. F.
Fradin C.
Frangeul L.
Gaillardin C.
Garcia-Sanchez S.
Goyard S.
Hube B.
Jones L.
Klis F. M.
Krishnamurthy S.
Kunze D.
Lopez M.-C.
Martin J. Perez
Martin N.
Mavor A.
Moszer I.
Onésime D.
Rodriguez-Arnaveilhe S.
Sentandreu R.
Tekaia F.
Valentin E.
Publication venue: Oxford University Press
Publication date: 17/12/2004
Field of study

CandidaDB is a database dedicated to the genome of the most prevalent systemic fungal pathogen of humans, Candida albicans. CandidaDB is based on an annotation of the Stanford Genome Technology Center C.albicans genome sequence data by the European Galar Fungail Consortium. CandidaDB Release 2.0 (June 2004) contains information pertaining to Assembly 19 of the genome of C.albicans strain SC5314. The current release contains 6244 annotated entries corresponding to 130 tRNA genes and 5917 protein-coding genes. For these, it provides tentative functional assignments along with numerous pre-run analyses that can assist the researcher in the evaluation of gene function for the purpose of specific or large-scale analysis. CandidaDB is based on GenoList, a generic relational data schema and a World Wide Web interface that has been adapted to the handling of eukaryotic genomes. The interface allows users to browse easily through genome data and retrieve information. CandidaDB also provides more elaborate tools, such as pattern searching, that are tightly connected to the overall browsing system. As the C.albicans genome is diploid and still incompletely assembled, CandidaDB provides tools to browse the genome by individual supercontigs and to examine information about allelic sequences obtained from complementary contigs. CandidaDB is accessible at http://genolist.pasteur.fr/CandidaDB

Aberdeen University Research

International Migration, Integration and Social Cohesion online publications

Hal-Diderot

Recommended from our members

The functional network in predictive biology : predicting phenotype from genotype and predicting human disease from fungal phenotype

Author: McGary Kriston Lyle
Publication venue
Publication date: 01/12/2008
Field of study

textThe ability to predict is one of the hallmarks of successful theories. Historically, the predictive power of biology has lagged behind disciplines like physics because the biological world is complex, challenging to quantify, and full of exceptions. However, in recent years the amount of available data has expanded exponentially and biological predictions based on this data become a possibility. The functional gene network is a quantitative way to integrate this data and a useful framework for making biological predictions. This study demonstrates that functional networks capture real biological insight and uses the network to predict both subcellular protein localization and the phenotypic outcome of gene knockouts. Furthermore, I use the functional network to evaluate genetic modules shared between diverse organisms that lead to orthologous phenotypes, many that are non-obvious. I show that the successful predictions of the functional network have broad applicability and implications that range from the design of large-scale biological experiments to the discovery of genes with potential roles in human disease.Institute for Cellular and Molecular Biolog

Texas ScholarWorks

Evolutionary plasticity determination by orthologous groups distribution

Author: Castro Mauro AA
Dalmolin Rodrigo JS
de Almeida Rita MC
Moreira José CF
Rybarczyk Filho José L
Souza Luis HT
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Abstract Background Genetic plasticity may be understood as the ability of a functional gene network to tolerate alterations in its components or structure. Usually, the studies involving gene modifications in the course of the evolution are concerned to nucleotide sequence alterations in closely related species. However, the analysis of large scale data about the distribution of gene families in non-exclusively closely related species can provide insights on how plastic or how conserved a given gene family is. Here, we analyze the abundance and diversity of all Eukaryotic Clusters of Orthologous Groups (KOG) present in STRING database, resulting in a total of 4,850 KOGs. This dataset comprises 481,421 proteins distributed among 55 eukaryotes. Results We propose an index to evaluate the evolutionary plasticity and conservation of an orthologous group based on its abundance and diversity across eukaryotes. To further KOG plasticity analysis, we estimate the evolutionary distance average among all proteins which take part in the same orthologous group. As a result, we found a strong correlation between the evolutionary distance average and the proposed evolutionary plasticity index. Additionally, we found low evolutionary plasticity in <it>Saccharomyces cerevisiae </it>genes associated with inviability and <it>Mus musculus </it>genes associated with early lethality. At last, we plot the evolutionary plasticity value in different gene networks from yeast and humans. As a result, it was possible to discriminate among higher and lower plastic areas of the gene networks analyzed. Conclusions The distribution of gene families brings valuable information on evolutionary plasticity which might be related with genetic plasticity. Accordingly, it is possible to discriminate among conserved and plastic orthologous groups by evaluating their abundance and diversity across eukaryotes. Reviewers This article was reviewed by Prof Manyuan Long, Hiroyuki Toh, and Sebastien Halary.</p

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Crossref

Springer - Publisher Connector

Lume 5.8

Directory of Open Access Journals

PubMed Central

RCAAP - Repositório Científico de Acesso Aberto de Portugal

A MOD(ern) perspective on literature curation

Author: Berardini Tanya Z.
Drabkin Harold J.
Hirschman Jodi
Howe Doug
Publication venue: Springer-Verlag
Publication date: 01/01/2010
Field of study

Curation of biological data is a multi-faceted task whose goal is to create a structured, comprehensive, integrated, and accurate resource of current biological knowledge. These structured data facilitate the work of the scientific community by providing knowledge about genes or genomes and by generating validated connections between the data that yield new information and stimulate new research approaches. For the model organism databases (MODs), an important source of data is research publications. Every published paper containing experimental information about a particular model organism is a candidate for curation. All such papers are examined carefully by curators for relevant information. Here, four curators from different MODs describe the literature curation process and highlight approaches taken by the four MODs to address: (1) the decision process by which papers are selected, and (2) the identification and prioritization of the data contained in the paper. We will highlight some of the challenges that MOD biocurators face, and point to ways in which researchers and publishers can support the work of biocurators and the value of such support

The Jackson Laboratory: The Mouseion at the JAXlibrary

Springer - Publisher Connector

PubMed Central