Search CORE

36 research outputs found

Automated generation of gene summaries at the Alliance of Genome Resources

Author: Arnaboldi Valerio
Chan Juancarlos
Dolan Mary E.
Engel Stacia R.
Kishore Ranjana
Nash Robert S.
Shimoyama Mary
Sternberg Paul W.
Urbano Jose M.
Van Slyke Ceri E.
Publication venue: 'Oxford University Press (OUP)'
Publication date: 19/06/2020
Field of study

Short paragraphs that describe gene function, referred to as gene summaries, are valued by users of biological knowledgebases for the ease with which they convey key aspects of gene function. Manual curation of gene summaries, while desirable, is difficult for knowledgebases to sustain. We developed an algorithm that uses curated, structured gene data at the Alliance of Genome Resources (Alliance; www.alliancegenome.org) to automatically generate gene summaries that simulate natural language. The gene data used for this purpose include curated associations (annotations) to ontology terms from the Gene Ontology, Disease Ontology, model organism knowledgebase (MOK)-specific anatomy ontologies and Alliance orthology data. The method uses sentence templates for each data category included in the gene summary in order to build a natural language sentence from the list of terms associated with each gene. To improve readability of the summaries when numerous gene annotations are present, we developed a new algorithm that traverses ontology graphs in order to group terms by their common ancestors. The algorithm optimizes the coverage of the initial set of terms and limits the length of the final summary, using measures of information content of each ontology term as a criterion for inclusion in the summary. The automated gene summaries are generated with each Alliance release, ensuring that they reflect current data at the Alliance. Our method effectively leverages category-specific curation efforts of the Alliance member databases to create modular, structured and standardized gene summaries for seven member species of the Alliance. These automatically generated gene summaries make cross-species gene function comparisons tenable and increase discoverability of potential models of human disease. In addition to being displayed on Alliance gene pages, these summaries are also included on several MOK gene pages

Automated generation of gene summaries at the Alliance of Genome Resources.

Author: Arnaboldi Valerio
Chan Juancarlos
Dolan Mary E
Engel Stacia R
Genome Resources The Alliance Of
Kishore Ranjana
Nash Robert S
Shimoyama Mary
Sternberg Paul W
Urbano Jose M
Van Slyke Ceri E
Publication venue: The Mouseion at the JAXlibrary
Publication date: 01/01/2020
Field of study

The Jackson Laboratory: The Mouseion at the JAXlibrary

Caltech Authors

Fungal BLAST and Model Organism BLASTP Best Hits: new comparison resources at the Saccharomyces Genome Database (SGD)

Author: Balakrishnan Rama
Binkley Gail
Botstein David
Cherry J. Michael
Christie Karen R.
Costanzo Maria C.
Dolinski Kara
Dong Qing
Dwight Selina S.
Engel Stacia R.
Fisk Dianna G.
Hirschman Jodi E.
Hong Eurie L.
Lane Christopher
Nash Robert
Oughtred Rose
Sethuraman Anand
Skrzypek Marek
Theesfeld Chandra L.
Weng Shuai
Publication venue: Oxford University Press
Publication date: 17/12/2004
Field of study

The Saccharomyces Genome Database (SGD; http://www.yeastgenome.org/) is a scientific database of gene, protein and genomic information for the yeast Saccharomyces cerevisiae. SGD has recently developed two new resources that facilitate nucleotide and protein sequence comparisons between S.cerevisiae and other organisms. The Fungal BLAST tool provides directed searches against all fungal nucleotide and protein sequences available from GenBank, divided into categories according to organism, status of completeness and annotation, and source. The Model Organism BLASTP Best Hits resource displays, for each S.cerevisiae protein, the single most similar protein from several model organisms and presents links to the database pages of those proteins, facilitating access to curated information about potential orthologs of yeast proteins

Crossref

PubMed Central

Expanded protein information at SGD: new pages and proteome browser

Author: Balakrishnan Rama
Binkley Gail
Botstein David
Cherry J. Michael
Christie Karen R.
Costanzo Maria C.
Dolinski Kara
Dong Qing
Dwight Selina S.
Engel Stacia R.
Fisk Dianna G.
Hirschman Jodi E.
Hitz Ben
Hong Eurie L.
Lane Christopher
Livstone Michael S.
Miyasato Stuart
Nash Robert
Oughtred Rose
Park Julie
Schroeder Mark
Sethuraman Anand
Skrzypek Marek
Theesfeld Chandra L.
Weng Shuai
Publication venue: Oxford University Press
Publication date: 16/11/2006
Field of study

The recent explosion in protein data generated from both directed small-scale studies and large-scale proteomics efforts has greatly expanded the quantity of available protein information and has prompted the Saccharomyces Genome Database (SGD; ) to enhance the depth and accessibility of protein annotations. In particular, we have expanded ongoing efforts to improve the integration of experimental information and sequence-based predictions and have redesigned the protein information web pages. A key feature of this redesign is the development of a GBrowse-derived interactive Proteome Browser customized to improve the visualization of sequence-based protein information. This Proteome Browser has enabled SGD to unify the display of hidden Markov model (HMM) domains, protein family HMMs, motifs, transmembrane regions, signal peptides, hydropathy plots and profile hits using several popular prediction algorithms. In addition, a physico-chemical properties page has been introduced to provide easy access to basic protein information. Improvements to the layout of the Protein Information page and integration of the Proteome Browser will facilitate the ongoing expansion of sequence-specific experimental information captured in SGD, including post-translational modifications and other user-defined annotations. Finally, SGD continues to improve upon the availability of genetic and physical interaction data in an ongoing collaboration with BioGRID by providing direct access to more than 82 000 manually-curated interactions

Crossref

PubMed Central

Genome Snapshot: a new resource at the Saccharomyces Genome Database (SGD) presenting an overview of the Saccharomyces cerevisiae genome

Author: Andrada Rey
Balakrishnan Rama
Binkley Gail
Botstein David
Cherry J. Michael
Christie Karen R.
Costanzo Maria C.
Dolinski Kara
Dong Qing
Dwight Selina S.
Engel Stacia R.
Fisk Dianna G.
Hirschman Jodi E.
Hong Eurie L.
Lane Christopher
Livstone Michael S.
Miyasato Stuart
Nash Robert
Oughtred Rose
Park Julie
Schroeder Mark
Sethuraman Anand
Skrzypek Marek
Starr Barry
Thanawala Mayank K.
Theesfeld Chandra L.
Weng Shuai
Williams Jennifer
Publication venue: Oxford University Press
Publication date: 28/12/2005
Field of study

Sequencing and annotation of the entire Saccharomyces cerevisiae genome has made it possible to gain a genome-wide perspective on yeast genes and gene products. To make this information available on an ongoing basis, the Saccharomyces Genome Database (SGD) () has created the Genome Snapshot (). The Genome Snapshot summarizes the current state of knowledge about the genes and chromosomal features of S.cerevisiae. The information is organized into two categories: (i) number of each type of chromosomal feature annotated in the genome and (ii) number and distribution of genes annotated to Gene Ontology terms. Detailed lists are accessible through SGD's Advanced Search tool (), and all the data presented on this page are available from the SGD ftp site ()

CiteSeerX

Crossref

PubMed Central

Correction: AGAPE (Automated Genome Analysis PipelinE) for Pan-Genome Analysis of Saccharomyces cerevisiae

Author: A Bergstrom
A Borneman
A Dereeper
A Gibson
A Goffeau
A Nitta
B Cantarel
B Dunn
B Read
Barbara Dunn
Benjamin J. A. Dickins
C Brachmann
C Kumar
D Botstein
D Fisk
E Winzeler
F Lacroute
F Sherman
F Winston
G Liti
G Song
Giltae Song
H Lam
H Li
H Wu
J Argueso
J Cherry
J Fay
J Heck
J Heitman
J Perez-Ortin
J Schacherer
J Simpson
J Simpson
J van Dijken
J Warringer
J. Michael Cherry
Janos Demeter
Joseph Schacherer
JS Robinson
M Novo
O Bedoya-Reina
P Sniegowski
R Atkinson
R Borts
R Dubin
R Mortimer
R Mortimer
R Sikorski
S Altschul
S Bentley
S Doniger
S Engel
S Hunter
S Kane
S Salzberg
S Spingola
Stacia Engel
T Akao
T Asano
T Kobayashi
V Meyrial
W Kent
W Voth
Y Li
Y Zhao
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 17/03/2015
Field of study

The characterization and public release of genome sequences from thousands of organisms is expanding the scope for genetic variation studies. However, understanding the phenotypic consequences of genetic variation remains a challenge in eukaryotes due to the complexity of the genotype-phenotype map. One approach to this is the intensive study of model systems for which diverse sources of information can be accumulated and integrated. Saccharomyces cerevisiae is an extensively studied model organism, with well-known protein functions and thoroughly curated phenotype data. To develop and expand the available resources linking genomic variation with function in yeast, we aim to model the pan-genome of S. cerevisiae. To initiate the yeast pan-genome, we newly sequenced or re-sequenced the genomes of 25 strains that are commonly used in the yeast research community using advanced sequencing technology at high quality. We also developed a pipeline for automated pan-genome analysis, which integrates the steps of assembly, annotation, and variation calling. To assign strain-specific functional annotations, we identified genes that were not present in the reference genome. We classified these according to their presence or absence across strains and characterized each group of genes with known functional and phenotypic features. The functional roles of novel genes not found in the reference genome and associated with strains or groups of strains appear to be consistent with anticipated adaptations in specific lineages. As more S. cerevisiae strain genomes are released, our analysis can be used to collate genome data and relate it to lineage-specific patterns of genome evolution. Our new tool set will enhance our understanding of genomic and functional evolution in S. cerevisiae, and will be available to the yeast genetics and molecular biology community

Public Library of Science (PLOS)

Crossref

Nottingham Trent Institutional Repository (IRep)

Directory of Open Access Journals

PubMed Central

FigShare

Alliance of Genome Resources Portal: unified model organism research platform

Author: Agapite Julie
Albou Laurent-Philippe
Aleksander Suzi
Argasinska Joanna
Arnaboldi Valerio
Attrill Helen
Bello Susan M.
Blake Judith A.
Blodgett Olin
Bradford Yvonne M.
Bult Carol J.
Cain Scott
Calvi Brian R.
Carbon Seth
Chan Juancarlos
Chen Wen J.
Cherry J. Michael
Cho Jaehyoung
Christie Karen R.
Crosby Madeline A.
De Pons Jeff
Dolan Mary E
dos Santos Gilberto
Dunn Barbara
Dunn Nathan
Eagle Anne
Ebert Dustin
Engel Stacia R.
Fashena David
Frazer Ken
Gao Sibyl
Gondwe Felix
Goodman Josh
Gramates L. Sian
Grove Christian A.
Harris Todd
Harrison Marie-Claire
Howe Douglas G.
Howe Kevin L.
Jha Sagar
Kadin James A.
Kalita Patrick
Karra Kalpana
Kaufman Thomas C.
Kishore Ranjana
Laulederkind Stan
Lee Raymond
MacPherson Kevin A.
Marygold Steven J.
Matthews Beverley
Millburn Gillian
Miyasato Stuart
Moxon Sierra
Mueller Hans-Michael
Mungall Christopher
Muruganujan Anushya
Mushayahama Tremayne
Nash Robert S.
Ng Patrick
Paulini Michael
Perrimon Norbert
Pich Christian
Raciti Daniela
Richardson Joel E.
Russell Matthew
Russo Gelbart Susan
Ruzicka Leyla
Schaper Kevin
Shaw David R.
Shimoyama Mary
Shrivatsav Ajay
Simison Matt
Skrzypek Marek
Smith Cynthia
Smith Jennifer R.
Sternberg Paul W.
Tabone Christopher J.
Thomas Paul D.
Thota Jyothi
Tomczuk Monika
Toro Sabrina
Tutaj Marek
Tutaj Monika
Urbano Jose-Maria
Van Auken Kimberly
Van Slyke Ceri E.
Wang Shur-Jen
Weng Shuai
Westerfield Monte
Williams Gary
Wong Edith D.
Wright Adam
Yook Karen
Publication venue: 'Oxford University Press (OUP)'
Publication date: 08/01/2020
Field of study

The Alliance of Genome Resources (Alliance) is a consortium of the major model organism databases and the Gene Ontology that is guided by the vision of facilitating exploration of related genes in human and well-studied model organisms by providing a highly integrated and comprehensive platform that enables researchers to leverage the extensive body of genetic and genomic studies in these organisms. Initiated in 2016, the Alliance is building a central portal (www.alliancegenome.org) for access to data for the primary model organisms along with gene ontology data and human data. All data types represented in the Alliance portal (e.g. genomic data and phenotype descriptions) have common data models and workflows for curation. All data are open and freely available via a variety of mechanisms. Long-term plans for the Alliance project include a focus on coverage of additional model organisms including those without dedicated curation communities, and the inclusion of new data types with a particular focus on providing data and tools for the non-model-organism researcher that support enhanced discovery about human health and disease. Here we review current progress and present immediate plans for this new bioinformatics resource

Alliance of Genome Resources Portal: unified model organism research platform

Author: Agapite Julie
Albou Laurent-Philippe
Aleksander Suzi
Argasinska Joanna
Arnaboldi Valerio
Attrill Helen
Bello Susan M.
Blake Judith A.
Blodgett Olin
Bradford Yvonne M.
Bult Carol J.
Cain Scott
Calvi Brian R.
Carbon Seth
Chan Juancarlos
Chen Wen J.
Cherry J. Michael
Cho Jaehyoung
Christie Karen R.
Crosby Madeline A.
De Pons Jeff
Dolan Mary E
dos Santos Gilberto
Dunn Barbara
Dunn Nathan
Eagle Anne
Ebert Dustin
Engel Stacia R.
Fashena David
Frazer Ken
Gao Sibyl
Gondwe Felix
Goodman Josh
Gramates L. Sian
Grove Christian A.
Harris Todd
Harrison Marie-Claire
Howe Douglas G.
Howe Kevin L.
Jha Sagar
Kadin James A.
Kalita Patrick
Karra Kalpana
Kaufman Thomas C.
Kishore Ranjana
Laulederkind Stan
Lee Raymond
MacPherson Kevin A.
Marygold Steven J.
Matthews Beverley
Millburn Gillian
Miyasato Stuart
Moxon Sierra
Mueller Hans-Michael
Mungall Christopher
Muruganujan Anushya
Mushayahama Tremayne
Nash Robert S.
Ng Patrick
Paulini Michael
Perrimon Norbert
Pich Christian
Raciti Daniela
Richardson Joel E.
Russell Matthew
Russo Gelbart Susan
Ruzicka Leyla
Schaper Kevin
Shaw David R.
Shimoyama Mary
Shrivatsav Ajay
Simison Matt
Skrzypek Marek
Smith Cynthia
Smith Jennifer R.
Sternberg Paul W.
Tabone Christopher J.
Thomas Paul D.
Thota Jyothi
Tomczuk Monika
Toro Sabrina
Tutaj Marek
Tutaj Monika
Urbano Jose-Maria
Van Auken Kimberly
Van Slyke Ceri E.
Wang Shur-Jen
Weng Shuai
Westerfield Monte
Williams Gary
Wong Edith D.
Wright Adam
Yook Karen
Publication venue: 'Oxford University Press (OUP)'
Publication date: 08/01/2020
Field of study

Caltech Authors