AmiGO: online access to ontology and annotation data

Author: Altschul
Amelia Ireland
Boyle
Brad Marshall
Christopher J. Mungall
Gene Ontology Consortium
Seth Carbon
ShengQiang Shu
Suzanna Lewis
Publication venue: Oxford University Press
Publication date: 15/01/2009

AmiGO is a web application that allows users to query, browse and visualize ontologies and related gene product annotation (association) data. AmiGO can be used online at the Gene Ontology (GO) website to access the data provided by the GO Consortium; it can also be downloaded and installed to browse local ontologies and annotations. AmiGO is free open source software developed and maintained by the GO Consortium.

Crossref

PubMed Central

UNT Digital Library

Term Matrix: a novel Gene Ontology annotation quality control system based on ontology term co-annotation patterns.

Author: Attrill Helen
Carbon Seth
Engel Stacia R
Feuermann Marc
Gaudet Pascale
Harris Midori A
Hill David P
Lock Antonia
Lovering Ruth C
Mungall Christopher J
Poux Sylvain
Rutherford Kim M
Van Auken Kimberly
Wood Valerie
Publication venue: The Mouseion at the JAXlibrary
Publication date: 01/09/2020

Biological processes are accomplished by the coordinated action of gene products. Gene products often participate in multiple processes, and can therefore be annotated to multiple Gene Ontology (GO) terms. Nevertheless, processes that are functionally, temporally and/or spatially distant may have few gene products in common, and co-annotation to unrelated processes probably reflects errors in literature curation, ontology structure or automated annotation pipelines. We have developed an annotation quality control workflow that uses rules based on mutually exclusive processes to detect annotation errors, based on and validated by case studies including the three we present here: fission yeast protein-coding gene annotations over time; annotations for cohesin complex subunits in human and model species; and annotations using a selected set of GO biological process terms in human and five model species. For each case study, we reviewed available GO annotations, identified pairs of biological processes which are unlikely to be correctly co-annotated to the same gene products (e.g. amino acid metabolism and cytokinesis), and traced erroneous annotations to their sources. To date we have generated 107 quality control rules, and corrected 289 manual annotations in eukaryotes and over 52 700 automatically propagated annotations across all taxa

The Jackson Laboratory: The Mouseion at the JAXlibrary

KG-COVID-19: A Framework to Produce Customized Knowledge Graphs for COVID-19 Response.

Author: Balhoff James P
Blau Hannah
Callahan Tiffany J
Cappelletti Luca
Carbon Seth
Fontana Tommaso
Good Benjamin M
Haendel Melissa A
Harris Nomi L
Joachimiak Marcin P
Matentzoglu Nicolas
Mungall Christopher J
Munoz-Torres Monica C
Ravanmehr Vida
Reese Justin T
Robinson Peter N
Shefchek Kent A
Unni Deepak
Publication venue: The Mouseion at the JAXlibrary
Publication date: 01/01/2021

Integrated, up-to-date data about SARS-CoV-2 and COVID-19 is crucial for the ongoing response to the COVID-19 pandemic by the biomedical research community. While rich biological knowledge exists for SARS-CoV-2 and related viruses (SARS-CoV, MERS-CoV), integrating this knowledge is difficult and time-consuming, since much of it is in siloed databases or in textual format. Furthermore, the data required by the research community vary drastically for different tasks; the optimal data for a machine learning task, for example, is much different from the data used to populate a browsable user interface for clinicians. To address these challenges, we created KG-COVID-19, a flexible framework that ingests and integrates heterogeneous biomedical data to produce knowledge graphs (KGs), and applied it to create a KG for COVID-19 response. This KG framework also can be applied to other problems in which siloed biomedical data must be quickly integrated for different research applications, including future pandemics

The Jackson Laboratory: The Mouseion at the JAXlibrary

eScholarship - University of California

A method for increasing expressivity of Gene Ontology annotations using a compositional approach.

Author: Alam-Faruque Yasmin
Blake Judith A
Carbon Seth
Dietze Heiko
Dimmer Emily C
Foulger Rebecca E
Harris Midori A
Hill David P
Huntley Rachael P
Khodiyar Varsha K
Lock Antonia
Lomax Jane
Lovering Ruth C
Mungall Christopher J
Mutowo-Meullenet Prudence
Sawford Tony
Van Auken Kimberly
Wood Valerie
Publication venue: BMC Bioinformatics
Publication date: 21/05/2014

BACKGROUND: The Gene Ontology project integrates data about the function of gene products across a diverse range of organisms, allowing the transfer of knowledge from model organisms to humans, and enabling computational analyses for interpretation of high-throughput experimental and clinical data. The core data structure is the annotation, an association between a gene product and a term from one of the three ontologies comprising the GO. Historically, it has not been possible to provide additional information about the context of a GO term, such as the target gene or the location of a molecular function. This has limited the specificity of knowledge that can be expressed by GO annotations. RESULTS: The GO Consortium has introduced annotation extensions that enable manually curated GO annotations to capture additional contextual details. Extensions represent effector-target relationships such as localization dependencies, substrates of protein modifiers and regulation targets of signaling pathways and transcription factors as well as spatial and temporal aspects of processes such as cell or tissue type or developmental stage. We describe the content and structure of annotation extensions, provide examples, and summarize the current usage of annotation extensions. CONCLUSIONS: The additional contextual information captured by annotation extensions improves the utility of functional annotation by representing dependencies between annotations to terms in the different ontologies of GO, external ontologies, or an organism's gene products. These enhanced annotations can also support sophisticated queries and reasoning, and will provide curated, directional links between many gene products to support pathway and network reconstruction

Crossref

The Jackson Laboratory: The Mouseion at the JAXlibrary

Springer - Publisher Connector

KG-Hub - building and exchanging biological knowledge graphs.

Author: Balhoff Jim
Bruskiewich Richard M
Callahan Tiffany J
Cappelletti Luca
Carbon Seth
Caufield J Harry
Chan Lauren E
Cortes Katherina
Elsarboukh Glass
Fontana Tommaso
Haendel Melissa A
Harris Nomi L
Hegde Harshad
Joachimiak Marcin P
Matentzoglu Nicolas
Moxon Sierra A T
Mungall Christopher J
Munoz-Torres Monica C
Putman Tim
Ravanmehr Vida
Reese Justin T
Robinson Peter N
Schaper Kevin
Shefchek Kent A
Thessen Anne E
Unni Deepak R
Publication venue: The Mouseion at the JAXlibrary
Publication date: 01/07/2023

MOTIVATION: Knowledge graphs (KGs) are a powerful approach for integrating heterogeneous data and making inferences in biology and many other domains, but a coherent solution for constructing, exchanging, and facilitating the downstream use of KGs is lacking. RESULTS: Here we present KG-Hub, a platform that enables standardized construction, exchange, and reuse of KGs. Features include a simple, modular extract-transform-load pattern for producing graphs compliant with Biolink Model (a high-level data model for standardizing biological data), easy integration of any OBO (Open Biological and Biomedical Ontologies) ontology, cached downloads of upstream data sources, versioned and automatically updated builds with stable URLs, web-browsable storage of KG artifacts on cloud infrastructure, and easy reuse of transformed subgraphs across projects. Current KG-Hub projects span use cases including COVID-19 research, drug repurposing, microbial-environmental interactions, and rare disease research. KG-Hub is equipped with tooling to easily analyze and manipulate KGs. KG-Hub is also tightly integrated with graph machine learning (ML) tools which allow automated graph ML, including node embeddings and training of models for link prediction and node classification. AVAILABILITY AND IMPLEMENTATION: https://kghub.org

The Jackson Laboratory: The Mouseion at the JAXlibrary

The Monarch Initiative in 2024: an analytic platform integrating phenotypes, genes and diseases across species.

Author: Alquaddoomi Faisal S
Braun Ian
Bruskiewich Richard M
Cappelletti Luca
Carbon Seth
Caron Anita R
Caufield J Harry
Chan Lauren E
Chute Christopher G
Cortes Katherina G
Cox Corey
De Souza Vinícius
Elsarboukh Glass
Fontana Tommaso
Gehrke Sarah
Haendel Melissa A
Harris Nomi L
Hartley Emily L
Hegde Harshad
Hurwitz Eric
Jacobsen Julius O B
Krishnamurthy Madan
Laraway Bryan J
Matentzoglu Nicolas
McLaughlin James A
McMurry Julie A
Moxon Sierra A T
Mullen Kathleen R
Mungall Christopher J
Munoz-Torres Monica C
O'Neil Shawn T
Osumi-Sutherland David
Putman Tim E
Reese Justin T
Robinson Peter N
Rubinetti Vincent P
Schaper Kevin
Shefchek Kent A
Smedley Damian
Stefancsik Ray
Toro Sabrina
Vasilevsky Nicole A
Walls Ramona L
Whetzel Patricia L
Publication venue: The Mouseion at the JAXlibrary
Publication date: 05/01/2024

Bridging the gap between genetic variations, environmental determinants, and phenotypic outcomes is critical for supporting clinical diagnosis and understanding mechanisms of diseases. It requires integrating open data at a global scale. The Monarch Initiative advances these goals by developing open ontologies, semantic data models, and knowledge graphs for translational research. The Monarch App is an integrated platform combining data about genes, phenotypes, and diseases across species. Monarch's APIs enable access to carefully curated datasets and advanced analysis tools that support the understanding and diagnosis of disease for diverse applications such as variant prioritization, deep phenotyping, and patient profile-matching. We have migrated our system into a scalable, cloud-based infrastructure; simplified Monarch's data ingestion and knowledge graph integration systems; enhanced data mapping and integration standards; and developed a new user interface with novel search and graph navigation features. Furthermore, we advanced Monarch's analytic tools by developing a customized plugin for OpenAI's ChatGPT to increase the reliability of its responses about phenotypic data, allowing us to interrogate the knowledge in the Monarch graph using state-of-the-art Large Language Models. The resources of the Monarch Initiative can be found at monarchinitiative.org and its corresponding code repository at github.com/monarch-initiative/monarch-app.

The Jackson Laboratory: The Mouseion at the JAXlibrary

OBO Foundry in 2021: Operationalizing Open Data Principles to Evaluate Ontologies

Biological ontologies are used to organize, curate, and interpret the vast quantities of data arising from biological experiments. While this works well when using a single ontology, integrating multiple ontologies can be problematic, as they are developed independently, which can lead to incompatibilities. The Open Biological and Biomedical Ontologies Foundry was created to address this by facilitating the development, harmonization, application, and sharing of ontologies, guided by a set of overarching principles. One challenge in reaching these goals was that the OBO principles were not originally encoded in a precise fashion, and interpretation was subjective. Here we show how we have addressed this by formally encoding the OBO principles as operational rules and implementing a suite of automated validation checks and a dashboard for objectively evaluating each ontology’s compliance with each principle. This entailed a substantial effort to curate metadata across all ontologies and to coordinate with individual stakeholders. We have applied these checks across the full OBO suite of ontologies, revealing areas where individual ontologies require changes to conform to our principles. Our work demonstrates how a sizable federated community can be organized and evaluated on objective criteria that help improve overall quality and interoperability, which is vital for the sustenance of the OBO project and towards the overall goals of making data FAIR. Competing Interest Statement: The authors have declared no competing interest.

PhilPapers

Electronic Publication Information Center

eScholarship - University of California

The Monarch Initiative in 2019: an integrative data and analytic platform connecting phenotypes to genotypes across species.

Author: Babb Larry
Balhoff James P
Bello Susan M
Blau Hannah
Bradford Yvonne
Brush Matthew
Carbon Seth
Carmody Leigh
Chan Lauren E
Cipriani Valentina
Conlin Tom
Cuzick Alayne
Dunn Nathan
Essaid Shahim
Fey Petra
Gargano Michael
Gourdine Jean-Phillipe
Grove Chris
Groza Tudor
Haendel Melissa A
Hamosh Ada
Harris Midori
Harris Nomi L
Helbig Ingo
Hoatlin Maureen
Jacobsen Julius O B
Joachimiak Marcin
Jupp Simon
Keith Daniel
Köhler Sebastian
Lett Kenneth B
Lewis Suzanna E
Matentzoglu Nicolas
McMurry Julie A
McNamara Craig
Mungall Christopher J
Munoz-Torres Monica C
Osumi-Sutherland David
Pendlington Zoë M
Pilgrim Clare
Putman Tim
Ravanmehr Vida
Reese Justin
Riggs Erin
Robb Sofia
Robinson Peter N
Rocca Maria D
Roncaglia Paola
Seager James
Segerdell Erik
Shefchek Kent A
Similuk Morgan
Smedley Damian
Storm Andrea L
Thaxon Courtney
Thessen Anne
Unni Deepak
Vasilevsky Nicole
Zhang Xingmin Aaron
Publication venue: The Mouseion at the JAXlibrary
Publication date: 01/01/2020

In biology and biomedicine, relating phenotypic outcomes with genetic variation and environmental factors remains a challenge: patient phenotypes may not match known diseases, candidate variants may be in genes that haven't been characterized, research organisms may not recapitulate human or veterinary diseases, environmental factors affecting disease outcomes are unknown or undocumented, and many resources must be queried to find potentially significant phenotypic associations. The Monarch Initiative (https://monarchinitiative.org) integrates information on genes, variants, genotypes, phenotypes and diseases in a variety of species, and allows powerful ontology-based search. We develop many widely adopted ontologies that together enable sophisticated computational analysis, mechanistic discovery and diagnostics of Mendelian diseases. Our algorithms and tools are widely used to identify animal models of human disease through phenotypic similarity, for differential diagnostics and to facilitate translational research. Launched in 2015, Monarch has grown with regard to data (new organisms, more sources, better modeling); new API and standards; ontologies (new Mondo unified disease ontology, improvements to ontologies such as HPO and uPheno); user interface (a redesigned website); and community development. Monarch data, algorithms and tools are being used and extended by resources such as GA4GH and NCATS Translator, among others, to aid mechanistic discovery and diagnostics.

The Jackson Laboratory: The Mouseion at the JAXlibrary

eScholarship - University of California

The Gene Ontology's Reference Genome Project: A Unified Framework for Functional Annotation across Species

Author: Ashburner Michael
Basu Siddhartha
Berardini Tanya
Blake Judith A.
Carbon Seth
Cherry J. Michael
Chisholm Rex
Dimmer Emily
Dolan Mary
Dolinski Kara
Engel Stacia R.
Fey Petra
Gaudet Pascale
Hill David P.
Howe Doug
Hu James C.
Huntley Rachael
Khodiyar Varsha K.
Kishore Ranjana
Lewis Suzanna E.
Li Donghui
Lovering Ruth C.
McCarthy Fiona
Mungall Christopher J.
Ni Li
Petri Victoria
Siegele Deborah A.
Thomas Paul
Tweedie Susan
Van Auken Kimberly
Wood Valerie
Publication venue: Public Library of Science (PLoS)
Publication date: 03/07/2009

The Gene Ontology (GO) is a collaborative effort that provides structured vocabularies for annotating the molecular function, biological role, and cellular location of gene products in a highly systematic way and in a species-neutral manner with the aim of unifying the representation of gene function across different organisms. Each contributing member of the GO Consortium independently associates GO terms to gene products from the organism(s) they are annotating. Here we introduce the Reference Genome project, which brings together those independent efforts into a unified framework based on the evolutionary relationships between genes in these different organisms. The Reference Genome project has two primary goals: to increase the depth and breadth of annotations for genes in each of the organisms in the project, and to create data sets and tools that enable other genome annotation efforts to infer GO annotations for homologous genes in their organisms. In addition, the project has several important incidental benefits, such as increasing annotation consistency across genome databases, and providing important improvements to the GO's logical structure and biological content

Alliance of Genome Resources Portal: unified model organism research platform

Author: Agapite Julie
Albou Laurent-Philippe
Aleksander Suzi
Argasinska Joanna
Arnaboldi Valerio
Attrill Helen
Bello Susan M.
Blake Judith A.
Blodgett Olin
Bradford Yvonne M.
Bult Carol J.
Cain Scott
Calvi Brian R.
Carbon Seth
Chan Juancarlos
Chen Wen J.
Cherry J. Michael
Cho Jaehyoung
Christie Karen R.
Crosby Madeline A.
De Pons Jeff
Dolan Mary E
dos Santos Gilberto
Dunn Barbara
Dunn Nathan
Eagle Anne
Ebert Dustin
Engel Stacia R.
Fashena David
Frazer Ken
Gao Sibyl
Gondwe Felix
Goodman Josh
Gramates L. Sian
Grove Christian A.
Harris Todd
Harrison Marie-Claire
Howe Douglas G.
Howe Kevin L.
Jha Sagar
Kadin James A.
Kalita Patrick
Karra Kalpana
Kaufman Thomas C.
Kishore Ranjana
Laulederkind Stan
Lee Raymond
MacPherson Kevin A.
Marygold Steven J.
Matthews Beverley
Millburn Gillian
Miyasato Stuart
Moxon Sierra
Mueller Hans-Michael
Mungall Christopher
Muruganujan Anushya
Mushayahama Tremayne
Nash Robert S.
Ng Patrick
Paulini Michael
Perrimon Norbert
Pich Christian
Raciti Daniela
Richardson Joel E.
Russell Matthew
Russo Gelbart Susan
Ruzicka Leyla
Schaper Kevin
Shaw David R.
Shimoyama Mary
Shrivatsav Ajay
Simison Matt
Skrzypek Marek
Smith Cynthia
Smith Jennifer R.
Sternberg Paul W.
Tabone Christopher J.
Thomas Paul D.
Thota Jyothi
Tomczuk Monika
Toro Sabrina
Tutaj Marek
Tutaj Monika
Urbano Jose-Maria
Van Auken Kimberly
Van Slyke Ceri E.
Wang Shur-Jen
Weng Shuai
Westerfield Monte
Williams Gary
Wong Edith D.
Wright Adam
Yook Karen
Publication venue: Oxford University Press (OUP)
Publication date: 08/01/2020

The Alliance of Genome Resources (Alliance) is a consortium of the major model organism databases and the Gene Ontology that is guided by the vision of facilitating exploration of related genes in human and well-studied model organisms by providing a highly integrated and comprehensive platform that enables researchers to leverage the extensive body of genetic and genomic studies in these organisms. Initiated in 2016, the Alliance is building a central portal (www.alliancegenome.org) for access to data for the primary model organisms along with gene ontology data and human data. All data types represented in the Alliance portal (e.g. genomic data and phenotype descriptions) have common data models and workflows for curation. All data are open and freely available via a variety of mechanisms. Long-term plans for the Alliance project include a focus on coverage of additional model organisms including those without dedicated curation communities, and the inclusion of new data types with a particular focus on providing data and tools for the non-model-organism researcher that support enhanced discovery about human health and disease. Here we review current progress and present immediate plans for this new bioinformatics resource