GA4GH Phenopackets: A Practical Introduction.
The Global Alliance for Genomics and Health (GA4GH) is developing a suite of coordinated standards for genomics for healthcare. The Phenopacket is a new GA4GH standard for sharing disease and phenotype information that characterizes an individual person, linking that individual to detailed phenotypic descriptions, genetic information, diagnoses, and treatments. A detailed example is presented that illustrates how to use the schema to represent the clinical course of a patient with retinoblastoma, including demographic information, the clinical diagnosis, phenotypic features and clinical measurements, an examination of the extirpated tumor, therapies, and the results of genomic analysis. The Phenopacket Schema, together with other GA4GH data and technical standards, will enable data exchange and provide a foundation for the computational analysis of disease and phenotype information to improve our ability to diagnose and conduct research on all types of disorders, including cancer and rare diseases.
The Human Phenotype Ontology in 2024: phenotypes around the world.
The Human Phenotype Ontology (HPO) is a widely used resource that comprehensively organizes and defines the phenotypic features of human disease, enabling computational inference and supporting genomic and phenotypic analyses through semantic similarity and machine learning algorithms. The HPO has widespread applications in clinical diagnostics and translational research, including genomic diagnostics, gene-disease discovery, and cohort analytics. In recent years, groups around the world have developed translations of the HPO from English to other languages, and the HPO browser has been internationalized, allowing users to view HPO term labels and in many cases synonyms and definitions in ten languages in addition to English. Since our last report, a total of 2239 new HPO terms and 49235 new HPO annotations were developed, many in collaboration with external groups in the fields of psychiatry, arthrogryposis, immunology and cardiology. The Medical Action Ontology (MAxO) is a new effort to model treatments and other measures taken for clinical management. Finally, the HPO consortium is contributing to efforts to integrate the HPO and the GA4GH Phenopacket Schema into electronic health records (EHRs) with the goal of more standardized and computable integration of rare disease data in EHRs.
A Genome-Wide Association Study of Diabetic Kidney Disease in Subjects With Type 2 Diabetes
Identification of sequence variants robustly associated with predisposition to diabetic kidney disease (DKD) has the potential to provide insights into the pathophysiological mechanisms responsible. We conducted a genome-wide association study (GWAS) of DKD in type 2 diabetes (T2D) using eight complementary dichotomous and quantitative DKD phenotypes: the principal dichotomous analysis involved 5,717 T2D subjects, 3,345 with DKD. Promising association signals were evaluated in up to 26,827 subjects with T2D (12,710 with DKD). A combined T1D+T2D GWAS was performed using complementary data available for subjects with T1D, which, with replication samples, involved up to 40,340 subjects with diabetes (18,582 with DKD). Analysis of specific DKD phenotypes identified a novel signal near GABRR1 (rs9942471, P = 4.5 × 10^-8) associated with microalbuminuria in European T2D case subjects. However, no replication of this signal was observed in Asian subjects with T2D or in the equivalent T1D analysis. There was only limited support, in this substantially enlarged analysis, for association at previously reported DKD signals, except for those at UMOD and PRKAG2, both associated with estimated glomerular filtration rate. We conclude that, despite challenges in addressing phenotypic heterogeneity, access to increased sample sizes will continue to provide more robust inference regarding risk variant discovery for DKD.
The Human Proteoform Project: Defining the human proteome
Proteins are the primary effectors of function in biology, and thus, complete knowledge of their structure and properties is fundamental to deciphering function in basic and translational research. The chemical diversity of proteins is expressed in their many proteoforms, which result from combinations of genetic polymorphisms, RNA splice variants, and posttranslational modifications. This knowledge is foundational for the biological complexes and networks that control biology yet remains largely unknown. We propose here an ambitious initiative to define the human proteome, that is, to generate a definitive reference set of the proteoforms produced from the genome. Several examples of the power and importance of proteoform-level knowledge in disease-based research are presented along with a call for improved technologies in a two-pronged strategy to the Human Proteoform Project.
ProForma: A Standard Proteoform Notation
The Consortium for Top-Down Proteomics (CTDP) proposes a standardized notation, ProForma, for writing the sequence of fully characterized proteoforms. ProForma provides a means to communicate any proteoform by writing the amino acid sequence using standard one-letter notation and specifying modifications or unidentified mass shifts within brackets following certain amino acids. The notation is unambiguous, human-readable, and can easily be parsed and written by bioinformatic tools. This system uses seven rules and supports a wide range of possible use cases, ensuring compatibility and reproducibility of proteoform annotations. Standardizing proteoform sequences will simplify storage, comparison, and reanalysis of proteomic studies, and the Consortium welcomes input and contributions from the research community on the continued design and maintenance of this standard.
A five-level classification system for proteoform identifications
To the editor: The term proteoform, introduced in Nature Methods in 2013 (ref. 1), has rapidly gained acceptance in the proteomics community. The challenge and importance of comprehensively identifying proteoforms in complex samples has been recognized, and reports have begun to appear of new platforms towards that end (refs. 2-5). However, one interesting central ambiguity has emerged, namely determining precisely what is meant by a ‘proteoform identification’. At present, the only practical approaches for establishing the exact primary structure of a proteoform employ mass spectrometry (MS), and a wide range of MS results claim proteoform identifications (ref. 6). This seemingly small matter has significant impact, as the ambiguity in what is meant by an ‘identification’ makes it difficult to compare results from different laboratories and approaches. This situation hinders the ability of the community to evaluate technological progress and to efficiently expand biological knowledge.