124,627 research outputs found

    Towards linked open gene mutations data

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>With the advent of high-throughput technologies, a great wealth of variation data is being produced. Such information may constitute the basis for correlation analyses between genotypes and phenotypes and, in the future, for personalized medicine. Several databases on gene variation exist, but this kind of information is still scarce in the Semantic Web framework.</p> <p>In this paper, we discuss issues related to the integration of mutation data in the Linked Open Data infrastructure, part of the Semantic Web framework. We present the development of a mapping from the IARC TP53 Mutation database to RDF and the implementation of servers publishing this data.</p> <p>Methods</p> <p>A version of the IARC TP53 Mutation database implemented in a relational database was used as first test set. Automatic mappings to RDF were first created by using D2RQ and later manually refined by introducing concepts and properties from domain vocabularies and ontologies, as well as links to Linked Open Data implementations of various systems of biomedical interest.</p> <p>Since D2RQ query performances are lower than those that can be achieved by using an RDF archive, generated data was also loaded into a dedicated system based on tools from the Jena software suite.</p> <p>Results</p> <p>We have implemented a D2RQ Server for TP53 mutation data, providing data on a subset of the IARC database, including gene variations, somatic mutations, and bibliographic references. The server allows to browse the RDF graph by using links both between classes and to external systems. An alternative interface offers improved performances for SPARQL queries. The resulting data can be explored by using any Semantic Web browser or application.</p> <p>Conclusions</p> <p>This has been the first case of a mutation database exposed as Linked Data. A revised version of our prototype, including further concepts and IARC TP53 Mutation database data sets, is under development.</p> <p>The publication of variation information as Linked Data opens new perspectives: the exploitation of SPARQL searches on mutation data and other biological databases may support data retrieval which is presently not possible. Moreover, reasoning on integrated variation data may support discoveries towards personalized medicine.</p

    Rare germline variants in DNA repair genes and the angiogenesis pathway predispose prostate cancer patients to develop metastatic disease

    Get PDF
    Background Prostate cancer (PrCa) demonstrates a heterogeneous clinical presentation ranging from largely indolent to lethal. We sought to identify a signature of rare inherited variants that distinguishes between these two extreme phenotypes. Methods We sequenced germline whole exomes from 139 aggressive (metastatic, age of diagnosis < 60) and 141 non-aggressive (low clinical grade, age of diagnosis ≥60) PrCa cases. We conducted rare variant association analyses at gene and gene set levels using SKAT and Bayesian risk index techniques. GO term enrichment analysis was performed for genes with the highest differential burden of rare disruptive variants. Results Protein truncating variants (PTVs) in specific DNA repair genes were significantly overrepresented among patients with the aggressive phenotype, with BRCA2, ATM and NBN the most frequently mutated genes. Differential burden of rare variants was identified between metastatic and non-aggressive cases for several genes implicated in angiogenesis, conferring both deleterious and protective effects. Conclusions Inherited PTVs in several DNA repair genes distinguish aggressive from non-aggressive PrCa cases. Furthermore, inherited variants in genes with roles in angiogenesis may be potential predictors for risk of metastases. If validated in a larger dataset, these findings have potential for future clinical application

    Global DNA methylation and transcriptional analyses of human ESC-derived cardiomyocytes.

    Get PDF
    With defined culture protocol, human embryonic stem cells (hESCs) are able to generate cardiomyocytes in vitro, therefore providing a great model for human heart development, and holding great potential for cardiac disease therapies. In this study, we successfully generated a highly pure population of human cardiomyocytes (hCMs) (&gt;95% cTnT(+)) from hESC line, which enabled us to identify and characterize an hCM-specific signature, at both the gene expression and DNA methylation levels. Gene functional association network and gene-disease network analyses of these hCM-enriched genes provide new insights into the mechanisms of hCM transcriptional regulation, and stand as an informative and rich resource for investigating cardiac gene functions and disease mechanisms. Moreover, we show that cardiac-structural genes and cardiac-transcription factors have distinct epigenetic mechanisms to regulate their gene expression, providing a better understanding of how the epigenetic machinery coordinates to regulate gene expression in different cell types

    Non-functional immunoglobulin G transcripts in a case of hyper-immunoglobulin M syndrome similar to type 4

    Get PDF
    86% of immunoglobulin G (IgG) heavy-chain gene transcripts were found to be non-functional in the peripheral blood B cells of a patient initially diagnosed with common variable immunodeficiency, who later developed raised IgM, whereas no non-functionally rearranged transcripts were found in the cells of seven healthy control subjects. All the patient's IgM heavy-chain and κ light-chain transcripts were functional, suggesting that either non-functional rearrangements were being selectively class-switched to IgG, or that receptor editing was rendering genes non-functional after class-switching. The functional γ-chain sequences showed a normal rate of somatic hypermutation while non-functional sequences contained few somatic mutations, suggesting that most came from cells that had no functional gene and therefore were not receiving signals for hypermutation. However, apoptosis of peripheral blood lymphocytes was not impaired. No defects have been found in any of the genes currently known to be responsible for hyper-IgM syndrome but the phenotype fits best to type 4

    MoKCa database - mutations of kinases in cancer

    Get PDF
    Members of the protein kinase family are amongst the most commonly mutated genes in human cancer, and both mutated and activated protein kinases have proved to be tractable targets for the development of new anticancer therapies The MoKCa database (Mutations of Kinases in Cancer, http://strubiol.icr.ac.uk/extra/mokca) has been developed to structurally and functionally annotate, and where possible predict, the phenotypic consequences of mutations in protein kinases implicated in cancer. Somatic mutation data from tumours and tumour cell lines have been mapped onto the crystal structures of the affected protein domains. Positions of the mutated amino-acids are highlighted on a sequence-based domain pictogram, as well as a 3D-image of the protein structure, and in a molecular graphics package, integrated for interactive viewing. The data associated with each mutation is presented in the Web interface, along with expert annotation of the detailed molecular functional implications of the mutation. Proteins are linked to functional annotation resources and are annotated with structural and functional features such as domains and phosphorylation sites. MoKCa aims to provide assessments available from multiple sources and algorithms for each potential cancer-associated mutation, and present these together in a consistent and coherent fashion to facilitate authoritative annotation by cancer biologists and structural biologists, directly involved in the generation and analysis of new mutational data

    The role of axonopathy in Parkinson\u27s disease

    Get PDF

    The evolution of genetic architectures underlying quantitative traits

    Full text link
    In the classic view introduced by R. A. Fisher, a quantitative trait is encoded by many loci with small, additive effects. Recent advances in QTL mapping have begun to elucidate the genetic architectures underlying vast numbers of phenotypes across diverse taxa, producing observations that sometimes contrast with Fisher's blueprint. Despite these considerable empirical efforts to map the genetic determinants of traits, it remains poorly understood how the genetic architecture of a trait should evolve, or how it depends on the selection pressures on the trait. Here we develop a simple, population-genetic model for the evolution of genetic architectures. Our model predicts that traits under moderate selection should be encoded by many loci with highly variable effects, whereas traits under either weak or strong selection should be encoded by relatively few loci. We compare these theoretical predictions to qualitative trends in the genetics of human traits, and to systematic data on the genetics of gene expression levels in yeast. Our analysis provides an evolutionary explanation for broad empirical patterns in the genetic basis of traits, and it introduces a single framework that unifies the diversity of observed genetic architectures, ranging from Mendelian to Fisherian.Comment: Minor changes in the text; Added supplementary materia

    A murine model of variant late infantile ceroid lipofuscinosis recapitulates behavioral and pathological phenotypes of human disease.

    Get PDF
    Neuronal ceroid lipofuscinoses (NCLs; also known collectively as Batten Disease) are a family of autosomal recessive lysosomal storage disorders. Mutations in as many as 13 genes give rise to ∼10 variants of NCL, all with overlapping clinical symptomatology including visual impairment, motor and cognitive dysfunction, seizures, and premature death. Mutations in CLN6 result in both a variant late infantile onset neuronal ceroid lipofuscinosis (vLINCL) as well as an adult-onset form of the disease called Type A Kufs. CLN6 is a non-glycosylated membrane protein of unknown function localized to the endoplasmic reticulum (ER). In this study, we perform a detailed characterization of a naturally occurring Cln6 mutant (Cln6(nclf)) mouse line to validate its utility for translational research. We demonstrate that this Cln6(nclf) mutation leads to deficits in motor coordination, vision, memory, and learning. Pathologically, we demonstrate loss of neurons within specific subregions and lamina of the cortex that correlate to behavioral phenotypes. As in other NCL models, this model displays selective loss of GABAergic interneuron sub-populations in the cortex and the hippocampus with profound, early-onset glial activation. Finally, we demonstrate a novel deficit in memory and learning, including a dramatic reduction in dendritic spine density in the cerebral cortex, which suggests a reduction in synaptic strength following disruption in CLN6. Together, these findings highlight the behavioral and pathological similarities between the Cln6(nclf) mouse model and human NCL patients, validating this model as a reliable format for screening potential therapeutics
    • …
    corecore