33 research outputs found

    Genome-Wide Analyses for Osteosarcoma in Leonberger Dogs Reveal the CDKN2A/B Gene Locus as a Major Risk Locus

    Get PDF
    Dogs represent a unique spontaneous cancer model. Osteosarcoma (OSA) is the most common primary bone tumor in dogs (OMIA 001441-9615), and strongly resembles human forms of OSA. Several large- to giant-sized dog breeds, including the Leonberger, have a greatly increased risk of developing OSA. We performed genome-wide association analysis with high-density imputed SNP genotype data from 273 Leonberger cases with a median age of 8.1 [3.1–13.5] years and 365 controls older than eight years. This analysis revealed significant associations at the CDKN2A/B gene locus on canine chromosome 11, mirroring previous findings in other dog breeds, such as the greyhound, that also show an elevated risk for OSA. Heritability (h2SNP) was determined to be 20.6% (SE = 0.08; p-value = 5.7 × 10−4) based on a breed prevalence of 20%. The 2563 SNPs across the genome accounted for nearly all the h2SNP of OSA, with 2183 SNPs of small effect, 316 SNPs of moderate effect, and 64 SNPs of large effect. As with many other cancers it is likely that regulatory, non-coding variants underlie the increased risk for cancer development. Our findings confirm a complex genetic basis of OSA, moderate heritability, and the crucial role of the CDKN2A/B locus leading to strong cancer predisposition in dogs. It will ultimately be interesting to study and compare the known genetic loci associated with canine OSA in human OSA

    Efficient exploration of pan-cancer networks by generalized covariance selection and interactive web content

    Get PDF
    Statistical network modeling techniques are increasingly important tools to analyze cancer genomics data. However, current tools and resources are not designed to work across multiple diagnoses and technical platforms, thus limiting their applicability to comprehensive pan-cancer datasets such as The Cancer Genome Atlas (TCGA). To address this, we describe a new data driven modeling method, based on generalized Sparse Inverse Covariance Selection (SICS). The method integrates genetic, epigenetic and transcriptional data from multiple cancers, to define links that are present in multiple cancers, a subset of cancers, or a single cancer. It is shown to be statistically robust and effective at detecting direct pathway links in data from TCGA. To facilitate interpretation of the results, we introduce a publicly accessible tool ( ext-link-type="uri" xlink:href="http://cancerlandscapes.org/">cancerlandscapes.org), in which the derived networks are explored as interactive web content, linked to several pathway and pharmacological databases. To evaluate the performance of the method, we constructed a model for eight TCGA cancers, using data from 3900 patients. The model rediscovered known mechanisms and contained interesting predictions. Possible applications include prediction of regulatory relationships, comparison of network modules across multiple forms of cancer and identification of drug targets

    Efficient exploration of pan-cancer networks by generalized covariance selection and interactive web content

    No full text
    Statistical network modeling techniques are increasingly important tools to analyze cancer genomics data. However, current tools and resources are not designed to work across multiple diagnoses and technical platforms, thus limiting their applicability to comprehensive pan-cancer datasets such as The Cancer Genome Atlas (TCGA). To address this, we describe a new data driven modeling method, based on generalized Sparse Inverse Covariance Selection (SICS). The method integrates genetic, epigenetic and transcriptional data from multiple cancers, to define links that are present in multiple cancers, a subset of cancers, or a single cancer. It is shown to be statistically robust and effective at detecting direct pathway links in data from TCGA. To facilitate interpretation of the results, we introduce a publicly accessible tool ( ext-link-type="uri" xlink:href="http://cancerlandscapes.org/">cancerlandscapes.org), in which the derived networks are explored as interactive web content, linked to several pathway and pharmacological databases. To evaluate the performance of the method, we constructed a model for eight TCGA cancers, using data from 3900 patients. The model rediscovered known mechanisms and contained interesting predictions. Possible applications include prediction of regulatory relationships, comparison of network modules across multiple forms of cancer and identification of drug targets

    Bayesian Estimation of Transcript Levels Using a General Model of Array Measurement Noise

    No full text
    Gene arrays demonstrate a promising ability to characterize expression levels across the entire genome but su#er from significant levels of measurement noise. We present a rigorous new approach to estimate transcript levels and ratios from one or more gene array experiments, given a model of measurement noise and available prior information

    Whole-genome sequencing of glioblastoma reveals enrichment of non-coding constraint mutations in known and novel genes

    No full text
    Background Glioblastoma (GBM) has one of the worst 5-year survival rates of all cancers. While genomic studies of the disease have been performed, alterations in the non-coding regulatory regions of GBM have largely remained unexplored. We apply whole-genome sequencing (WGS) to identify non-coding mutations, with regulatory potential in GBM, under the hypothesis that regions of evolutionary constraint are likely to be functional, and somatic mutations are likely more damaging than in unconstrained regions. Results We validate our GBM cohort, finding similar copy number aberrations and mutated genes based on coding mutations as previous studies. Performing analysis on non-coding constraint mutations and their position relative to nearby genes, we find a significant enrichment of non-coding constraint mutations in the neighborhood of 78 genes that have previously been implicated in GBM. Among them, SEMA3C and DYNC1I1 show the highest frequencies of alterations, with multiple mutations overlapping transcription factor binding sites. We find that a non-coding constraint mutation in the SEMA3C promoter reduces the DNA binding capacity of the region. We also identify 1776 other genes enriched for non-coding constraint mutations with likely regulatory potential, providing additional candidate GBM genes. The mutations in the top four genes, DLX5, DLX6, FOXA1, and ISL1, are distributed over promoters, UTRs, and multiple transcription factor binding sites. Conclusions These results suggest that non-coding constraint mutations could play an essential role in GBM, underscoring the need to connect non-coding genomic variation to biological function and disease pathology.De tre första författarna delar förstaförfattarskapetTitle in thesis list of papers: Whole Genome Sequencing of Glioblastoma Reveals Enrichment of Non-Coding Constraint Mutations in Known and Novel Genes</p
    corecore