804 research outputs found
Recommended from our members
SNP panel identification assay (SPIA): a genetic-based assay for the identification of cell lines
Translational research hinges on the ability to make observations in model systems and to implement those findings into clinical applications, such as the development of diagnostic tools or targeted therapeutics. Tumor cell lines are commonly used to model carcinogenesis. The same tumor cell line can be simultaneously studied in multiple research laboratories throughout the world, theoretically generating results that are directly comparable. One important assumption in this paradigm is that researchers are working with the same cells. However, recent work using high throughput genomic analyses questions the accuracy of this assumption. Observations by our group and others suggest that experiments reported in the scientific literature may contain pre-analytic errors due to inaccurate identities of the cell lines employed. To address this problem, we developed a simple approach that enables an accurate determination of cell line identity by genotyping 34 single nucleotide polymorphisms (SNPs). Here, we describe the empirical development of a SNP panel identification assay (SPIA) compatible with routine use in the laboratory setting to ensure the identity of tumor cell lines and human tumor samples throughout the course of long term research use
Somatic rearrangements across cancer reveal classes of samples with distinct patterns of DNA breakage and rearrangement-induced hypermutability
Whole-genome sequencing using massively parallel sequencing technologies enables accurate detection of somatic rearrangements in cancer. Pinpointing large numbers of rearrangement breakpoints to base-pair resolution allows analysis of rearrangement microhomology and genomic location for every sample. Here we analyze 95 tumor genome sequences from breast, head and neck, colorectal, and prostate carcinomas, and from melanoma, multiple myeloma, and chronic lymphocytic leukemia. We discover three genomic factors that are significantly correlated with the distribution of rearrangements: replication time, transcription rate, and GC content. The correlation is complex, and different patterns are observed between tumor types, within tumor types, and even between different types of rearrangements. Mutations in the APC gene correlate with and, hence, potentially contribute to DNA breakage in late-replicating, low %GC, untranscribed regions of the genome. We show that somatic rearrangements display less microhomology than germline rearrangements, and that breakpoint loci are correlated with local hypermutability with a particular enrichment for C ↔ G transversions
Allele-Specific Amplification in Cancer Revealed by SNP Array Analysis
Amplification, deletion, and loss of heterozygosity of genomic DNA are hallmarks of cancer. In recent years a variety of studies have emerged measuring total chromosomal copy number at increasingly high resolution. Similarly, loss-of-heterozygosity events have been finely mapped using high-throughput genotyping technologies. We have developed a probe-level allele-specific quantitation procedure that extracts both copy number and allelotype information from single nucleotide polymorphism (SNP) array data to arrive at allele-specific copy number across the genome. Our approach applies an expectation-maximization algorithm to a model derived from a novel classification of SNP array probes. This method is the first to our knowledge that is able to (a) determine the generalized genotype of aberrant samples at each SNP site (e.g., CCCCT at an amplified site), and (b) infer the copy number of each parental chromosome across the genome. With this method, we are able to determine not just where amplifications and deletions occur, but also the haplotype of the region being amplified or deleted. The merit of our model and general approach is demonstrated by very precise genotyping of normal samples, and our allele-specific copy number inferences are validated using PCR experiments. Applying our method to a collection of lung cancer samples, we are able to conclude that amplification is essentially monoallelic, as would be expected under the mechanisms currently believed responsible for gene amplification. This suggests that a specific parental chromosome may be targeted for amplification, whether because of germ line or somatic variation. An R software package containing the methods described in this paper is freely available at http://genome.dfci.harvard.edu/~tlaframb/PLASQ
Prioritizing causal disease genes using unbiased genomic features
Background: Cardiovascular disease (CVD) is the leading cause of death in the developed world. Human genetic studies, including genome-wide sequencing and SNP-array approaches, promise to reveal disease genes and mechanisms representing new therapeutic targets. In practice, however, identification of the actual genes contributing to disease pathogenesis has lagged behind identification of associated loci, thus limiting the clinical benefits. Results: To aid in localizing causal genes, we develop a machine learning approach, Objective Prioritization for Enhanced Novelty (OPEN), which quantitatively prioritizes gene-disease associations based on a diverse group of genomic features. This approach uses only unbiased predictive features and thus is not hampered by a preference towards previously well-characterized genes. We demonstrate success in identifying genetic determinants for CVD-related traits, including cholesterol levels, blood pressure, and conduction system and cardiomyopathy phenotypes. Using OPEN, we prioritize genes, including FLNC, for association with increased left ventricular diameter, which is a defining feature of a prevalent cardiovascular disorder, dilated cardiomyopathy or DCM. Using a zebrafish model, we experimentally validate FLNC and identify a novel FLNC splice-site mutation in a patient with severe DCM. Conclusion: Our approach stands to assist interpretation of large-scale genetic studies without compromising their fundamentally unbiased nature. Electronic supplementary material The online version of this article (doi:10.1186/s13059-014-0534-8) contains supplementary material, which is available to authorized users
Recommended from our members
PAK1 is a Breast Cancer Oncogene that Coordinately Activates MAPK and MET Signaling
Activating mutations in the RAS family or BRAF frequently occur in many types of human cancers but are rarely detected in breast tumors. However, activation of the RAS-RAF-MEK-ERK Mitogen-Activated Protein Kinase (MAPK) pathway is commonly observed in human breast cancers, suggesting that other genetic alterations lead to activation of this signaling pathway. To identify breast cancer oncogenes that activate the MAPK pathway, we screened a library of human kinases for their ability to induce anchorage-independent growth in a derivative of immortalized human mammary epithelial cells (HMLE). We identified PAK1 as a kinase that permitted HMLE cells to form anchorage-independent colonies. PAK1 is amplified in several human cancer types, including 33% of breast tumor samples and cancer cell lines. The kinase activity of PAK1 is necessary for PAK1-induced transformation. Moreover, we show that PAK1 simultaneously activates MAPK and MET signaling; the latter via inhibition of Merlin. Disruption of these activities inhibits PAK1-driven anchorage-independent growth. These observations establish PAK1 amplification as an alternative mechanism for MAPK activation in human breast cancer and credential PAK1 as a breast cancer oncogene that coordinately regulates multiple signaling pathways, the cooperation of which leads to malignant transformation
Author Correction: Disruption of chromatin folding domains by somatic genomic rearrangements in human cancer.
Angiomatous meningiomas have a distinct genetic profile with multiple chromosomal polysomies including polysomy of chromosome 5
Meningiomas are a diverse group of tumors with a broad spectrum of histologic features. There are over 12 variants of meningioma, whose genetic features are just beginning to be described. Angiomatous meningioma is a World Health Organization (WHO) meningioma variant with a predominance of blood vessels. They are uncommon and confirming the histopathologic classification can be challenging. Given a lack of biomarkers that define the angiomatous subtype and limited understanding of the genetic changes underlying its tumorigenesis, we compared the genomic characteristics of angiomatous meningioma to more common meningioma subtypes. While typical grade I meningiomas demonstrate monosomy of chromosome 22 or lack copy number aberrations, 13 of 14 cases of angiomatous meningioma demonstrated a distinct copy number profile – polysomies of at least one chromosome, but often of many, especially in chromosomes 5, 13, and 20. WHO grade II atypical meningiomas with angiomatous features have both polysomies and genetic aberrations characteristic of other atypical meningiomas. Sequencing of over 560 cancer-relevant genes in 16 cases of angiomatous meningioma showed that these tumors lack common mutations found in other variants of meningioma. Our study demonstrates that angiomatous meningiomas have distinct genomic features that may be clinically useful for their diagnosis
Amplification of phosphoglycerate dehydrogenase diverts glycolytic flux and contributes to oncogenesis
GISTIC2.0 facilitates sensitive and confident localization of the targets of focal somatic copy-number alteration in human cancers
We describe methods with enhanced power and specificity to identify genes targeted by somatic copy-number alterations (SCNAs) that drive cancer growth. By separating SCNA profiles into underlying arm-level and focal alterations, we improve the estimation of background rates for each category. We additionally describe a probabilistic method for defining the boundaries of selected-for SCNA regions with user-defined confidence. Here we detail this revised computational approach, GISTIC2.0, and validate its performance in real and simulated datasets
Tangent normalization for somatic copy-number inference in cancer genome analysis.
MOTIVATION: Somatic copy-number alterations (SCNAs) play an important role in cancer development. Systematic noise in sequencing and array data present a significant challenge to the inference of SCNAs for cancer genome analyses. As part of The Cancer Genome Atlas, the Broad Institute Genome Characterization Center developed the Tangent normalization method to generate copy-number profiles using data from single-nucleotide polymorphism (SNP) arrays and whole-exome sequencing (WES) technologies for over 10 000 pairs of tumors and matched normal samples. Here, we describe the Tangent method, which uses a unique linear combination of normal samples as a reference for each tumor sample, to subtract systematic errors that vary across samples. We also describe a modification of Tangent, called Pseudo-Tangent, which enables denoising through comparisons between tumor profiles when few normal samples are available. RESULTS: Tangent normalization substantially increases signal-to-noise ratios (SNRs) compared to conventional normalization methods in both SNP array and WES analyses. Tangent and Pseudo-Tangent normalizations improve the SNR by reducing noise with minimal effect on signal and exceed the contribution of other steps in the analysis such as choice of segmentation algorithm. Tangent and Pseudo-Tangent are broadly applicable and enable more accurate inference of SCNAs from DNA sequencing and array data. AVAILABILITY AND IMPLEMENTATION: Tangent is available at https://github.com/broadinstitute/tangent and as a Docker image (https://hub.docker.com/r/broadinstitute/tangent). Tangent is also the normalization method for the copy-number pipeline in Genome Analysis Toolkit 4 (GATK4). SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online
- …