202 research outputs found

    Phenotype Restricted Genome-Wide Association Study Using a Gene-Centric Approach Identifies Three Low-Risk Neuroblastoma Susceptibility Loci

    Get PDF
    Neuroblastoma is a malignant neoplasm of the developing sympathetic nervous system that is notable for its phenotypic diversity. High-risk patients typically have widely disseminated disease at diagnosis and a poor survival probability, but low-risk patients frequently have localized tumors that are almost always cured with little or no chemotherapy. Our genome-wide association study (GWAS) has identified common variants within FLJ22536, BARD1, and LMO1 as significantly associated with neuroblastoma and more robustly associated with high-risk disease. Here we show that a GWAS focused on low-risk cases identified SNPs within DUSP12 at 1q23.3 (P = 2.07×10−6), DDX4 and IL31RA both at 5q11.2 (P = 2.94×10−6 and 6.54×10−7 respectively), and HSD17B12 at 11p11.2 (P = 4.20×10−7) as being associated with the less aggressive form of the disease. These data demonstrate the importance of robust phenotypic data in GWAS analyses and identify additional susceptibility variants for neuroblastoma

    Finding exclusively deleted or amplified genomic areas in lung adenocarcinomas using a novel chromosomal pattern analysis

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Genomic copy number alteration (CNA) that are recurrent across multiple samples often harbor critical genes that can drive either the initiation or the progression of cancer disease. Up to now, most researchers investigating recurrent CNAs consider separately the marginal frequencies for copy gain or loss and select the areas of interest based on arbitrary cut-off thresholds of these frequencies. In practice, these analyses ignore the interdependencies between the propensity of being deleted or amplified for a clone. In this context, a joint analysis of the copy number changes across tumor samples may bring new insights about patterns of recurrent CNAs.</p> <p>Methods</p> <p>We propose to identify patterns of recurrent CNAs across tumor samples from high-resolution comparative genomic hybridization microarrays. Clustering is achieved by modeling the copy number state (loss, no-change, gain) as a multinomial distribution with probabilities parameterized through a latent class model leading to nine patterns of recurrent CNAs. This model gives us a powerful tool to identify clones with contrasting propensity of being deleted or amplified across tumor samples. We applied this model to a homogeneous series of 65 lung adenocarcinomas.</p> <p>Results</p> <p>Our latent class model analysis identified interesting patterns of chromosomal aberrations. Our results showed that about thirty percent of the genomic clones were classified either as "exclusively" deleted or amplified recurrent CNAs and could be considered as non random chromosomal events. Most of the known oncogenes or tumor suppressor genes associated with lung adenocarcinoma were located within these areas. We also describe genomic areas of potential interest and show that an increase of the frequency of amplification in these particular areas is significantly associated with poorer survival.</p> <p>Conclusion</p> <p>Analyzing jointly deletions and amplifications through our latent class model analysis allows highlighting specific genomic areas with exclusively amplified or deleted recurrent CNAs which are good candidate for harboring oncogenes or tumor suppressor genes.</p

    Software comparison for evaluating genomic copy number variation for Affymetrix 6.0 SNP array platform

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Copy number data are routinely being extracted from genome-wide association study chips using a variety of software. We empirically evaluated and compared four freely-available software packages designed for Affymetrix SNP chips to estimate copy number: Affymetrix Power Tools (APT), Aroma.Affymetrix, PennCNV and CRLMM. Our evaluation used 1,418 GENOA samples that were genotyped on the Affymetrix Genome-Wide Human SNP Array 6.0. We compared bias and variance in the locus-level copy number data, the concordance amongst regions of copy number gains/deletions and the false-positive rate amongst deleted segments.</p> <p>Results</p> <p>APT had median locus-level copy numbers closest to a value of two, whereas PennCNV and Aroma.Affymetrix had the smallest variability associated with the median copy number. Of those evaluated, only PennCNV provides copy number specific quality-control metrics and identified 136 poor CNV samples. Regions of copy number variation (CNV) were detected using the hidden Markov models provided within PennCNV and CRLMM/VanillaIce. PennCNV detected more CNVs than CRLMM/VanillaIce; the median number of CNVs detected per sample was 39 and 30, respectively. PennCNV detected most of the regions that CRLMM/VanillaIce did as well as additional CNV regions. The median concordance between PennCNV and CRLMM/VanillaIce was 47.9% for duplications and 51.5% for deletions. The estimated false-positive rate associated with deletions was similar for PennCNV and CRLMM/VanillaIce.</p> <p>Conclusions</p> <p>If the objective is to perform statistical tests on the locus-level copy number data, our empirical results suggest that PennCNV or Aroma.Affymetrix is optimal. If the objective is to perform statistical tests on the summarized segmented data then PennCNV would be preferred over CRLMM/VanillaIce. Specifically, PennCNV allows the analyst to estimate locus-level copy number, perform segmentation and evaluate CNV-specific quality-control metrics within a single software package. PennCNV has relatively small bias, small variability and detects more regions while maintaining a similar estimated false-positive rate as CRLMM/VanillaIce. More generally, we advocate that software developers need to provide guidance with respect to evaluating and choosing optimal settings in order to obtain optimal results for an individual dataset. Until such guidance exists, we recommend trying multiple algorithms, evaluating concordance/discordance and subsequently consider the union of regions for downstream association tests.</p

    Notions of Bidirectional Computation and Entangled State Monads

    Get PDF
    Bidirectional transformations (bx) support principled consistency maintenance between data sources. Each data source corresponds to one perspective on a composite system, manifested by operations to ‘get’ and ‘set’ a view of the whole from that particular perspective. Bx are important in a wide range of settings, including databases, interactive applications, and model-driven development. We show that bx are naturally modelled in terms of mutable state; in particular, the ‘set’ operations are stateful functions. This leads naturally to considering bx that exploit other computational effects too, such as I/O, nondeterminism, and failure, all largely ignored in the bx literature to date. We present a semantic foundation for symmetric bidirectional transformations with effects. We build on the mature theory of monadic encapsulation of effects in functional programming, develop the equational theory and important combinators for effectful bx, and provide a prototype implementation in Haskell along with several illustrative examples

    Accurate and exact CNV identification from targeted high-throughput sequence data

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Massively parallel sequencing of barcoded DNA samples significantly increases screening efficiency for clinically important genes. Short read aligners are well suited to single nucleotide and indel detection. However, methods for CNV detection from targeted enrichment are lacking. We present a method combining coverage with map information for the identification of deletions and duplications in targeted sequence data.</p> <p>Results</p> <p>Sequencing data is first scanned for gains and losses using a comparison of normalized coverage data between samples. CNV calls are confirmed by testing for a signature of sequences that span the CNV breakpoint. With our method, CNVs can be identified regardless of whether breakpoints are within regions targeted for sequencing. For CNVs where at least one breakpoint is within targeted sequence, exact CNV breakpoints can be identified. In a test data set of 96 subjects sequenced across ~1 Mb genomic sequence using multiplexing technology, our method detected mutations as small as 31 bp, predicted quantitative copy count, and had a low false-positive rate.</p> <p>Conclusions</p> <p>Application of this method allows for identification of gains and losses in targeted sequence data, providing comprehensive mutation screening when combined with a short read aligner.</p

    A novel SNP analysis method to detect copy number alterations with an unbiased reference signal directly from tumor samples

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Genomic instability in cancer leads to abnormal genome copy number alterations (CNA) as a mechanism underlying tumorigenesis. Using microarrays and other technologies, tumor CNA are detected by comparing tumor sample CN to normal reference sample CN. While advances in microarray technology have improved detection of copy number alterations, the increase in the number of measured signals, noise from array probes, variations in signal-to-noise ratio across batches and disparity across laboratories leads to significant limitations for the accurate identification of CNA regions when comparing tumor and normal samples.</p> <p>Methods</p> <p>To address these limitations, we designed a novel "Virtual Normal" algorithm (VN), which allowed for construction of an unbiased reference signal directly from test samples within an experiment using any publicly available normal reference set as a baseline thus eliminating the need for an in-lab normal reference set.</p> <p>Results</p> <p>The algorithm was tested using an optimal, paired tumor/normal data set as well as previously uncharacterized pediatric malignant gliomas for which a normal reference set was not available. Using Affymetrix 250K Sty microarrays, we demonstrated improved signal-to-noise ratio and detected significant copy number alterations using the VN algorithm that were validated by independent PCR analysis of the target CNA regions.</p> <p>Conclusions</p> <p>We developed and validated an algorithm to provide a virtual normal reference signal directly from tumor samples and minimize noise in the derivation of the raw CN signal. The algorithm reduces the variability of assays performed across different reagent and array batches, methods of sample preservation, multiple personnel, and among different laboratories. This approach may be valuable when matched normal samples are unavailable or the paired normal specimens have been subjected to variations in methods of preservation.</p

    Notch signaling in mouse blastocyst development and hatching

    Get PDF
    Research Areas: Developmental BiologyBackground: Mammalian early embryo development requires a well-orchestrated interplay of cell signaling pathways. Notch is a major regulatory pathway involved in cell-fate determination in embryonic and adult scenarios. However, the role of Notch in embryonic pre-implantation development is controversial. In particular, Notch role on blastocyst development and hatching remains elusive, and a complete picture of the transcription and expression patterns of Notch components during this time-period is not available. Results: This study provided a comprehensive view on the dynamics of individual embryo gene transcription and protein expression patterns of Notch components (receptors Notch1–4; ligands Dll1 and Dll4, Jagged1–2; and effectors Hes1–2), and their relationship with transcription of gene markers of pluripotency and differentiation (Sox2, Oct4, Klf4, Cdx2) during mouse blastocyst development and hatching. Transcription of Notch1–2, Jagged1–2 and Hes1 was highly prevalent and dynamic along stages of development, whereas transcription of Notch3–4, Dll4 and Hes2 had a low prevalence among embryos. Transcription levels of Notch1, Notch2, Jagged2 and Hes1 correlated with each other and with those of pluripotency and differentiation genes. Gene transcription was associated to protein expression, except for Jagged2, where high transcription levels in all embryos were not translated into protein. Presence of Notch signaling activity was confirmed through nuclear NICD and Hes1 detection, and downregulation of Hes1 transcription following canonical signaling blockade with DAPT. In vitro embryo culture supplementation with Jagged1 had no effect on embryo developmental kinetics. In contrast, supplementation with Jagged2 abolished Jagged1 transcription, downregulated Cdx2 transcription and inhibited blastocyst hatching. Notch signaling blockade by DAPT downregulated transcription of Sox2, and retarded embryo hatching. Conclusion: Transcription of Notch genes showed a dynamic pattern along blastocyst development and hatching. Data confirmed Notch signaling activity, and lead to the suggestion that Notch canonical signaling may be operating through Notch1, Notch3, Jagged1 and Hes1. Embryo culture supplementation with Jagged1 and Jagged2 unveiled a possible regulatory effect between Jagged1, Cdx2 and blastocyst hatching. Overall, results indicate that a deregulation in Notch signaling, either by its over or under-activation, affects blastocyst development and hatching.info:eu-repo/semantics/publishedVersio

    Bayesian estimation of genomic copy number with single nucleotide polymorphism genotyping arrays

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>The identification of copy number aberration in the human genome is an important area in cancer research. We develop a model for determining genomic copy numbers using high-density single nucleotide polymorphism genotyping microarrays. The method is based on a Bayesian spatial normal mixture model with an unknown number of components corresponding to true copy numbers. A reversible jump Markov chain Monte Carlo algorithm is used to implement the model and perform posterior inference.</p> <p>Results</p> <p>The performance of the algorithm is examined on both simulated and real cancer data, and it is compared with the popular CNAG algorithm for copy number detection.</p> <p>Conclusions</p> <p>We demonstrate that our Bayesian mixture model performs at least as well as the hidden Markov model based CNAG algorithm and in certain cases does better. One of the added advantages of our method is the flexibility of modeling normal cell contamination in tumor samples.</p

    Detection of recurrent rearrangement breakpoints from copy number data

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Copy number variants (CNVs), including deletions, amplifications, and other rearrangements, are common in human and cancer genomes. Copy number data from array comparative genome hybridization (aCGH) and next-generation DNA sequencing is widely used to measure copy number variants. Comparison of copy number data from multiple individuals reveals recurrent variants. Typically, the interior of a recurrent CNV is examined for genes or other loci associated with a phenotype. However, in some cases, such as gene truncations and fusion genes, the target of variant lies at the boundary of the variant.</p> <p>Results</p> <p>We introduce Neighborhood Breakpoint Conservation (NBC), an algorithm for identifying rearrangement breakpoints that are highly conserved at the same locus in multiple individuals. NBC detects recurrent breakpoints at varying levels of resolution, including breakpoints whose location is exactly conserved and breakpoints whose location varies within a gene. NBC also identifies pairs of recurrent breakpoints such as those that result from fusion genes. We apply NBC to aCGH data from 36 primary prostate tumors and identify 12 novel rearrangements, one of which is the well-known TMPRSS2-ERG fusion gene. We also apply NBC to 227 glioblastoma tumors and predict 93 novel rearrangements which we further classify as gene truncations, germline structural variants, and fusion genes. A number of these variants involve the protein phosphatase PTPN12 suggesting that deregulation of PTPN12, via a variety of rearrangements, is common in glioblastoma.</p> <p>Conclusions</p> <p>We demonstrate that NBC is useful for detection of recurrent breakpoints resulting from copy number variants or other structural variants, and in particular identifies recurrent breakpoints that result in gene truncations or fusion genes. Software is available at <url>http://http.//cs.brown.edu/people/braphael/software.html</url>.</p

    GISTIC2.0 facilitates sensitive and confident localization of the targets of focal somatic copy-number alteration in human cancers

    Get PDF
    We describe methods with enhanced power and specificity to identify genes targeted by somatic copy-number alterations (SCNAs) that drive cancer growth. By separating SCNA profiles into underlying arm-level and focal alterations, we improve the estimation of background rates for each category. We additionally describe a probabilistic method for defining the boundaries of selected-for SCNA regions with user-defined confidence. Here we detail this revised computational approach, GISTIC2.0, and validate its performance in real and simulated datasets
    corecore