8 research outputs found

    Performance evaluation of DNA copy number segmentation methods

    Full text link
    A number of bioinformatic or biostatistical methods are available for analyzing DNA copy number profiles measured from microarray or sequencing technologies. In the absence of rich enough gold standard data sets, the performance of these methods is generally assessed using unrealistic simulation studies, or based on small real data analyses. We have designed and implemented a framework to generate realistic DNA copy number profiles of cancer samples with known truth. These profiles are generated by resampling real SNP microarray data from genomic regions with known copy-number state. The original real data have been extracted from dilutions series of tumor cell lines with matched blood samples at several concentrations. Therefore, the signal-to-noise ratio of the generated profiles can be controlled through the (known) percentage of tumor cells in the sample. In this paper, we describe this framework and illustrate some of the benefits of the proposed data generation approach on a practical use case: a comparison study between methods for segmenting DNA copy number profiles from SNP microarrays. This study indicates that no single method is uniformly better than all others. It also helps identifying pros and cons for the compared methods as a function of biologically informative parameters, such as the fraction of tumor cells in the sample and the proportion of heterozygous markers. Availability: R package jointSeg: http://r-forge.r-project.org/R/?group\_id=156

    Patterns of chromosomal copy-number alterations in intrahepatic cholangiocarcinoma

    Get PDF
    International audienceBackground: Intrahepatic cholangiocarcinomas (ICC) are relatively rare malignant tumors associated with a poor prognosis. Recent studies using genome-wide sequencing technologies have mainly focused on identifying new driver mutations. There is nevertheless a need to investigate the spectrum of copy number aberrations in order to identify potential target genes in the altered chromosomal regions. The aim of this study was to characterize the patterns of chromosomal copy-number alterations (CNAs) in ICC. Methods: 53 patients having ICC with frozen material were selected. In 47 cases, DNA hybridization has been performed on a genomewide SNP array. A procedure with a segmentation step and a calling step classified genomic regions into copy-number aberration states. We identified the exclusively amplified and deleted recurrent genomic areas. These areas are those showing the highest estimated propensity level for copy loss (resp. copy gain) together with the lowest level for copy gain (resp. copy loss). We investigated ICC clustering. We analyzed the relationships between CNAs and clinico-pathological characteristics. Results: The overall genomic profile of ICC showed many alterations with higher rates for the deletions. Exclusively deleted genomic areas were 1p, 3p and 14q. The main exclusively amplified genomic areas were 1q, 7p, 7q and 8q. Based on the exclusively deleted/amplified genomic areas, a clustering analysis identified three tumors groups: the first group characterized by copy loss of 1p and copy gain of 7p, the second group characterized by 1p and 3p copy losses without 7p copy gain, the last group characterized mainly by very few CNAs. From univariate analyses, the number of tumors, the size of the largest tumor and the stage were significantly associated with shorter time recurrence. We found no relationship between the number of altered cytobands or tumor groups and time to recurrence. Conclusion: This study describes the spectrum of chromosomal aberrations across the whole genome. Some of the recurrent exclusive CNAs harbor candidate target genes. Despite the absence of correlation between CNAs and clinico-pathological characteristics, the co-occurence of 7p gain and 1p loss in a subgroup of patients may suggest a differential activation of EGFR and its downstream pathways, which may have a potential effect on targeted therapies

    Weighted Consensus Segmentations

    Get PDF
    The problem of segmenting linearly ordered data is frequently encountered in time-series analysis, computational biology, and natural language processing. Segmentations obtained independently from replicate data sets or from the same data with different methods or parameter settings pose the problem of computing an aggregate or consensus segmentation. This Segmentation Aggregation problem amounts to finding a segmentation that minimizes the sum of distances to the input segmentations. It is again a segmentation problem and can be solved by dynamic programming. The aim of this contribution is (1) to gain a better mathematical understanding of the Segmentation Aggregation problem and its solutions and (2) to demonstrate that consensus segmentations have useful applications. Extending previously known results we show that for a large class of distance functions only breakpoints present in at least one input segmentation appear in the consensus segmentation. Furthermore, we derive a bound on the size of consensus segments. As show-case applications, we investigate a yeast transcriptome and show that consensus segments provide a robust means of identifying transcriptomic units. This approach is particularly suited for dense transcriptomes with polycistronic transcripts, operons, or a lack of separation between transcripts. As a second application, we demonstrate that consensus segmentations can be used to robustly identify growth regimes from sets of replicate growth curves

    Patterns of chromosomal copy-number alterations in intrahepatic cholangiocarcinoma

    Get PDF
    BACKGROUND: Intrahepatic cholangiocarcinomas (ICC) are relatively rare malignant tumors associated with a poor prognosis. Recent studies using genome-wide sequencing technologies have mainly focused on identifying new driver mutations. There is nevertheless a need to investigate the spectrum of copy number aberrations in order to identify potential target genes in the altered chromosomal regions. The aim of this study was to characterize the patterns of chromosomal copy-number alterations (CNAs) in ICC. METHODS: 53 patients having ICC with frozen material were selected. In 47 cases, DNA hybridization has been performed on a genomewide SNP array. A procedure with a segmentation step and a calling step classified genomic regions into copy-number aberration states. We identified the exclusively amplified and deleted recurrent genomic areas. These areas are those showing the highest estimated propensity level for copy loss (resp. copy gain) together with the lowest level for copy gain (resp. copy loss). We investigated ICC clustering. We analyzed the relationships between CNAs and clinico-pathological characteristics. RESULTS: The overall genomic profile of ICC showed many alterations with higher rates for the deletions. Exclusively deleted genomic areas were 1p, 3p and 14q. The main exclusively amplified genomic areas were 1q, 7p, 7q and 8q. Based on the exclusively deleted/amplified genomic areas, a clustering analysis identified three tumors groups: the first group characterized by copy loss of 1p and copy gain of 7p, the second group characterized by 1p and 3p copy losses without 7p copy gain, the last group characterized mainly by very few CNAs. From univariate analyses, the number of tumors, the size of the largest tumor and the stage were significantly associated with shorter time recurrence. We found no relationship between the number of altered cytobands or tumor groups and time to recurrence. CONCLUSION: This study describes the spectrum of chromosomal aberrations across the whole genome. Some of the recurrent exclusive CNAs harbor candidate target genes. Despite the absence of correlation between CNAs and clinico-pathological characteristics, the co-occurence of 7p gain and 1p loss in a subgroup of patients may suggest a differential activation of EGFR and its downstream pathways, which may have a potential effect on targeted therapies

    Integrative characterisation and prediction of the radiation response in radiation oncology

    Get PDF
    corecore