Article thumbnail

Determining Frequent Patterns of Copy Number Alterations in Cancer

By Franck Rapaport and Christina Leslie


Cancer progression is often driven by an accumulation of genetic changes but also accompanied by increasing genomic instability. These processes lead to a complicated landscape of copy number alterations (CNAs) within individual tumors and great diversity across tumor samples. High resolution array-based comparative genomic hybridization (aCGH) is being used to profile CNAs of ever larger tumor collections, and better computational methods for processing these data sets and identifying potential driver CNAs are needed. Typical studies of aCGH data sets take a pipeline approach, starting with segmentation of profiles, calls of gains and losses, and finally determination of frequent CNAs across samples. A drawback of pipelines is that choices at each step may produce different results, and biases are propagated forward. We present a mathematically robust new method that exploits probe-level correlations in aCGH data to discover subsets of samples that display common CNAs. Our algorithm is related to recent work on maximum-margin clustering. It does not require pre-segmentation of the data and also provides grouping of recurrent CNAs into clusters. We tested our approach on a large cohort of glioblastoma aCGH samples from The Cancer Genome Atlas and recovered almost all CNAs reported in the initial study. We also found additional significant CNAs missed by the original analysis but supported by earlier studies, and we identified significant correlations between CNAs

Topics: Research Article
Publisher: Public Library of Science
OAI identifier:
Provided by: PubMed Central

To submit an update or takedown request for this paper, please submit an Update/Correction/Removal Request.

Suggested articles


  1. (2007). A faster circular binary segmentation algorithm for the analysis of array CGH data.
  2. (2005). A tutorial on principal component analysis.
  3. (2004). An integrated view of copy number and allelic alterations in the cancer genome using single nucleotide polymorphism arrays.
  4. (2006). Apoptosis promoted by up-regulation of TFPT (TCF3 fusion partner) appears p53 independent, cell type restricted and cell density influenced.
  5. (2003). Array comparative genome hybridization for tumor classification and gene discovery in mouse models of malignant melanoma.
  6. (2007). ArrayCGH-based classification of neuroblastoma into genomic subgroups.
  7. (2007). Assessing the significance of chromosomal aberrations in cancer: methodology and application to glioma.
  8. (2005). Bladder Cancer Stage and Outcome by Array-Based Comparative Genomic Hybridization.
  9. (2008). CDH11 expression is associated with survival in patients with osteosarcoma.
  10. (2007). Characterizing the cancer genome in lung adenocarcinoma.
  11. (1994). Chromosomal gains and losses in uveal melanomas detected by comparative genomic hybridization.
  12. (2004). Circular binary segmentation for the analysis of array-based DNA copy number data.
  13. (2008). Classification of arrayCGH data using fused SVM.
  14. (2005). Comparative analysis of algorithms for identifying amplifications and deletions in array CGH data.
  15. (2008). Comprehensive genomic characterization defines human glioblastoma genes and core pathways.
  16. (2007). DIFFRAC: a discriminative and flexible framework for clustering.
  17. (2009). Down’s syndrome suppression of tumour growth and the role of the calcineurin inhibitor DSCR1.
  18. (2009). Dysregulation of the transcription factors SOX4, CBFB and SMARCC1 correlates with outcome of colorectal cancer.
  19. (2008). Efficient multiclass maximum margin clustering. In:
  20. (2008). Feedback circuit among INK4 tumor suppressors constrains human glioblastoma development.
  21. (2010). Finding recurrent copy number alteration regions: A review of methods.
  22. (2007). FISH 1p/ 19q deletion/imbalance for molecular subclassification of glioblastoma.
  23. (2008). Functional copy-number alterations in cancer.
  24. (1999). Genetic aberrations in glioblastoma multiforme: translocation of chromosome 10 in an O-2A-like cell line.
  25. (1999). Genome-wide analysis of DNA copy-number changes using cDNA microarrays.
  26. (2008). Genomic changes and gene expression profiles reveal that established glioma cell lines are poorly representative of primary human gliomas.
  27. (2009). Genomic profiling and identification of high risk uveal melanoma by array-CGH analysis of primary tumors and liver metastases. Invest Ophthalmol Vis Sci.
  28. (1998). High resolution analysis of DNA copy number variation using comparative genomic hybridization to microarrays.
  29. (2007). High-resolution aCGH and expression profiling identifies a novel genomic subtype of er negative breast cancer.
  30. (2005). Identification of oligodendroglioma specific chromosomal copy number changes in the glioblastoma MI-4 cell line by array-CGH and FISH analyses.
  31. (2007). Inverted and deleted chromosome 16 with deletion of 39 CBFB identified by fluorescence in situ hybridization.
  32. (2004). Least angle regression.
  33. (2007). Maximum margin clustering made practical. In:
  34. (2005). Maximum margin clustering.
  35. (2001). Significance analysis of microarrays applied to the ionizing radiation response.
  36. (2005). Sparsity and smoothness via the fused lasso.
  37. (2006). Spatial normalization of array-CGH data.
  38. (2008). Spatial smoothing and hot spot detection for CGH data using the fused lasso.
  39. (1999). Study of chromosome 12 copy number in breast cancer using fluorescence in situ hybridization.
  40. (2006). Tavare ` S
  41. (2004). The entire regularization path for the support vector machine.
  42. (1999). The TOMLAB optimization environment in Matlab.
  43. (2006). Using array-comparative genomic hybridization to define molecular portraits of primary breast cancers.
  44. (1999). Using SeDuMi 1.02, a MATLAB toolbox for optimization over symmetric cones.