
    Population-genetic nature of copy number variations in the human genome

    Copy number variations (CNVs) are universal genetic variations, and their association with disease has been increasingly recognized. We designed high-density microarrays for CNVs and detected 3000–4000 CNVs (4–6% of the genomic sequence) per population, including CNVs previously missed because they were small or resided in segmental duplications. The patterns of CNVs across individuals were surprisingly simple at the kilobase scale, suggesting that simple genetic analysis is applicable to these loci. We used probabilistic theory to determine integer copy numbers of CNVs and employed a recently developed phasing tool to estimate the population frequencies of integer copy number alleles and CNV–SNP haplotypes. The results showed a tendency toward lower frequencies of CNV alleles, and most of our CNVs were explained by zero-, one- and two-copy alleles alone. Using the estimated population frequencies, we found several CNV regions with exceptionally high population differentiation. Investigation of CNV–SNP linkage disequilibrium (LD) for 500–900 bi- and multi-allelic CNVs per population revealed that previous conflicting reports on bi-allelic LD were unexpectedly consistent and explained by an LD increase correlated with deletion-allele frequencies. Typically, the bi-allelic LD was lower than SNP–SNP LD, whereas the multi-allelic LD was somewhat stronger than the bi-allelic LD. After further investigation of tag SNPs for CNVs, we conclude that the customary tagging strategy for disease association studies is applicable to common deletion CNVs, but direct interrogation is needed for other types of CNVs.
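
As a rough illustration of the CNV–SNP LD measure mentioned in the abstract above, the sketch below computes the standard r² statistic between a bi-allelic deletion CNV and a SNP from phased haplotype frequencies. The abstract does not state which LD statistic or software was used, so the choice of r², the function name `ld_r2`, and its parameters are assumptions made for illustration only.

```python
def ld_r2(p_ab: float, p_a: float, p_b: float) -> float:
    """Pairwise r^2 between two bi-allelic loci (hypothetical helper).

    p_ab: frequency of haplotypes carrying both the deletion allele
          and the chosen SNP allele
    p_a:  frequency of the deletion allele
    p_b:  frequency of the chosen SNP allele
    """
    # r^2 is undefined when either locus is monomorphic.
    if not (0.0 < p_a < 1.0 and 0.0 < p_b < 1.0):
        raise ValueError("both loci must be polymorphic")
    # D is the deviation of the observed haplotype frequency
    # from its expectation under independence.
    d = p_ab - p_a * p_b
    return d * d / (p_a * (1.0 - p_a) * p_b * (1.0 - p_b))
```

With equal allele frequencies, a SNP allele that always travels with the deletion gives r² close to 1, and independent loci give r² close to 0.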

    CARAT: A novel method for allelic detection of DNA copy number changes using high density oligonucleotide arrays

    BACKGROUND: DNA copy number alterations are one of the main characteristics of the cancer cell karyotype and can contribute to the complex phenotype of these cells. These alterations can lead to gains in cellular oncogenes as well as losses in tumor suppressor genes and can span small intervals as well as involve entire chromosomes. The ability to accurately detect these changes is central to understanding how they impact the biology of the cell. RESULTS: We describe a novel algorithm called CARAT (Copy Number Analysis with Regression And Tree) that uses probe intensity information to infer copy number in an allele-specific manner from high density DNA oligonucleotide arrays designed to genotype over 100,000 SNPs. Total and allele-specific copy number estimations using CARAT are independently evaluated for a subset of SNPs using quantitative PCR and allelic TaqMan reactions with several human breast cancer cell lines. The sensitivity and specificity of the algorithm are characterized using DNA samples containing differing numbers of X chromosomes as well as a test set of normal individuals. Results from the algorithm show a high degree of agreement with results from independent verification methods. CONCLUSION: Overall, CARAT automatically detects regions with copy number variations, assigns a significance score to each alteration, and generates allele-specific output. When coupled with SNP genotype calls from the same array, CARAT provides additional detail on the structure of genome-wide alterations that can contribute to allelic imbalance.

    Increased mRNA expression levels of ERCC1, OGG1 and RAI in colorectal adenomas and carcinomas

    BACKGROUND: The majority of colorectal cancer (CRC) cases develop through the adenoma-carcinoma pathway. If an increase in DNA repair expression is detected in both early adenomas and carcinomas, it may indicate that low repair capacity in the normal mucosa is a risk factor for adenoma formation. METHODS: We examined mRNA expression of two DNA repair genes, ERCC1 and OGG1, as well as the putative apoptosis-controlling gene RAI, in normal tissues and lesions from 36 cases with adenomas (mild/moderate dysplasia, n = 21; severe dysplasia, n = 15) and 9 with carcinomas. RESULTS: Comparing expression levels of ERCC1, OGG1 and RAI between normal tissue and all lesions combined yielded higher expression levels in lesions: 3.3-fold higher (P = 0.005), 5.6-fold higher (P < 3 × 10⁻⁵) and 7.7-fold higher (P = 0.0005), respectively. Expression levels of ERCC1, OGG1 and RAI in lesions did not differ between adenomas and CRC cases (P = 0.836, P = 0.341 and P = 0.909, respectively). In normal tissue, the levels of OGG1 and RAI from CRC cases were significantly lower than those from cases with adenomas (P = 0.012 and P = 0.011, respectively). CONCLUSION: Our results suggest that increased expression of defense genes is an early event in the progression of colorectal adenomas to carcinomas.

    Retrospective evaluation of whole exome and genome mutation calls in 746 cancer samples

    The Cancer Genome Atlas (TCGA) and International Cancer Genome Consortium (ICGC) curated consensus somatic mutation calls using whole exome sequencing (WES) and whole genome sequencing (WGS), respectively. Here, as part of the ICGC/TCGA Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium, which aggregated whole genome sequencing data from 2,658 cancers across 38 tumour types, we compare WES and WGS side-by-side from 746 TCGA samples, finding that ~80% of mutations overlap in covered exonic regions. We estimate that low variant allele fraction (VAF < 15%) and clonal heterogeneity contribute up to 68% of private WGS mutations and 71% of private WES mutations. We observe that ~30% of private WGS mutations trace to mutations identified by a single variant caller in WES consensus efforts. WGS captures both ~50% more variation in exonic regions and un-observed mutations in loci with variable GC-content. Together, our analysis highlights technological divergences between two reproducible somatic variant detection efforts.
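
The kind of side-by-side comparison described in the abstract above can be sketched as a set comparison of two mutation call sets. This is only an illustrative sketch: the abstract does not define its overlap metric, so the Jaccard-style ratio used here, the `compare_calls` function, and the representation of each call as a `(chrom, pos, ref, alt)` key mapped to its VAF are all assumptions. Only the 15% low-VAF threshold comes from the abstract.

```python
from typing import Dict, Tuple

# Hypothetical representation: (chromosome, position, ref allele, alt allele).
Mutation = Tuple[str, int, str, str]


def compare_calls(
    wes: Dict[Mutation, float],
    wgs: Dict[Mutation, float],
    low_vaf: float = 0.15,  # low-VAF threshold quoted in the abstract
) -> Dict[str, float]:
    """Summarise shared and private calls between two call sets.

    Each dict maps a mutation key to its variant allele fraction (VAF).
    """
    shared = wes.keys() & wgs.keys()
    union = wes.keys() | wgs.keys()
    private_wes = wes.keys() - wgs.keys()
    private_wgs = wgs.keys() - wes.keys()

    def low_fraction(keys, calls) -> float:
        # Fraction of the given calls whose VAF is below the threshold.
        if not keys:
            return 0.0
        return sum(calls[k] < low_vaf for k in keys) / len(keys)

    return {
        # Assumed metric: shared calls divided by the union of both sets.
        "overlap": len(shared) / len(union) if union else 0.0,
        "private_wes_low_vaf": low_fraction(private_wes, wes),
        "private_wgs_low_vaf": low_fraction(private_wgs, wgs),
    }
```

A real comparison would also restrict both call sets to regions covered by both assays before computing the overlap, which this sketch leaves out.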