9 research outputs found
Germline Variation in Cancer-Susceptibility Genes in a Healthy, Ancestrally Diverse Cohort: Implications for Individual Genome Sequencing
<div><p>Technological advances coupled with decreasing costs are bringing whole genome and whole exome sequencing closer to routine clinical use. One of the hurdles to clinical implementation is the high number of variants of unknown significance. For cancer-susceptibility genes, the difficulty in interpreting the clinical relevance of the genomic variants is compounded by the fact that most of what is known about these variants comes from the study of highly selected populations, such as cancer patients or individuals with a family history of cancer. The genetic variation in known cancer-susceptibility genes in the general population has not been well characterized to date. To address this gap, we profiled the nonsynonymous genomic variation in 158 genes causally implicated in carcinogenesis using high-quality whole genome sequences from an ancestrally diverse cohort of 681 healthy individuals. We found that all individuals carry multiple variants that may impact cancer susceptibility, with an average of 68 variants per individual. Of the 2,688 allelic variants identified within the cohort, most are very rare, with 75% found in only 1 or 2 individuals in our population. Allele frequencies vary between ancestral groups, and there are 21 variants for which the minor allele in one population is the major allele in another. Detailed analysis of a selected subset of 5 clinically important cancer genes, <i>BRCA1</i>, <i>BRCA2</i>, <i>KRAS</i>, <i>TP53</i>, and <i>PTEN</i>, highlights differences between germline variants and reported somatic mutations. The dataset can serve as a resource of genetic variation in cancer-susceptibility genes in 6 ancestry groups, an important foundation for the interpretation of cancer risk from personal genome sequences.</p></div>
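The rarity statistic in the abstract (the share of variants found in only 1 or 2 individuals) can be illustrated with a minimal Python sketch. The function name, the input layout (a mapping from variant ID to number of carriers), and the toy data are assumptions made for illustration; they are not the authors' pipeline or dataset.

```python
def fraction_rare(carriers_per_variant, max_carriers=2):
    """Return the fraction of variants carried by at most `max_carriers` individuals.

    `carriers_per_variant` maps a variant ID to the number of individuals
    carrying it (a hypothetical layout, chosen only for this sketch).
    """
    if not carriers_per_variant:
        raise ValueError("no variants supplied")
    rare = sum(1 for n in carriers_per_variant.values() if n <= max_carriers)
    return rare / len(carriers_per_variant)


# Toy data, not taken from the study: three rare variants and one common one.
toy_carriers = {"var_a": 1, "var_b": 2, "var_c": 1, "var_d": 40}
print(fraction_rare(toy_carriers))
```

Applied to real per-variant carrier counts, the same calculation would give the kind of figure the abstract reports, provided carriers are counted per individual rather than per allele.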
Number of cancer-gene variants per individual by ancestry.
<p>The distribution of the number of nonsynonymous genes per subject for each of the 6 ancestry-based subpopulations.</p>
Profile of the variability per individual.
<p>(A) Boxplot of the total number of variants, the number of variants listed in HGMD, the number of likely deleterious variants, and the number of variants of unknown significance per individual for cancer-associated genes. (B) Distribution of the number of cancer genes with at least one nonsynonymous variant per individual.</p>
Admixture coefficients for the subpopulations.
<p>The admixture proportions of the 6 ancestral populations (colors) are displayed for all individuals in each of the 7 groups defined in the cohort (panels). (A) European (B) Central Asian (C) East Asian (D) African (E) African-European (F) Hispanic (G) Other. Red: European, Blue: Central Asian, Cyan: East Asian, Yellow: African, Green: Native American, Magenta: Oceania.</p>
Correlation between the number of variants and coding length.
<p>The number of nonsynonymous variants vs. total number of coding bases for each of the 158 cancer-susceptibility genes.</p>
p53 DNA-binding domain variants.
<p>The DNA-binding domain of the p53 protein (black) bound to DNA (purple) <a href="http://www.plosone.org/article/info:doi/10.1371/journal.pone.0094554#pone.0094554-Cho1" target="_blank">[84]</a>. Common somatic mutations (yellow) contact the DNA or stabilize the structure. Variants in our cohort (red) occur at residues distal to the DNA binding site except for Arg 283 (green).</p>
Variation prevalence per gene.
<p>Distribution of the number of individuals with a variant per gene for (A) all variants (B) rare variants.</p>
Average numbers of cancer-gene variants per individual by ancestry.
<p>Average numbers of cancer-gene variants per individual by ancestry.</p>
Positions at which the major allele differs between ancestry groups.
<p>Bold: Allele frequencies >0.5.</p>
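The 0.5 threshold in this table note can be turned into a simple check for positions where the major allele differs between ancestry groups. The sketch below is illustrative only: the function name, the nested-dictionary layout, and the toy frequencies are assumptions, not values or code from the study.

```python
def major_allele_flips(freqs, threshold=0.5):
    """Return IDs of variants whose allele frequency lies above `threshold`
    in at least one group and below it in at least one other group.

    `freqs` maps variant ID -> {group name: frequency of the same allele}.
    This layout is hypothetical and chosen only for the sketch.
    """
    flipped = []
    for variant, by_group in freqs.items():
        values = list(by_group.values())
        above = any(f > threshold for f in values)
        below = any(f < threshold for f in values)
        if above and below:
            flipped.append(variant)
    return flipped


# Toy frequencies, not taken from the study.
toy_freqs = {
    "var_x": {"group_1": 0.72, "group_2": 0.31},  # major in group_1 only
    "var_y": {"group_1": 0.10, "group_2": 0.04},  # minor in both groups
}
print(major_allele_flips(toy_freqs))
```

A frequency of exactly 0.5 is treated as neither major nor minor here. That is a design choice of the sketch and is not stated in the source.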