66 research outputs found
Similarity Measures for Clustering SNP Data
The issue of suitable similarity measures for a particular kind of genetic data – so called SNP data – arises from the GENICA (Interdisciplinary
Study Group on Gene Environment Interaction and Breast Cancer in Germany) case-control study of sporadic breast cancer. The GENICA study
aims to investigate the influence and interaction of single nucleotide polymorphic (SNP) loci and exogenous risk factors. A single nucleotide
polymorphism is a point mutation that is present in at least 1 % of a population. SNPs are the most common form of human genetic variations.
In particular, we consider 65 SNP loci and 2 insertions of longer sequences in genes involved in the metabolism of hormones, xenobiotics and drugs as well as in the repair of DNA and signal transduction. Assuming that these single nucleotide changes may lead, for instance, to altered enzymes or to a reduced or enhanced amount of the original enzymes – with each alteration
alone having minor effects – we aim to detect combinations of SNPs that under certain environmental conditions increase the risk of sporadic breast cancer. The search for patterns in the present data set may be performed by a variety
of clustering and classification approaches. We consider here the problem of
suitable measures of proximity of two variables or subjects as an indispensable basis for a further cluster analysis.
Generally, clustering approaches are a useful tool to detect structures and to
generate hypothesis about potential relationships in complex data situations.
Searching for patterns in the data there are two possible objectives: the identification of groups of similar objects or subjects or the identification of
groups of similar variables within the whole or within subpopulations.
Comparing the individual genetic profiles as well as comparing the genetic information across subpopulations we discuss possible choices of similarity measures, in particular similarity measures based on the counts of matches and mismatches. New matching coefficients are introduced with a more flexible weighting scheme to account for the general problem of the comparison of SNP data: The large proportion of homozygous reference sequences relative to the homo- and heterozygous SNPs is masking the accordances and differences of interest
FGF receptor genes and breast cancer susceptibility: results from the Breast Cancer Association Consortium
Background:Breast cancer is one of the most common malignancies in women. Genome-wide association studies have identified FGFR2 as a breast cancer susceptibility gene. Common variation in other fibroblast growth factor (FGF) receptors might also modify risk. We tested this hypothesis by studying genotyped single-nucleotide polymorphisms (SNPs) and imputed SNPs in FGFR1, FGFR3, FGFR4 and FGFRL1 in the Breast Cancer Association Consortium.
Methods:Data were combined from 49 studies, including 53 835 cases and 50 156 controls, of which 89 050 (46 450 cases and 42 600 controls) were of European ancestry, 12 893 (6269 cases and 6624 controls) of Asian and 2048 (1116 cases and 932 controls) of African ancestry. Associations with risk of breast cancer, overall and by disease sub-type, were assessed using unconditional logistic regression.
Results:Little evidence of association with breast cancer risk was observed for SNPs in the FGF receptor genes. The strongest evidence in European women was for rs743682 in FGFR3; the estimated per-allele odds ratio was 1.05 (95 confidence interval=1.02-1.09, P=0.0020), which is substantially lower than that observed for SNPs in FGFR2.
Conclusion:Our results suggest that common variants in the other FGF receptors are not associated with risk of breast cancer to the degree observed for FGFR2. © 2014 Cancer Research UK
An intergenic risk locus containing an enhancer deletion in 2q35 modulates breast cancer risk by deregulating IGFBP5 expression.
Breast cancer is the most diagnosed malignancy and the second leading cause of cancer mortality in females. Previous association studies have identified variants on 2q35 associated with the risk of breast cancer. To identify functional susceptibility loci for breast cancer, we interrogated the 2q35 gene desert for chromatin architecture and functional variation correlated with gene expression. We report a novel intergenic breast cancer risk locus containing an enhancer copy number variation (enCNV; deletion) located approximately 400Kb upstream to IGFBP5, which overlaps an intergenic ERα-bound enhancer that loops to the IGFBP5 promoter. The enCNV is correlated with modified ERα binding and monoallelic-repression of IGFBP5 following estrogen treatment. We investigated the association of enCNV genotype with breast cancer in 1,182 cases and 1,362 controls, and replicate our findings in an independent set of 62,533 cases and 60,966 controls from 41 case control studies and 11 GWAS. We report a dose-dependent inverse association of 2q35 enCNV genotype (percopy OR=0.68 95%CI 0.55-0.83, P=0.0002; replication OR=0.77 95%CI 0.73-0.82, P=2.1x10(-19)) and identify 13 additional linked variants (r(2)>0.8) in the 20Kb linkage block containing the enCNV (P=3.2x10(-15) - 5.6x10(-17)). These associations were independent of previously reported 2q35 variants, rs13387042/rs4442975 and rs16857609, and were stronger for ER-positive than ER-negative disease. Together, these results suggest that 2q35 breast cancer risk loci may be mediating their effect through IGFBP5
An intergenic risk locus containing an enhancer deletion in 2q35 modulates breast cancer risk by deregulating IGFBP5 expression.
Breast cancer is the most diagnosed malignancy and the second leading cause of cancer mortality in females. Previous association studies have identified variants on 2q35 associated with the risk of breast cancer. To identify functional susceptibility loci for breast cancer, we interrogated the 2q35 gene desert for chromatin architecture and functional variation correlated with gene expression. We report a novel intergenic breast cancer risk locus containing an enhancer copy number variation (enCNV; deletion) located approximately 400Kb upstream to IGFBP5, which overlaps an intergenic ERα-bound enhancer that loops to the IGFBP5 promoter. The enCNV is correlated with modified ERα binding and monoallelic-repression of IGFBP5 following estrogen treatment. We investigated the association of enCNV genotype with breast cancer in 1,182 cases and 1,362 controls, and replicate our findings in an independent set of 62,533 cases and 60,966 controls from 41 case control studies and 11 GWAS. We report a dose-dependent inverse association of 2q35 enCNV genotype (percopy OR=0.68 95%CI 0.55-0.83, P=0.0002; replication OR=0.77 95%CI 0.73-0.82, P=2.1x10(-19)) and identify 13 additional linked variants (r(2)>0.8) in the 20Kb linkage block containing the enCNV (P=3.2x10(-15) - 5.6x10(-17)). These associations were independent of previously reported 2q35 variants, rs13387042/rs4442975 and rs16857609, and were stronger for ER-positive than ER-negative disease. Together, these results suggest that 2q35 breast cancer risk loci may be mediating their effect through IGFBP5
Candidate locus analysis of the TERT-CLPTM1L cancer risk region on chromosome 5p15 identifies multiple independent variants associated with endometrial cancer risk.
Several studies have reported associations between multiple cancer types and single-nucleotide polymorphisms (SNPs) on chromosome 5p15, which harbours TERT and CLPTM1L, but no such association has been reported with endometrial cancer. To evaluate the role of genetic variants at the TERT-CLPTM1L region in endometrial cancer risk, we carried out comprehensive fine-mapping analyses of genotyped and imputed SNPs using a custom Illumina iSelect array which includes dense SNP coverage of this region. We examined 396 SNPs (113 genotyped, 283 imputed) in 4,401 endometrial cancer cases and 28,758 controls. Single-SNP and forward/backward logistic regression models suggested evidence for three variants independently associated with endometrial cancer risk (P = 4.9 × 10(-6) to P = 7.7 × 10(-5)). Only one falls into a haplotype previously associated with other cancer types (rs7705526, in TERT intron 1), and this SNP has been shown to alter TERT promoter activity. One of the novel associations (rs13174814) maps to a second region in the TERT promoter and the other (rs62329728) is in the promoter region of CLPTM1L; neither are correlated with previously reported cancer-associated SNPs. Using TCGA RNASeq data, we found significantly increased expression of both TERT and CLPTM1L in endometrial cancer tissue compared with normal tissue (TERT P = 1.5 × 10(-18), CLPTM1L P = 1.5 × 10(-19)). Our study thus reports a novel endometrial cancer risk locus and expands the spectrum of cancer types associated with genetic variation at 5p15, further highlighting the importance of this region for cancer susceptibility.This work was supported by the NHMRC Project Grant (ID#1031333). This work was also supported by Cancer Research UK (C1287/A10118,
C1287/A 10710, C12292/A11174, C1281/A12014, C5047/A8384,
C5047/A15007, C5047/A10692)This is the published version. It first appeared at http://link.springer.com/article/10.1007%2Fs00439-014-1515-4
Genetic predisposition to in situ and invasive lobular carcinoma of the breast.
Invasive lobular breast cancer (ILC) accounts for 10-15% of all invasive breast carcinomas. It is generally ER positive (ER+) and often associated with lobular carcinoma in situ (LCIS). Genome-wide association studies have identified more than 70 common polymorphisms that predispose to breast cancer, but these studies included predominantly ductal (IDC) carcinomas. To identify novel common polymorphisms that predispose to ILC and LCIS, we pooled data from 6,023 cases (5,622 ILC, 401 pure LCIS) and 34,271 controls from 36 studies genotyped using the iCOGS chip. Six novel SNPs most strongly associated with ILC/LCIS in the pooled analysis were genotyped in a further 516 lobular cases (482 ILC, 36 LCIS) and 1,467 controls. These analyses identified a lobular-specific SNP at 7q34 (rs11977670, OR (95%CI) for ILC = 1.13 (1.09-1.18), P = 6.0 × 10(-10); P-het for ILC vs IDC ER+ tumors = 1.8 × 10(-4)). Of the 75 known breast cancer polymorphisms that were genotyped, 56 were associated with ILC and 15 with LCIS at P<0.05. Two SNPs showed significantly stronger associations for ILC than LCIS (rs2981579/10q26/FGFR2, P-het = 0.04 and rs889312/5q11/MAP3K1, P-het = 0.03); and two showed stronger associations for LCIS than ILC (rs6678914/1q32/LGR6, P-het = 0.001 and rs1752911/6q14, P-het = 0.04). In addition, seven of the 75 known loci showed significant differences between ER+ tumors with IDC and ILC histology, three of these showing stronger associations for ILC (rs11249433/1p11, rs2981579/10q26/FGFR2 and rs10995190/10q21/ZNF365) and four associated only with IDC (5p12/rs10941679; rs2588809/14q24/RAD51L1, rs6472903/8q21 and rs1550623/2q31/CDCA7). In conclusion, we have identified one novel lobular breast cancer specific predisposition polymorphism at 7q34, and shown for the first time that common breast cancer polymorphisms predispose to LCIS. We have shown that many of the ER+ breast cancer predisposition loci also predispose to ILC, although there is some heterogeneity between ER+ lobular and ER+ IDC tumors. These data provide evidence for overlapping, but distinct etiological pathways within ER+ breast cancer between morphological subtypes
- …