60 research outputs found

    Generalizing Cross-Document Event Coreference Resolution Across Multiple Corpora

    Full text link
    Cross-document event coreference resolution (CDCR) is an NLP task in which mentions of events need to be identified and clustered throughout a collection of documents. CDCR aims to benefit downstream multi-document applications, but despite recent progress on corpora and system development, downstream improvements from applying CDCR have not been shown yet. We make the observation that every CDCR system to date was developed, trained, and tested only on a single respective corpus. This raises strong concerns on their generalizability -- a must-have for downstream applications where the magnitude of domains or event mentions is likely to exceed those found in a curated corpus. To investigate this assumption, we define a uniform evaluation setup involving three CDCR corpora: ECB+, the Gun Violence Corpus and the Football Coreference Corpus (which we reannotate on token level to make our analysis possible). We compare a corpus-independent, feature-based system against a recent neural system developed for ECB+. Whilst being inferior in absolute numbers, the feature-based system shows more consistent performance across all corpora whereas the neural system is hit-and-miss. Via model introspection, we find that the importance of event actions, event time, etc. for resolving coreference in practice varies greatly between the corpora. Additional analysis shows that several systems overfit on the structure of the ECB+ corpus. We conclude with recommendations on how to achieve generally applicable CDCR systems in the future -- the most important being that evaluation on multiple CDCR corpora is strongly necessary. To facilitate future research, we release our dataset, annotation guidelines, and system implementation to the public.Comment: Accepted at CL Journa

    Identification of Broad-Spectrum Antiviral Compounds by Targeting Viral Entry.

    Get PDF
    Viruses are a major threat to human health and economic well-being. In recent years Ebola, Zika, influenza, and chikungunya virus epidemics have raised awareness that infections can spread rapidly before vaccines or specific antagonists can be made available. Broad-spectrum antivirals are drugs with the potential to inhibit infection by viruses from different groups or families, which may be deployed during outbreaks when specific diagnostics, vaccines or directly acting antivirals are not available. While pathogen-directed approaches are generally effective against a few closely related viruses, targeting cellular pathways used by multiple viral agents can have broad-spectrum efficacy. Virus entry, particularly clathrin-mediated endocytosis, constitutes an attractive target as it is used by many viruses. Using a phenotypic screening strategy where the inhibitory activity of small molecules was sequentially tested against different viruses, we identified 12 compounds with broad-spectrum activity, and found a subset blocking viral internalisation and/or fusion. Importantly, we show that compounds identified with this approach can reduce viral replication in a mouse model of Zika infection. This work provides proof of concept that it is possible to identify broad-spectrum inhibitors by iterative phenotypic screenings, and that inhibition of host-pathways critical for viral life cycles can be an effective antiviral strategy

    Genetic Analyses of Heme Oxygenase 1 (HMOX1) in Different Forms of Pancreatitis

    Get PDF
    Contains fulltext : 107993.pdf (publisher's version ) (Open Access)BACKGROUND: Heme oxygenase 1 (HMOX1) is the rate limiting enzyme in heme degradation and a key regulator of inflammatory processes. In animal models the course of pancreatitis was ameliorated by up-regulation of HMOX1 expression. Additionally, carbon monoxide released during heme breakdown inhibited proliferation of pancreatic stellate cells and might thereby prevent the development of chronic pancreatitis (CP). Transcription of HMOX1 in humans is influenced by a GT-repeat located in the promoter. As such, HMOX1 variants might be of importance in the pathogenesis of pancreatitis. METHODS: The GT-repeat and SNP rs2071746 were investigated with fluorescence labelled primers and by melting curve analysis in 285 patients with acute pancreatitis, 208 patients with alcoholic CP, 207 patients with idiopathic/hereditary CP, 147 patients with alcoholic liver cirrhosis, and in 289 controls, respectively. GT-repeat analysis was extended to a total of 446 alcoholic CP patients. In addition, we performed DNA sequencing in 145 patients with alcoholic CP, 138 patients with idiopathic/hereditary CP, 147 patients with alcoholic liver cirrhosis, and 151 controls. Exon 3 screening was extended to additional patients and controls. RESULTS: S- and L-alleles of the GT-repeat, genotypes and alleles of SNP rs2071746 and non-synonymous variants detected by sequencing were found with similar frequencies in all groups. CONCLUSIONS: Although functional data implicate a potential influence of HMOX1 variants on the pathogenesis of pancreatitis, we did not find any association. As rare non-synonymous HMOX1 variants were found in patients and controls, it is rather unlikely that they will have functional consequences essential for pancreatitis development

    A Common Variant of PNPLA3 (p.I148M) Is Not Associated with Alcoholic Chronic Pancreatitis

    Get PDF
    Contains fulltext : 110441.pdf (publisher's version ) (Open Access)BACKGROUND: Chronic pancreatitis (CP) is an inflammatory disease that in some patients leads to exocrine and endocrine dysfunction. In industrialized countries the most common aetiology is chronic alcohol abuse. Descriptions of associated genetic alterations in alcoholic CP are rare. However, a common PNPLA3 variant (p.I148M) is associated with the development of alcoholic liver cirrhosis (ALC). Since, alcoholic CP and ALC share the same aetiology PNPLA3 variant (p.I148M) possibly influences the development of alcoholic CP. METHODS: Using melting curve analysis we genotyped the variant in 1510 patients with pancreatitis or liver disease (961 German and Dutch alcoholic CP patients, 414 German patients with idiopathic or hereditary CP, and 135 patients with ALC). In addition, we included in total 2781 healthy controls in the study. RESULTS: The previously published overrepresentation of GG-genotype was replicated in our cohort of ALC (p-value <0.0001, OR 2.3, 95% CI 1.6-3.3). Distributions of genotype and allele frequencies of the p.I148M variant were comparable in patients with alcoholic CP, idiopathic and hereditary CP and in healthy controls. CONCLUSIONS: The absence of an association of PNPLA3 p.I148M with alcoholic CP seems not to point to a common pathway in the development of alcoholic CP and alcoholic liver cirrhosis

    A Meta-Analysis and Genome-Wide Association Study of Platelet Count and Mean Platelet Volume in African Americans

    Get PDF
    Several genetic variants associated with platelet count and mean platelet volume (MPV) were recently reported in people of European ancestry. In this meta-analysis of 7 genome-wide association studies (GWAS) enrolling African Americans, our aim was to identify novel genetic variants associated with platelet count and MPV. For all cohorts, GWAS analysis was performed using additive models after adjusting for age, sex, and population stratification. For both platelet phenotypes, meta-analyses were conducted using inverse-variance weighted fixed-effect models. Platelet aggregation assays in whole blood were performed in the participants of the GeneSTAR cohort. Genetic variants in ten independent regions were associated with platelet count (N = 16,388) with p<5×10−8 of which 5 have not been associated with platelet count in previous GWAS. The novel genetic variants associated with platelet count were in the following regions (the most significant SNP, closest gene, and p-value): 6p22 (rs12526480, LRRC16A, p = 9.1×10−9), 7q11 (rs13236689, CD36, p = 2.8×10−9), 10q21 (rs7896518, JMJD1C, p = 2.3×10−12), 11q13 (rs477895, BAD, p = 4.9×10−8), and 20q13 (rs151361, SLMO2, p = 9.4×10−9). Three of these loci (10q21, 11q13, and 20q13) were replicated in European Americans (N = 14,909) and one (11q13) in Hispanic Americans (N = 3,462). For MPV (N = 4,531), genetic variants in 3 regions were significant at p<5×10−8, two of which were also associated with platelet count. Previously reported regions that were also significant in this study were 6p21, 6q23, 7q22, 12q24, and 19p13 for platelet count and 7q22, 17q11, and 19p13 for MPV. The most significant SNP in 1 region was also associated with ADP-induced maximal platelet aggregation in whole blood (12q24). Thus through a meta-analysis of GWAS enrolling African Americans, we have identified 5 novel regions associated with platelet count of which 3 were replicated in other ethnic groups. In addition, we also found one region associated with platelet aggregation that may play a potential role in atherothrombosis

    Genome-wide association study identifies multiple susceptibility loci for pancreatic cancer

    Get PDF
    We performed a multistage genome-wide association study including 7,683 individuals with pancreatic cancer and 14,397 controls of European descent. Four new loci reached genome-wide significance: rs6971499 at 7q32.3 (LINC-PINT, per-allele odds ratio (OR) = 0.79, 95% confidence interval (CI) 0.74-0.84, P = 3.0 x 10(-12)), rs7190458 at 16q23.1 (BCAR1/CTRB1/CTRB2, OR = 1.46, 95% CI 1.30-1.65, P = 1.1 x 10(-10)), rs9581943 at 13q12.2 (PDX1, OR = 1.15, 95% CI 1.10-1.20, P = 2.4 x 10(-9)) and rs16986825 at 22q12.1 (ZNRF3, OR = 1.18, 95% CI 1.12-1.25, P = 1.2 x 10(-8)). We identified an independent signal in exon 2 of TERT at the established region 5p15.33 (rs2736098, OR = 0.80, 95% CI 0.76-0.85, P = 9.8 x 10(-14)). We also identified a locus at 8q24.21 (rs1561927, P = 1.3 x 10(-7)) that approached genome-wide significance located 455 kb telomeric of PVT1. Our study identified multiple new susceptibility alleles for pancreatic cancer that are worthy of follow-up studies

    Common non-synonymous SNPs associated with breast cancer susceptibility: findings from the Breast Cancer Association Consortium.

    Get PDF
    Candidate variant association studies have been largely unsuccessful in identifying common breast cancer susceptibility variants, although most studies have been underpowered to detect associations of a realistic magnitude. We assessed 41 common non-synonymous single-nucleotide polymorphisms (nsSNPs) for which evidence of association with breast cancer risk had been previously reported. Case-control data were combined from 38 studies of white European women (46 450 cases and 42 600 controls) and analyzed using unconditional logistic regression. Strong evidence of association was observed for three nsSNPs: ATXN7-K264R at 3p21 [rs1053338, per allele OR = 1.07, 95% confidence interval (CI) = 1.04-1.10, P = 2.9 × 10(-6)], AKAP9-M463I at 7q21 (rs6964587, OR = 1.05, 95% CI = 1.03-1.07, P = 1.7 × 10(-6)) and NEK10-L513S at 3p24 (rs10510592, OR = 1.10, 95% CI = 1.07-1.12, P = 5.1 × 10(-17)). The first two associations reached genome-wide statistical significance in a combined analysis of available data, including independent data from nine genome-wide association studies (GWASs): for ATXN7-K264R, OR = 1.07 (95% CI = 1.05-1.10, P = 1.0 × 10(-8)); for AKAP9-M463I, OR = 1.05 (95% CI = 1.04-1.07, P = 2.0 × 10(-10)). Further analysis of other common variants in these two regions suggested that intronic SNPs nearby are more strongly associated with disease risk. We have thus identified a novel susceptibility locus at 3p21, and confirmed previous suggestive evidence that rs6964587 at 7q21 is associated with risk. The third locus, rs10510592, is located in an established breast cancer susceptibility region; the association was substantially attenuated after adjustment for the known GWAS hit. Thus, each of the associated nsSNPs is likely to be a marker for another, non-coding, variant causally related to breast cancer risk. Further fine-mapping and functional studies are required to identify the underlying risk-modifying variants and the genes through which they act.BCAC is funded by Cancer Research UK (C1287/A10118, C1287/A12014) and by the European Community’s Seventh Framework Programme under grant agreement n8 223175 (HEALTH-F2–2009-223175) (COGS). Meetings of the BCAC have been funded by the European Union COST programme (BM0606). Genotyping of the iCOGS array was funded by the European Union (HEALTH-F2-2009-223175), Cancer Research UK (C1287/A10710), the Canadian Institutes of Health Research for the ‘CIHR Team in Familial Risks of Breast Cancer’ program and the Ministry of Economic Development, Innovation and Export Trade of Quebec (PSR-SIIRI-701). Additional support for the iCOGS infrastructure was provided by the National Institutes of Health (CA128978) and Post-Cancer GWAS initiative (1U19 CA148537, 1U19 CA148065 and 1U19 CA148112—the GAME-ON initiative), the Department of Defence (W81XWH-10-1-0341), Komen Foundation for the Cure, the Breast Cancer Research Foundation, and the Ovarian Cancer Research Fund. The ABCFS and OFBCR work was supported by grant UM1 CA164920 from the National Cancer Institute (USA). The content of this manuscript does not necessarily reflect the views or policies of the National Cancer Institute or any of the collaborating centers in the Breast Cancer Family Registry (BCFR), nor does mention of trade names, commercial products or organizations imply endorsement t by the US Government or the BCFR. The ABCFS was also supported by the National Health and Medical Research Council of Australia, the New South Wales Cancer Council, the Victorian Health Promotion Foundation (Australia) and the Victorian Breast Cancer Research Consortium. J.L.H. is a National Health and Medical Research Council (NHMRC) Senior Principal Research Fellow and M.C.S. is a NHMRC Senior Research Fellow. The OFBCR work was also supported by the Canadian Institutes of Health Research ‘CIHR Team in Familial Risks of Breast Cancer’ program. The ABCS was funded by the Dutch Cancer Society Grant no. NKI2007-3839 and NKI2009-4363. The ACP study is funded by the Breast Cancer Research Trust, UK. The work of the BBCC was partly funded by ELAN-Programme of the University Hospital of Erlangen. The BBCS is funded by Cancer Research UK and Breakthrough Breast Cancer and acknowledges NHS funding to the NIHR Biomedical Research Centre, and the National Cancer Research Network (NCRN). E.S. is supported by NIHR Comprehensive Biomedical Research Centre, Guy’s & St. Thomas’ NHS Foundation Trust in partnership with King’s College London, UK. Core funding to the Wellcome Trust Centre for Human Genetics was provided by the Wellcome Trust (090532/Z/09/Z). I.T. is supported by the Oxford Biomedical Research Centre. The BSUCH study was supported by the Dietmar-Hopp Foundation, the Helmholtz Society and the German Cancer Research Center (DKFZ). The CECILE study was funded by the Fondation de France, the French National Institute of Cancer (INCa), The National League against Cancer, the National Agency for Environmental l and Occupational Health and Food Safety (ANSES), the National Agency for Research (ANR), and the Association for Research against Cancer (ARC). The CGPS was supported by the Chief Physician Johan Boserup and Lise Boserup Fund, the Danish Medical Research Council and Herlev Hospital.The CNIO-BCS was supported by the Genome Spain Foundation the Red Temática de Investigación Cooperativa en Cáncer and grants from the Asociación Española Contra el Cáncer and the Fondo de Investigación Sanitario PI11/00923 and PI081120). The Human Genotyping-CEGEN Unit, CNIO is supported by the Instituto de Salud Carlos III. D.A. was supported by a Fellowship from the Michael Manzella Foundation (MMF) and was a participant in the CNIO Summer Training Program. The CTS was initially supported by the California Breast Cancer Act of 1993 and the California Breast Cancer Research Fund (contract 97-10500) and is currently funded through the National Institutes of Health (R01 CA77398). Collection of cancer incidence e data was supported by the California Department of Public Health as part of the statewide cancer reporting program mandated by California Health and Safety Code Section 103885. HAC receives support from the Lon V Smith Foundation (LVS39420). The ESTHER study was supported by a grant from the Baden Württemberg Ministry of Science, Research and Arts. Additional cases were recruited in the context of the VERDI study, which was supported by a grant from the German Cancer Aid (Deutsche Krebshilfe). The GENICA was funded by the Federal Ministry of Education and Research (BMBF) Germany grants 01KW9975/5, 01KW9976/8, 01KW9977/0 and 01KW0114, the Robert Bosch Foundation, Stuttgart, Deutsches Krebsforschungszentrum (DKFZ), Heidelberg Institute for Prevention and Occupational Medicine of the German Social Accident Insurance, Institute of the Ruhr University Bochum (IPA), as well as the Department of Internal Medicine , Evangelische Kliniken Bonn gGmbH, Johanniter Krankenhaus Bonn, Germany. The HEBCS was supported by the Helsinki University Central Hospital Research Fund, Academy of Finland (132473), the Finnish Cancer Society, The Nordic Cancer Union and the Sigrid Juselius Foundation. The HERPACC was supported by a Grant-in-Aid for Scientific Research on Priority Areas from the Ministry of Education, Science, Sports, Culture and Technology of Japan, by a Grant-in-Aid for the Third Term Comprehensive 10-Year strategy for Cancer Control from Ministry Health, Labour and Welfare of Japan, by a research grant from Takeda Science Foundation , by Health and Labour Sciences Research Grants for Research on Applying Health Technology from Ministry Health, Labour and Welfare of Japan and by National Cancer Center Research and Development Fund. The HMBCS was supported by short-term fellowships from the German Academic Exchange Program (to N.B), and the Friends of Hannover Medical School (to N.B.). Financial support for KARBAC was provided through the regional agreement on medical training and clinical research (ALF) between Stockholm County Council and Karolinska Institutet, the Stockholm Cancer Foundation and the Swedish Cancer Society. The KBCP was financially supported by the special Government Funding (EVO) of Kuopio University Hospital grants, Cancer Fund of North Savo, the Finnish Cancer Organizations, the Academy of Finland and by the strategic funding of the University of Eastern Finland. kConFab is supported by grants from the National Breast Cancer Foundation , the NHMRC, the Queensland Cancer Fund, the Cancer Councils of New South Wales, Victoria, Tasmania and South Australia and the Cancer Foundation of Western Australia. The kConFab Clinical Follow Up Study was funded by the NHMRC (145684, 288704, 454508). Financial support for the AOCS was provided by the United States Army Medical Research and Materiel Command (DAMD17-01-1-0729), the Cancer Council of Tasmania and Cancer Foundation of Western Australia and the NHMRC (199600). G.C.T. and P.W. are supported by the NHMRC. LAABC is supported by grants (1RB-0287, 3PB-0102, 5PB-0018 and 10PB-0098) from the California Breast Cancer Research Program. Incident breast cancer cases were collected by the USC Cancer Surveillance Program (CSP) which is supported under subcontract by the California Department of Health. The CSP is also part of the National Cancer Institute’s Division of Cancer Prevention and Control Surveillance, Epidemiology, and End Results Program, under contract number N01CN25403. LMBC is supported by the ‘Stichting tegen Kanker’ (232-2008 and 196-2010). The MARIE study was supported by the Deutsche Krebshilfe e.V. (70-2892-BR I), the Federal Ministry of Education Research (BMBF) Germany (01KH0402), the Hamburg Cancer Society and the German Cancer Research Center (DKFZ). MBCSG is supported by grants from the Italian Association ciation for Cancer Research (AIRC) and by funds from the Italian citizens who allocated a 5/1000 share of their tax payment in support of the Fondazione IRCCS Istituto Nazionale Tumori, according to Italian laws (INT-Institutional strategic projects ‘5 × 1000’). The MCBCS was supported by the NIH grants (CA122340, CA128978) and a Specialized Program of Research Excellence (SPORE) in Breast Cancer (CA116201), the Breast Cancer Research Foundation and a generous gift from the David F. and Margaret T. Grohne Family Foundation and the Ting Tsung and Wei Fong Chao Foundation. MCCS cohort recruitment was funded by VicHealth and Cancer Council Victoria. The MCCS was further supported by Australian NHMRC grants 209057, 251553 and 504711 and by infrastructure provided by Cancer Council Victoria. The MEC was supported by NIH grants CA63464, CA54281, CA098758 and CA132839. The work of MTLGEBCS was supported by the Quebec Breast Cancer Foundation, the Canadian Institutes of Health Research (grant CRN-87521) and the Ministry of Economic Development, Innovation and Export Trade (grant PSR-SIIRI-701). MYBRCA is funded by research grants from the Malaysian Ministry of Science, Technology and Innovation (MOSTI), Malaysian Ministry of Higher Education (UM.C/HlR/MOHE/06) and Cancer Research Initiatives Foundation (CARIF). Additional controls were recruited by the Singapore Eye Research Institute, which was supported by a grant from the Biomedical Research Council (BMRC08/1/35/19,tel:08/1/35/19./550), Singapore and the National medical Research Council, Singapore (NMRC/CG/SERI/2010). The NBCS was supported by grants from the Norwegian Research council (155218/V40, 175240/S10 to A.L.B.D., FUGE-NFR 181600/ V11 to V.N.K. and a Swizz Bridge Award to A.L.B.D.). The NBHS was supported by NIH grant R01CA100374. Biological sample preparation was conducted the Survey and Biospecimen Shared Resource, which is supported by P30 CA68485. The OBCS was supported by research grants from the Finnish Cancer Foundation, the Sigrid Juselius Foundation, the Academy of Finland, the University of Oulu, and the Oulu University Hospital. The ORIGO study was supported by the Dutch Cancer Society (RUL 1997-1505) and the Biobanking and Biomolecular Resources Research Infrastructure (BBMRI-NLCP16). The PBCS was funded by Intramural Research Funds of the National Cancer Institute, Department of Health and Human Services, USA. pKARMA is a combination of the KARMA and LIBRO-1 studies. KARMA was supported by Ma¨rit and Hans Rausings Initiative Against Breast Cancer. KARMA and LIBRO-1 were supported the Cancer Risk Prediction Center (CRisP; www.crispcenter.org), a Linnaeus Centre (Contract ID 70867902) financed by the Swedish Research Council. The RBCS was funded by the Dutch Cancer Society (DDHK 2004-3124, DDHK 2009-4318). SASBAC was supported by funding from the Agency for Science, Technology and Research of Singapore (A∗STAR), the US National Institute of Health (NIH) and the Susan G. Komen Breast Cancer Foundation KC was financed by the Swedish Cancer Society (5128-B07-01PAF). The SBCGS was supported primarily by NIH grants R01CA64277, R01CA148667, and R37CA70867. Biological sample preparation was conducted the Survey and Biospecimen Shared Resource, which is supported by P30 CA68485. The SBCS was supported by Yorkshire Cancer Research S305PA, S299 and S295. Funding for the SCCS was provided by NIH grant R01 CA092447. The Arkansas Central Cancer Registry is fully funded by a grant from National Program of Cancer Registries, Centers for Disease Control and Prevention (CDC). Data on SCCS cancer cases from Mississippi were collected by the Mississippi Cancer Registry which participates in the National Program of Cancer Registries (NPCR) of the Centers for Disease Control and Prevention (CDC). The contents of this publication are solely the responsibility of the authors and do not necessarily represent the official views of the CDC or the Mississippi Cancer Registry. SEARCH is funded by a programme grant from Cancer Research UK (C490/A10124) and supported by the UK National Institute for Health Research Biomedical Research Centre at the University of Cambridge. The SEBCS was supported by the BRL (Basic Research Laboratory) program through the National Research Foundation of Korea funded by the Ministry of Education, Science and Technology (2012-0000347). SGBCC is funded by the National Medical Research Council Start-up Grant and Centre Grant (NMRC/CG/NCIS /2010). The recruitment of controls by the Singapore Consortium of Cohort Studies-Multi-ethnic cohort (SCCS-MEC) was funded by the Biomedical Research Council (grant number: 05/1/21/19/425). SKKDKFZS is supported by the DKFZ. The SZBCS was supported by Grant PBZ_KBN_122/P05/2004. K. J. is a fellow of International PhD program, Postgraduate School of Molecular Medicine, Warsaw Medical University, supported by the Polish Foundation of Science. The TNBCC was supported by the NIH grant (CA128978), the Breast Cancer Research Foundation , Komen Foundation for the Cure, the Ohio State University Comprehensive Cancer Center, the Stefanie Spielman Fund for Breast Cancer Research and a generous gift from the David F. and Margaret T. Grohne Family Foundation and the Ting Tsung and Wei Fong Chao Foundation. Part of the TNBCC (DEMOKRITOS) has been co-financed by the European Union (European Social Fund – ESF) and Greek National Funds through the Operational Program ‘Education and Life-long Learning’ of the National Strategic Reference Framework (NSRF)—Research Funding Program of the General Secretariat for Research & Technology: ARISTEIA. The TWBCS is supported by the Institute of Biomedical Sciences, Academia Sinica and the National Science Council, Taiwan. The UKBGS is funded by Breakthrough Breast Cancer and the Institute of Cancer Research (ICR). ICR acknowledges NHS funding to the NIHR Biomedical Research Centre. Funding to pay the Open Access publication charges for this article was provided by the Wellcome Trust.This is the advanced access published version distributed under a Creative Commons Attribution License 2.0, which can also be viewed on the publisher's webstie at: http://hmg.oxfordjournals.org/content/early/2014/07/04/hmg.ddu311.full.pdf+htm

    Fine-Scale Mapping of the 4q24 Locus Identifies Two Independent Loci Associated with Breast Cancer Risk

    Get PDF
    Background: A recent association study identified a common variant (rs9790517) at 4q24 to be associated with breast cancer risk. Independent association signals and potential functional variants in this locus have not been explored. Methods: We conducted a fine-mapping analysis in 55,540 breast cancer cases and 51,168 controls from the Breast Cancer Association Consortium. Results: Conditional analyses identified two independent association signals among women of European ancestry, represented by rs9790517 [conditional P = 2.51 × 10−4; OR, 1.04; 95% confidence interval (CI), 1.02–1.07] and rs77928427 (P = 1.86 × 10−4; OR, 1.04; 95% CI, 1.02–1.07). Functional annotation using data from the Encyclopedia of DNA Elements (ENCODE) project revealed two putative functional variants, rs62331150 and rs73838678 in linkage disequilibrium (LD) with rs9790517 (r2 ≥ 0.90) residing in the active promoter or enhancer, respectively, of the nearest gene, TET2. Both variants are located in DNase I hypersensitivity and transcription factor–binding sites. Using data from both The Cancer Genome Atlas (TCGA) and Molecular Taxonomy of Breast Cancer International Consortium (METABRIC), we showed that rs62331150 was associated with level of expression of TET2 in breast normal and tumor tissue. Conclusion: Our study identified two independent association signals at 4q24 in relation to breast cancer risk and suggested that observed association in this locus may be mediated through the regulation of TET2. Impact: Fine-mapping study with large sample size warranted for identification of independent loci for breast cancer risk
    corecore