27 research outputs found

    Genome and gene alterations by insertions and deletions in the evolution of human and chimpanzee chromosome 22

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Understanding structure and function of human genome requires knowledge of genomes of our closest living relatives, the primates. Nucleotide insertions and deletions (indels) play a significant role in differentiation that underlies phenotypic differences between humans and chimpanzees. In this study, we evaluated distribution, evolutionary history, and function of indels found by comparing syntenic regions of the human and chimpanzee genomes.</p> <p>Results</p> <p>Specifically, we identified 6,279 indels of 10 bp or greater in a ~33 Mb alignment between human and chimpanzee chromosome 22. After the exclusion of those in repetitive DNA, 1,429 or 23% of indels still remained. This group was characterized according to the local or genome-wide repetitive nature, size, location relative to genes, and other genomic features. We defined three major classes of these indels, using local structure analysis: (i) those indels found uniquely without additional copies of indel sequence in the surrounding (10 Kb) region, (ii) those with at least one exact copy found nearby, and (iii) those with similar but not identical copies found locally. Among these classes, we encountered a high number of exactly repeated indel sequences, most likely due to recent duplications. Many of these indels (683 of 1,429) were in proximity of known human genes. Coding sequences and splice sites contained significantly fewer of these indels than expected from random expectations, suggesting that selection is a factor in limiting their persistence. A subset of indels from coding regions was experimentally validated and their impacts were predicted based on direct sequencing in several human populations as well as chimpanzees, bonobos, gorillas, and two subspecies of orangutans.</p> <p>Conclusion</p> <p>Our analysis demonstrates that while indels are distributed essentially randomly in intergenic and intronic genomic regions, they are significantly under-represented in coding sequences. There are substantial differences in representation of indel classes among genomic elements, most likely caused by differences in their evolutionary histories. Using local sequence context, we predicted origins and phylogenetic relationships of gene-impacting indels in primate species. These results suggest that genome plasticity is a major force behind speciation events separating the great ape lineages.</p

    Atlas of prostate cancer heritability in European and African-American men pinpoints tissue-specific regulation.

    Get PDF
    Although genome-wide association studies have identified over 100 risk loci that explain ∼33% of familial risk for prostate cancer (PrCa), their functional effects on risk remain largely unknown. Here we use genotype data from 59,089 men of European and African American ancestries combined with cell-type-specific epigenetic data to build a genomic atlas of single-nucleotide polymorphism (SNP) heritability in PrCa. We find significant differences in heritability between variants in prostate-relevant epigenetic marks defined in normal versus tumour tissue as well as between tissue and cell lines. The majority of SNP heritability lies in regions marked by H3k27 acetylation in prostate adenoc7arcinoma cell line (LNCaP) or by DNaseI hypersensitive sites in cancer cell lines. We find a high degree of similarity between European and African American ancestries suggesting a similar genetic architecture from common variation underlying PrCa risk. Our findings showcase the power of integrating functional annotation with genetic data to understand the genetic basis of PrCa.This work was supported by NIH fellowship F32 GM106584 (AG), NIH grants R01 MH101244(A.G.), R01 CA188392 (B.P.), U01 CA194393(B.P.), R01 GM107427 (M.L.F.), R01 CA193910 (M.L.F./M.P.) and Prostate Cancer Foundation Challenge Award (M.L.F./M.P.). This study makes use of data generated by the Wellcome Trust Case Control Consortium and the Wellcome Trust Sanger Institute. A full list of the investigators who contributed to the generation of the Wellcome Trust Case Control Consortium data is available on www.wtccc.org.uk. Funding for the Wellcome Trust Case Control Consortium project was provided by the Wellcome Trust under award 076113. This study makes use of data generated by the UK10K Consortium. A full list of the investigators who contributed to the generation of the data is available online (http://www.UK10K.org). The PRACTICAL consortium was supported by the following grants: European Commission's Seventh Framework Programme grant agreement n° 223175 (HEALTH-F2-2009-223175), Cancer Research UK Grants C5047/A7357, C1287/A10118, C5047/A3354, C5047/A10692, C16913/A6135 and The National Institute of Health (NIH) Cancer Post-Cancer GWAS initiative Grant: no. 1 U19 CA 148537-01 (the GAME-ON initiative); Cancer Research UK (C1287/A10118, C1287/A 10710, C12292/A11174, C1281/A12014, C5047/A8384, C5047/A15007 and C5047/A10692), the National Institutes of Health (CA128978) and Post-Cancer GWAS initiative (1U19 CA148537, 1U19 CA148065 and 1U19 CA148112—the GAME-ON initiative), the Department of Defense (W81XWH-10-1-0341), A Linneus Centre (Contract ID 70867902), Swedish Research Council (grant no K2010-70X-20430-04-3), the Swedish Cancer Foundation (grant no 09-0677), grants RO1CA056678, RO1CA082664 and RO1CA092579 from the US National Cancer Institute, National Institutes of Health; US National Cancer Institute (R01CA72818); support from The National Health and Medical Research Council, Australia (126402, 209057, 251533, 396414, 450104, 504700, 504702, 504715, 623204, 940394 and 614296); NIH grants CA63464, CA54281 and CA098758; US National Cancer Institute (R01CA128813, PI: J.Y. Park); Bulgarian National Science Fund, Ministry of Education and Science (contract DOO-119/2009; DUNK01/2–2009; DFNI-B01/28/2012); Cancer Research UK grants [C8197/A10123] and [C8197/A10865]; grant code G0500966/75466; NIHR Health Technology Assessment Programme (projects 96/20/06 and 96/20/99); Cancer Research UK grant number C522/A8649, Medical Research Council of England grant number G0500966, ID 75466 and The NCRI, UK; The US Dept of Defense award W81XWH-04-1-0280; Australia Project Grant [390130, 1009458] and Enabling Grant [614296 to APCB]; the Prostate Cancer Foundation of Australia (Project Grant [PG7] and Research infrastructure grant [to APCB]); NIH grant R01 CA092447; Vanderbilt-Ingram Cancer Center (P30 CA68485); Cancer Research UK [C490/A10124] and supported by the UK National Institute for Health Research Biomedical Research Centre at the University of Cambridge; Competitive Research Funding of the Tampere University Hospital (9N069 and X51003); Award Number P30CA042014 from the National Cancer Institute.This is the final version of the article. It first appeared from Nature Publishing Group via http://dx.doi.org/0.1038/ncomms1097

    Single Nucleotide Polymorphisms and Haplotypes in the IL10 Region Associated with HCV Clearance

    No full text
    Hepatitis C virus (HCV) is an infectious blood-borne pathogen that usually persists as a chronic infection. However, approximately 15% of the time, patients can clear the virus, indicating that host differences could be critical in determining the course of HCV infection. The inflammatory response is crucial to resolving or failing to resolve an acute HCV infection. Some previous reports have implicated interleukin 10 (IL10) polymorphisms with successful anti-HCV therapy and natural viral clearance. We tested 54 single nucleotide polymorphisms (SNPs) in the IL10 region (+/-300 kb and 24 within the IL10 gene itself), which contains 13 genes including the IL10 immunomodulatory paralogs IL19, IL20, and IL24, for association with HCV clearance vs persistence. SNPs from two haplotype block regions, one at IL10 and the other from IL19/IL20, were associated with HCV clearance in African Americans (91 clearance cases and 183 chronically infected matched controls; P=0.05-0.002) while with expectation-maximization algorithm-reconstructed haplotypes, these associations remained (P=0.05-0.002). However, no significant associations were detected in European Americans (108 clearance and 245 chronic). Our results indicate that variants of the immunomodulatory IL10 and IL19/IL20 genes may be involved in natural clearance of HCV in the African-American population

    Evaluation of IL10, IL19 and IL20 Gene Polymorphisms and Chronic Hepatitis B Infection Outcome

    No full text
    Hepatitis B virus (HBV) infection remains a serious global health problem despite the availability of a highly effective vaccine. Approximately 5% of HBV-infected adults develop chronic hepatitis B, which may result in liver cirrhosis or hepatocellular carcinoma. Variants of interleukin-10 (IL10) have been previously associated with chronic hepatitis B infection and progression to hepatocellular carcinoma. Single nucleotide polymorphisms (SNP; n = 42) from the IL10, IL19 and IL20 gene regions were examined for an association with HBV infection outcome, either chronic or recovered, in a nested case–control study of African Americans and European Americans. Among African Americans, three nominally statistically significant SNP associations in IL10, two in IL20, and one haplotype association were observed with different HBV infection outcomes (P = 0.005–0.04). A SNP (rs1518108) in IL20 deviated significantly from Hardy–Weinberg equilibrium in African Americans, with a large excess of heterozygotes in chronic HBV-infected cases (P = 0.0006), which suggests a strong genetic effect. Among European Americans, a nominally statistically significant SNP association in IL20 and an IL20haplotype were associated with HBV recovery (P = 0.01–0.04). These results suggest that IL10 and IL20 gene variants influence HBV infection outcome and encourage the pursuit of further studies of these cytokines in HBV pathogenesis

    A High-Density Admixture Map for Disease Gene Discovery in African Americans

    Get PDF
    Admixture mapping (also known as “mapping by admixture linkage disequilibrium,” or MALD) provides a way of localizing genes that cause disease, in admixed ethnic groups such as African Americans, with ∼100 times fewer markers than are required for whole-genome haplotype scans. However, it has not been possible to perform powerful scans with admixture mapping because the method requires a dense map of validated markers known to have large frequency differences between Europeans and Africans. To create such a map, we screened through databases containing ∼450,000 single-nucleotide polymorphisms (SNPs) for which frequencies had been estimated in African and European population samples. We experimentally confirmed the frequencies of the most promising SNPs in a multiethnic panel of unrelated samples and identified 3,011 as a MALD map (1.2 cM average spacing). We estimate that this map is ∼70% informative in differentiating African versus European origins of chromosomal segments. This map provides a practical and powerful tool, which is freely available without restriction, for screening for disease genes in African American patient cohorts. The map is especially appropriate for those diseases that differ in incidence between the parental African and European populations

    Prostate Cancer Susceptibility in Men of African Ancestry at 8q24

    No full text
    The 8q24 region harbors multiple risk variants for distinct cancers, including &gt;8 for prostate cancer. In this study, we conducted fine mapping of the 8q24 risk region (127.8-128.8Mb) in search of novel associations with common and rare variation in 4853 prostate cancer case patients and 4678 control subjects of African ancestry. All statistical tests were two-sided. We identified three independent associations at P values of less than 5.00×10(-8), all of which were replicated in studies from Ghana and Uganda (combined sample = 5869 case patients, 5615 control subjects; rs114798100: risk allele frequency [RAF] = 0.04, per-allele odds ratio [OR] = 2.31, 95% confidence interval [CI] = 2.04 to 2.61, P = 2.38×10(-40); rs72725879: RAF = 0.33, OR = 1.37, 95% CI = 1.30 to 1.45, P = 3.04×10(-27); and rs111906932: RAF = 0.03, OR = 1.79, 95% CI = 1.53 to 2.08, P = 1.39×10(-13)). Risk variants rs114798100 and rs111906923 are only found in men of African ancestry, with rs111906923 representing a novel association signal. The three variants are located within or near a number of prostate cancer-associated long noncoding RNAs (lncRNAs), including PRNCR1, PCAT1, and PCAT2. These findings highlight ancestry-specific risk variation and implicate prostate-specific lncRNAs at the 8q24 prostate cancer susceptibility region

    Integration of multiethnic fine-mapping and genomic annotation to prioritize candidate functional SNPs at prostate cancer susceptibility regions

    No full text
    Interpretation of biological mechanisms underlying genetic risk associations for prostate cancer is complicated by the relatively large number of risk variants (n = 100) and the thousands of surrogate SNPs in linkage disequilibrium. Here, we combined three distinct approaches: multiethnic fine-mapping, putative functional annotation (based upon epigenetic data and genome-encoded features), and expression quantitative trait loci (eQTL) analyses, in an attempt to reduce this complexity. We examined 67 risk regions using genotyping and imputation-based fine-mapping in populations of European (cases/controls: 8600/6946), African (cases/controls: 5327/5136), Japanese (cases/controls: 2563/4391) and Latino (cases/controls: 1034/1046) ancestry. Markers at 55 regions passed a region-specific significance threshold (P-value cutoff range: 3.9 × 10(-4)-5.6 × 10(-3)) and in 30 regions we identified markers that were more significantly associated with risk than the previously reported variants in the multiethnic sample. Novel secondary signals (P \u3c 5.0 × 10(-6)) were also detected in two regions (rs13062436/3q21 and rs17181170/3p12). Among 666 variants in the 55 regions with P-values within one order of magnitude of the most-associated marker, 193 variants (29%) in 48 regions overlapped with epigenetic or other putative functional marks. In 11 of the 55 regions, cis-eQTLs were detected with nearby genes. For 12 of the 55 regions (22%), the most significant region-specific, prostate-cancer associated variant represented the strongest candidate functional variant based on our annotations; the number of regions increased to 20 (36%) and 27 (49%) when examining the 2 and 3 most significantly associated variants in each region, respectively. These results have prioritized subsets of candidate variants for downstream functional evaluation
    corecore