15 research outputs found

    The hazards of genotype imputation in chromosomal regions under selection: A case study using the Lactase gene region

    Although imputation of missing SNP results has been widely used in genetic studies, claims about the quality and usefulness of imputation have outnumbered the few studies that have questioned its limitations. But it is becoming clear that these limitations are real—for example, disease association signals can be missed in regions of LD breakdown. Here, as a case study, using the chromosomal region of the well-known lactase gene, LCT, we address the issue of imputation in the context of variants that have become frequent in a limited number of modern population groups only recently, due to selection. We study SNPs in a 500 bp region covering the enhancer of LCT, and compare imputed genotypes with directly genotyped data. We examine the haplotype pairs of all individuals with discrepant and missing genotypes. We highlight the nonrandom nature of the allelic errors and show that most incorrect imputations and missing data result from long haplotypes that are evolutionarily closely related to those carrying the derived alleles, while some relate to rare and recombinant haplotypes. We conclude that bias of incorrectly imputed and missing genotypes can decrease the accuracy of imputed results substantially

    Transcriptome analysis identifies a robust gene expression program in the mouse intestinal epithelium on aging

    The intestinal epithelium undergoes constant regeneration driven by intestinal stem cells. How old age affects the transcriptome in this highly dynamic tissue is an important but poorly explored question. Using transcriptomics on sorted intestinal stem cells and adult enterocytes, we identified candidate genes that change expression on aging. Further validation of these on intestinal epithelium of multiple middle-aged versus old-aged mice highlighted the consistent up-regulation of the expression of the gene encoding chemokine receptor Ccr2, a mediator of inflammation and several disease processes. We also observed increased expression of Strc, coding for stereocilin, and dramatically decreased expression of Rps4l, coding for a ribosome subunit. Ccr2 and Rps4l are located close to the telomeric regions of chromosomes 9 and 6, respectively. As only a few genes were differentially expressed and we did not observe significant protein level changes of identified ageing markers, our analysis highlights the overall robustness of murine intestinal epithelium gene expression to old age

    Smarcad1 mediates microbiota-induced inflammation in mouse and coordinates gene expression in the intestinal epithelium

    Background: How intestinal epithelial cells interact with the microbiota and how this is regulated at the gene expression level are critical questions. Smarcad1 is a conserved chromatin remodeling factor with a poorly understood tissue function. As this factor is highly expressed in the stem and proliferative zones of the intestinal epithelium, we explore its role in this tissue. Results: Specific deletion of Smarcad1 in the mouse intestinal epithelium leads to colitis resistance and substantial changes in gene expression, including a striking increase of expression of several genes linked to innate immunity. Absence of Smarcad1 leads to changes in chromatin accessibility and significant changes in histone H3K9me3 over many sites, including genes that are differentially regulated upon Smarcad1 deletion. We identify candidate members of the gut microbiome that elicit a Smarcad1-dependent colitis response, including members of the poorly understood TM7 phylum. Conclusions: Our study sheds light on the role of the chromatin remodeling machinery in intestinal epithelial cells in the colitis response and shows how a highly conserved chromatin remodeling factor has a distinct role in anti-microbial defense. This work highlights the importance of the intestinal epithelium in the colitis response and the potential of microbial species as pharmacological and probiotic targets in the context of inflammatory diseases

    Diversity of Lactase Persistence Alleles in Ethiopia: Signature of a Soft Selective Sweep

    The persistent expression of lactase into adulthood in humans is a recent genetic adaptation that allows the consumption of milk from other mammals after weaning. In Europe, a single allele (-13910*T, rs4988235) in an upstream region that acts as an enhancer to the expression of the lactase gene LCT is responsible for lactase persistence and appears to have been under strong directional selection in the last 5,000 years, evidenced by the widespread occurrence of this allele on an extended haplotype. In Africa and the Middle East, the situation is more complicated and at least three other alleles (-13907*G, rs41525747; -13915*G, rs41380347; -14010*C, rs145946881) in the same LCT enhancer region can cause continued lactase expression. Here we examine the LCT enhancer sequence in a large lactose-tolerance-tested Ethiopian cohort of more than 350 individuals. We show that a further SNP, -14009T>G (ss820486563), is significantly associated with lactose-digester status, and in vitro functional tests confirm that the -14009*G allele also increases expression of an LCT promoter construct. The derived alleles in the LCT enhancer region are spread through several ethnic groups, and we report a greater genetic diversity in lactose digesters than in nondigesters. By examining flanking markers to control for the effects of mutation and demography, we further describe, from empirical evidence, the signature of a soft selective sweep

    World-wide distributions of lactase persistence alleles and the complex effects of recombination and selection

    The genetic trait of lactase persistence (LP) is associated with at least five independent functional single nucleotide variants in a regulatory region about 14 kb upstream of the lactase gene [-13910*T (rs4988235), -13907*G (rs41525747), -13915*G (rs41380347), -14009*G (rs869051967) and -14010*C (rs145946881)]. These alleles have been inferred to have spread recently and present-day frequencies have been attributed to positive selection for the ability of adult humans to digest lactose without risk of symptoms of lactose intolerance. One of the inferential approaches used to estimate the level of past selection has been to determine the extent of haplotype homozygosity (EHH) of the sequence surrounding the SNP of interest. We report here new data on the frequencies of the known LP alleles in the 'Old World' and their haplotype lineages. We examine and confirm EHH of each of the LP alleles in relation to their distinct lineages, but also show marked EHH for one of the older haplotypes that does not carry any of the five LP alleles. The region of EHH of this (B) haplotype exactly coincides with a region of suppressed recombination that is detectable in families as well as in population data, and the results show how such suppression may have exaggerated haplotype-based measures of past selection

    The -14010*C variant associated with lactase persistence is located between an Oct-1 and HNF1a binding site and increases lactase promoter activity

    In most people worldwide, intestinal lactase expression declines in childhood. In many others, particularly in Europeans, lactase expression persists into adult life. In Europe, the lactase persistence phenotype is associated with the -13910*T single nucleotide variant located 13,910 bp upstream of the lactase gene in an enhancer region that affects lactase promoter activity. This variant falls in an Oct-1 binding site, shows greater Oct-1 binding than the ancestral variant, and increases enhancer activity. Several other variants have been identified very close to the -13910 position, which are associated with lactase persistence in the Middle East and Africa. One of them, -14010*C, is associated with lactase persistence in Africa. Here we show by deletion analysis that the -14010 position is located in a 144 bp region that reduces the enhancer activity. In transfections, the -14010*C allele shows a stronger enhancer effect than the ancestral -14010*G allele. Binding sites for Oct-1 and HNF1α surrounding the -14010 position were identified by gel shift assays, which indicated that -14010*C has greater binding affinity to Oct-1 than -14010*G