9 research outputs found

    Detection of regulatory SNPs in human genome using ChIP-seq ENCODE data.

    Get PDF
    A vast amount of SNPs derived from genome-wide association studies are represented by non-coding ones, therefore exacerbating the need for effective identification of regulatory SNPs (rSNPs) among them. However, this task remains challenging since the regulatory part of the human genome is annotated much poorly as opposed to coding regions. Here we describe an approach aggregating the whole set of ENCODE ChIP-seq data in order to search for rSNPs, and provide the experimental evidence of its efficiency. Its algorithm is based on the assumption that the enrichment of a genomic region with transcription factor binding loci (ChIP-seq peaks) indicates its regulatory function, and thereby SNPs located in this region are more likely to influence transcription regulation. To ensure that the approach preferably selects functionally meaningful SNPs, we performed enrichment analysis of several human SNP datasets associated with phenotypic manifestations. It was shown that all samples are significantly enriched with SNPs falling into the regions of multiple ChIP-seq peaks as compared with the randomly selected SNPs. For experimental verification, 40 SNPs falling into overlapping regions of at least 7 TF binding loci were selected from OMIM. The effect of SNPs on the binding of the DNA fragments containing them to the nuclear proteins from four human cell lines (HepG2, HeLaS3, HCT-116, and K562) has been tested by EMSA. A radical change in the binding pattern has been observed for 29 SNPs, besides, 6 more SNPs also demonstrated less pronounced changes. Taken together, the results demonstrate the effective way to search for potential rSNPs with the aid of ChIP-seq data provided by ENCODE project

    A variety of SNP effects on binding of the corresponding oligonucleotides to nuclear proteins from K562 cells.

    No full text
    <p>rs79734816:C>T (<i>A</i>), rs2071002:A>C (<i>B</i>), and rs74393987:C>T (<i>C</i>) change the number and intensity of bands, while rs75996864:G>T (<i>D</i>) affect only band intensity, and 7961894:C>T (<i>E</i>) do not have any. Changes in the binding of allelic variants with the nuclear proteins are indicated by arrows.</p

    Localization of putative rSNPs within OTFR belonging to APC gene.

    No full text
    <p>SNPs from the different parts of OTFR were taken in EMSA to study the effect of SNP location within OTFR on the protein binding.</p

    The used approach to genome-wide selection of rSNPs.

    No full text
    <p>Computational analysis was applied to identify the SNPs in the most likely regulatory regions of the human genome and predict rSNPs for experimental verification.</p

    enrichment of S<sub>gwas</sub> sample and its high-confidence derivatives, S<sub>pV</sub>, S<sub>OR</sub> and S<sub>int</sub>, as well as S<sub>r</sub> sample with putative rSNPs as a function of cut-off number of overlapping TF binding loci (<i>i</i>) for defining OTFRs.

    No full text
    <p>500 bootstrap iterations were performed for each point. The resulting standard deviations and confidence intervals are shown by error bars and colour-filled areas,respectively. The subsamples of S<sub>gwas</sub> were generated with filtering of SNPs by P-value <1 e–7 (S<sub>pV</sub>), OR>3 and <0.3 (S<sub>OR</sub>), and by both criteria (S<sub>int</sub>). S<sub>gwas</sub> sample was extracted from NHGRI GWAS catalog.</p

    The enrichment of functionally-associated S<sub>omim</sub>, S<sub>clinic</sub>, S<sub>gwas</sub> samples, and S<sub>r</sub> sample with putative rSNPs as a function of cut-off number of overlapping TF binding loci (<i>i</i>) for defining OTFRs.

    No full text
    <p>500 bootstrap iterations were performed for each point. The resulting standard deviations and confidence intervals are shown by error bars and colour-filled areas,respectively. S<sub>clinic</sub>, S<sub>omim</sub>, and S<sub>gwas</sub> consist of SNPs associated with phenotypic manifestations and extracted from dbSNP NCBI Clinical/LSDB Submissions Resources, OMIM catalog, and NHGRI GWAS catalog, respectively. Sample S<sub>r</sub> of random SNPs was created without applying any phenotypic preferences and used as control. Genomic region chr6:29,909,708-31,325,212 was excluded from the analysis of S<sub>clinic</sub> sample, since it caused enrichment overestimation at lower <i>i</i> (a large number of SNPs concentrate in this region due to its extensive use in genotyping assays).</p
    corecore