11 research outputs found

    CATCHprofiles: Clustering and Alignment Tool for ChIP Profiles

    Get PDF
    Chromatin Immuno Precipitation (ChIP) profiling detects in vivo protein-DNA binding, and has revealed a large combinatorial complexity in the binding of chromatin associated proteins and their post-translational modifications. To fully explore the spatial and combinatorial patterns in ChIP-profiling data and detect potentially meaningful patterns, the areas of enrichment must be aligned and clustered, which is an algorithmically and computationally challenging task. We have developed CATCHprofiles, a novel tool for exhaustive pattern detection in ChIP profiling data. CATCHprofiles is built upon a computationally efficient implementation for the exhaustive alignment and hierarchical clustering of ChIP profiling data. The tool features a graphical interface for examination and browsing of the clustering results. CATCHprofiles requires no prior knowledge about functional sites, detects known binding patterns “ab initio”, and enables the detection of new patterns from ChIP data at a high resolution, exemplified by the detection of asymmetric histone and histone modification patterns around H2A.Z-enriched sites. CATCHprofiles' capability for exhaustive analysis combined with its ease-of-use makes it an invaluable tool for explorative research based on ChIP profiling data

    IFN-Îť3, not IFN-Îť4, likely mediates IFNL3-IFNL4 haplotype-dependent hepatic inflammation and fibrosis

    Get PDF
    Genetic variation in the IFNL3-IFNL4 (interferon-Îť3-interferon-Îť4) region is associated with hepatic inflammation and fibrosis. Whether IFN-Îť3 or IFN-Îť4 protein drives this association is not known. We demonstrate that hepatic inflammation, fibrosis stage, fibrosis progression rate, hepatic infiltration of immune cells, IFN-Îť3 expression, and serum sCD163 levels (a marker of activated macrophages) are greater in individuals with the IFNL3-IFNL4 risk haplotype that does not produce IFN-Îť4, but produces IFN-Îť3. No difference in these features was observed according to genotype at rs117648444, which encodes a substitution at position 70 of the IFN-Îť4 protein and reduces IFN-Îť4 activity, or between patients encoding functionally defective IFN-Îť4 (IFN-Îť4-Ser70) and those encoding fully active IFN-Îť4-Pro70. The two proposed functional variants (rs368234815 and rs4803217) were not superior to the discovery SNP rs12979860 with respect to liver inflammation or fibrosis phenotype. IFN-Îť3 rather than IFN-Îť4 likely mediates IFNL3-IFNL4 haplotype-dependent hepatic inflammation and fibrosis

    Novel variation and <i>de novo </i>mutation rates in population-wide <i>de novo</i> assembled Danish trios

    Get PDF
    Building a population-specific catalogue of single nucleotide variants (SNVs), indels and structural variants (SVs) with frequencies, termed a national pan-genome, is critical for further advancing clinical and public health genetics in large cohorts. Here we report a Danish pan-genome obtained from sequencing 10 trios to high depth (50 × ). We report 536k novel SNVs and 283k novel short indels from mapping approaches and develop a population-wide de novo assembly approach to identify 132k novel indels larger than 10 nucleotides with low false discovery rates. We identify a higher proportion of indels and SVs than previous efforts showing the merits of high coverage and de novo assembly approaches. In addition, we use trio information to identify de novo mutations and use a probabilistic method to provide direct estimates of 1.27e−8 and 1.5e−9 per nucleotide per generation for SNVs and indels, respectively

    Sequencing and de novo assembly of 150 genomes from Denmark as a population reference

    Get PDF
    Hundreds of thousands of human genomes are now being sequenced to characterize genetic variation and use this information to augment association mapping studies of complex disorders and other phenotypic traits. Genetic variation is identified mainly by mapping short reads to the reference genome or by performing local assembly. However, these approaches are biased against discovery of structural variants and variation in the more complex parts of the genome. Hence, large-scale de novo assembly is needed. Here we show that it is possible to construct excellent de novo assemblies from high-coverage sequencing with mate-pair libraries extending up to 20 kilobases. We report de novo assemblies of 150 individuals (50 trios) from the GenomeDenmark project. The quality of these assemblies is similar to those obtained using the more expensive long-read technology. We use the assemblies to identify a rich set of structural variants including many novel insertions and demonstrate how this variant catalogue enables further deciphering of known association mapping signals. We leverage the assemblies to provide 100 completely resolved major histocompatibility complex haplotypes and to resolve major parts of the Y chromosome. Our study provides a regional reference genome that we expect will improve the power of future association mapping studies and hence pave the way for precision medicine initiatives, which now are being launched in many countries including Denmark
    corecore