10 research outputs found

    Genome-wide prediction of cis-regulatory regions using supervised deep learning methods

    No full text
    Background: In the human genome, 98% of DNA sequences are non-protein-coding regions that were previously disregarded as junk DNA. In fact, non-coding regions host a variety of cis-regulatory regions which precisely control the expression of genes. Thus, Identifying active cis-regulatory regions in the human genome is critical for understanding gene regulation and assessing the impact of genetic variation on phenotype. The developments of high-throughput sequencing and machine learning technologies make it possible to predict cis-regulatory regions genome wide. Results: Based on rich data resources such as the Encyclopedia of DNA Elements (ENCODE) and the Functional Annotation of the Mammalian Genome (FANTOM) projects, we introduce DECRES based on supervised deep learning approaches for the identification of enhancer and promoter regions in the human genome. Due to their ability to discover patterns in large and complex data, the introduction of deep learning methods enables a significant advance in our knowledge of the genomic locations of cis-regulatory regions. Using models for well-characterized cell lines, we identify key experimental features that contribute to the predictive performance. Applying DECRES, we delineate locations of 300,000 candidate enhancers genome wide (6.8% of the genome, of which 40,000 are supported by bidirectional transcription data), and 26,000 candidate promoters (0.6% of the genome). Conclusion: The predicted annotations of cis-regulatory regions will provide broad utility for genome interpretation from functional genomics to clinical applications. The DECRES model demonstrates potentials of deep learning technologies when combined with high-throughput sequencing data, and inspires the development of other advanced neural network models for further improvement of genome annotations.Peer reviewed: YesNRC publication: Ye

    The selection and function of cell type-specific enhancers

    No full text
    The human body contains several hundred cell types, all with the same genome. In metazoans, much of the regulatory code that drives cell type-specific gene expression resides in distal elements called enhancers. Enhancers are activated by proteins called transcription factors that bind specific DNA motifs and recruit co-regulators to ultimately activate transcription. While the human genome contains millions of potential enhancers, only a small subset of them is active in a given cell type. Densely spaced clusters of active enhancers, referred to as super-enhancers, are associated with the expression of genes that specify cell identity and function. On a genomic scale, the function of enhancers is influenced by, and in turn affects higher-order chromatin structure and nuclear organization

    Transcriptional enhancers: from properties to genome-wide predictions

    No full text

    <em>Drosophila</em> models of metastasis

    No full text

    Transcription factors involved in drought tolerance and their possible role in developing drought tolerant cultivars with emphasis on wheat (Triticum aestivum L.)

    No full text
    corecore