33 research outputs found

    Profile analysis and prediction of tissue-specific CpG island methylation classes

    Background: The computational prediction of DNA methylation has become an important topic in recent years due to its role in the epigenetic control of normal and cancer-related processes. While previous prediction approaches focused merely on differences between methylated and unmethylated DNA sequences, recent experimental results have shown the presence of much more complex patterns of methylation across tissues and time in the human genome. These patterns are only partially described by a binary model of DNA methylation. In this work we propose a novel approach, based on profile analysis of tissue-specific methylation, that uncovers significant differences in the sequences of CpG islands (CGIs) that predispose them to a tissue-specific methylation pattern. Results: We defined CGI methylation profiles that not only separate constitutively methylated from unmethylated CGIs, but also identify CGIs showing a differential degree of methylation across tissues and cell types or a lack of methylation exclusively in sperm. These profiles are clearly distinguished by a number of CGI attributes including their evolutionary conservation, their significance, as well as the evolutionary evidence of prior methylation. Additionally, we assess profile functionality with respect to the different compartments of protein-coding genes and their possible use in the prediction of DNA methylation. Conclusion: Our approach provides new insights into the biological features that determine if a CGI has a functional role in the epigenetic control of gene expression and the features associated with CGI methylation susceptibility. Moreover, we show that the ability to predict CGI methylation is based primarily on the quality of the biological information used and the relationships uncovered between different sources of knowledge. The strategy presented here is able to predict, besides the constitutively methylated and unmethylated classes, two more tissue-specific methylation classes, conserving the accuracy provided by leading binary methylation classification methods.

    Prediction of CpG-island function: CpG clustering vs. sliding-window methods

    Background: Unmethylated stretches of CpG dinucleotides (CpG islands) are an outstanding property of mammalian genomes. Conventionally, these regions are detected by sliding-window approaches using %G + C, CpG observed/expected ratio and length thresholds as main parameters. Recently, clustering methods directly detect clusters of CpG dinucleotides as a statistical property of the genome sequence. Results: We compare sliding-window to clustering (i.e. CpGcluster) predictions by applying new ways to detect putative functionality of CpG islands. Analyzing the co-localization with several genomic regions as a function of window size vs. statistical significance (p-value), CpGcluster shows a higher overlap with promoter regions and highly conserved elements, at the same time showing less overlap with Alu retrotransposons. The major difference in the prediction was found for short islands (CpG islets), often exclusively predicted by CpGcluster. Many of these islets seem to be functional, as they are unmethylated, highly conserved and/or located within the promoter region. Finally, we show that window-based islands can spuriously overlap several differentially regulated promoters as well as different methylation domains, which might indicate a wrong merge of several CpG islands into a single, very long island. The shorter CpGcluster islands seem to be much more specific with respect to the overlap with alternative transcription start sites or the detection of homogeneous methylation domains. Conclusions: The main difference between sliding-window approaches and clustering methods is the length of the predicted islands. Short islands, often differentially methylated, are almost exclusively predicted by CpGcluster. This suggests that CpGcluster may be the algorithm of choice to explore the function of these short, but putatively functional CpG islands.
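
The sliding-window criteria named in this abstract (%G + C, CpG observed/expected ratio, length) can be illustrated with a minimal Python sketch. The function names, the default thresholds (200 bp window, G+C fraction of at least 0.5, observed/expected ratio of at least 0.6) and the rule for merging windows are assumptions chosen for illustration; they are not taken from the abstract and do not reproduce any particular published tool.

```python
def window_stats(seq):
    """Return (G+C fraction, CpG observed/expected ratio) for one window."""
    seq = seq.upper()
    n = len(seq)
    c, g = seq.count("C"), seq.count("G")
    cpg = seq.count("CG")
    gc_fraction = (c + g) / n if n else 0.0
    # Observed CpG count divided by the count expected from C and G frequencies.
    obs_exp = (cpg * n) / (c * g) if c and g else 0.0
    return gc_fraction, obs_exp


def sliding_window_islands(seq, window=200, step=1, min_gc=0.5, min_oe=0.6):
    """Merge overlapping or touching windows that pass both thresholds.

    Returns a list of (start, end) spans, 0-based, end exclusive.
    Thresholds are illustrative defaults (an assumption of this sketch).
    """
    islands = []
    for start in range(0, len(seq) - window + 1, step):
        gc, oe = window_stats(seq[start:start + window])
        if gc >= min_gc and oe >= min_oe:
            end = start + window
            if islands and start <= islands[-1][1]:
                # Extend the previous span instead of opening a new one.
                islands[-1] = (islands[-1][0], end)
            else:
                islands.append((start, end))
    return islands
```

Because every passing window is merged with its neighbours, a sketch like this can only grow islands, which is consistent with the abstract's remark that window-based islands may fuse several distinct CpG islands into one long span.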

    CpGcluster: a distance-based algorithm for CpG-island detection

    BACKGROUND: Despite their involvement in the regulation of gene expression and their importance as genomic markers for promoter prediction, no objective standard exists for defining CpG islands (CGIs), since all current approaches rely on a large parameter space formed by the thresholds of length, CpG fraction and G+C content. RESULTS: Given the higher frequency of CpG dinucleotides at CGIs, as compared to bulk DNA, the distance distributions between neighboring CpGs should differ for bulk and island CpGs. A new algorithm (CpGcluster) is presented, based on the physical distance between neighboring CpGs on the chromosome and able to predict directly clusters of CpGs, while not depending on the subjective criteria mentioned above. By assigning a p-value to each of these clusters, the most statistically significant ones can be predicted as CGIs. CpGcluster was benchmarked against five other CGI finders by using a test sequence set assembled from an experimental CGI library. CpGcluster reached the highest overall accuracy values, while showing the lowest rate of false-positive predictions. Since a minimum-length threshold is not required, CpGcluster can find short but fully functional CGIs usually missed by other algorithms. The CGIs predicted by CpGcluster present the lowest degree of overlap with Alu retrotransposons and, simultaneously, the highest overlap with vertebrate Phylogenetic Conserved Elements (PhastCons). CpGcluster's CGIs overlapping with the Transcription Start Site (TSS) show the highest statistical significance, as compared to the islands in other genome locations, thus qualifying CpGcluster as a valuable tool in discriminating functional CGIs from the remaining islands in the bulk genome. CONCLUSION: CpGcluster uses only integer arithmetic, thus being a fast and computationally efficient algorithm able to predict statistically significant clusters of CpG dinucleotides. 
Another outstanding feature is that all predicted CGIs start and end with a CpG dinucleotide, which should be appropriate for a genomic feature whose functionality is based precisely on CpG dinucleotides. The only search parameter in CpGcluster is the distance between two consecutive CpGs, in contrast to previous algorithms. Therefore, none of the main statistical properties of CpG islands (neither G+C content, CpG fraction nor length threshold) are needed as search parameters, which may lead to the high specificity and low overlap with spurious Alu elements observed for CpGcluster predictions.
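
The distance-based idea described in this abstract can be sketched in a few lines of Python. This is a rough illustration under stated assumptions, not the CpGcluster implementation: the function names, the `min_cpgs` parameter and the choice to measure distance as the difference between CpG start coordinates are hypothetical, and the p-value assignment mentioned in the abstract is not reproduced here.

```python
def cpg_positions(seq):
    """Return 0-based start coordinates of every CpG dinucleotide."""
    seq = seq.upper()
    return [i for i in range(len(seq) - 1) if seq[i:i + 2] == "CG"]


def cluster_cpgs(seq, max_distance, min_cpgs=2):
    """Group consecutive CpGs whose spacing does not exceed max_distance.

    Distance is taken here as the difference between CpG start coordinates
    (an assumption of this sketch). Returns (start, end, n_cpgs) tuples with
    end exclusive, so each cluster starts and ends with a CpG by construction.
    Only integer arithmetic is used.
    """
    clusters, current = [], []
    for pos in cpg_positions(seq):
        if current and pos - current[-1] > max_distance:
            # Gap too large: close the running cluster and start a new one.
            if len(current) >= min_cpgs:
                clusters.append((current[0], current[-1] + 2, len(current)))
            current = []
        current.append(pos)
    if len(current) >= min_cpgs:
        clusters.append((current[0], current[-1] + 2, len(current)))
    return clusters


# Two short clusters separated by a CpG-free stretch of ten bases.
example = "CGACG" + "T" * 10 + "CGCG"
print(cluster_cpgs(example, max_distance=5))
```

Note that no length, G+C or CpG-fraction threshold appears anywhere in the sketch; the spacing between neighbouring CpGs is the only search parameter, as the abstract states.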

    Dynamic regulation of the transcription initiation landscape at single nucleotide resolution during vertebrate embryogenesis

    Spatiotemporal control of gene expression is central to animal development. Core promoters represent a previously unanticipated regulatory level by interacting with cis-regulatory elements and transcription initiation in different physiological and developmental contexts. Here, we provide a first and comprehensive description of the core promoter repertoire and its dynamic use during the development of a vertebrate embryo. By using cap analysis of gene expression (CAGE), we mapped transcription initiation events at single nucleotide resolution across 12 stages of zebrafish development. These CAGE-based transcriptome maps reveal genome-wide rules of core promoter usage, structure, and dynamics, key to understanding the control of gene regulation during vertebrate ontogeny. They revealed the existence of multiple classes of pervasive intra- and intergenic post-transcriptionally processed RNA products and their developmental dynamics. Among these RNAs, we report splice donor site-associated intronic RNA (sRNA) to be specific to genes of the splicing machinery. For the identification of conserved features, we compared the zebrafish data sets to the first CAGE promoter map of Tetraodon and the existing human CAGE data. We show that a number of features, such as promoter type, newly discovered promoter properties such as a specialized purine-rich initiator motif, as well as sRNAs and the genes in which they are detected, are conserved in mammalian and Tetraodon CAGE-defined promoter maps. The zebrafish developmental promoterome represents a powerful resource for studying developmental gene regulation and revealing promoter features shared across vertebrates.

    Drug sensitivity profiling of 3D tumor tissue cultures in the pediatric precision oncology program INFORM

    The international precision oncology program INFORM enrolls relapsed/refractory pediatric cancer patients for comprehensive molecular analysis. We report a two-year pilot study implementing ex vivo drug sensitivity profiling (DSP) using a library of 75-78 clinically relevant drugs. We included 132 viable tumor samples from 35 pediatric oncology centers in seven countries. DSP was conducted on multicellular fresh tumor tissue spheroid cultures in 384-well plates with an overall mean processing time of three weeks. In 89 cases (67%), sufficient viable tissue was received; 69 (78%) passed internal quality controls. The DSP results matched the identified molecular targets, including BRAF, ALK, MET, and TP53 status. Drug vulnerabilities were identified in 80% of cases lacking actionable (very) high-evidence molecular events, adding value to the molecular data. Striking parallels between clinical courses and the DSP results were observed in selected patients. Overall, DSP in clinical real time is feasible in international multicenter precision oncology programs.
