22 research outputs found
Savant Genome Browser 2: visualization and analysis for population-scale genomics
High-throughput sequencing (HTS) technologies are providing an unprecedented capacity for data generation, and there is a corresponding need for efficient data exploration and analysis capabilities. Although most existing tools for HTS data analysis are developed for either automated (e.g. genotyping) or visualization (e.g. genome browsing) purposes, such tools are most powerful when combined. For example, integration of visualization and computation allows users to iteratively refine their analyses by updating computational parameters within the visual framework in real-time. Here we introduce the second version of the Savant Genome Browser, a standalone program for visual and computational analysis of HTS data. Savant substantially improves upon its predecessor and existing tools by introducing innovative visualization modes and navigation interfaces for several genomic datatypes, and synergizing visual and automated analyses in a way that is powerful yet easy even for non-expert users. We also present a number of plugins that were developed by the Savant Community, which demonstrate the power of integrating visual and automated analyses using Savant. The Savant Genome Browser is freely available (open source) at www.savantbrowser.co
Twenty years of geomagnetic field observations at Mario Zucchelli Station (Antarctica)
During the 1986-87 austral summer a geomagnetic observatory was installed at
Terra Nova Bay. During the first years both geomagnetic field time variation
monitoring and absolute measurements were carried out only during summer. Since 1991 variometer measurements are automatically performed during the whole year, while absolute measurements are still performed only during summer. In spite of this, interesting observations were obtained during the life (quite long for Antarctica) of the geomagnetic observatory. In particular in this paper some of the most relevant results are briefly presented: studies about secular variation, daily variation (and its dependence from solar cycle and seasons) and geomagnetic higher frequency variations, such as geomagnetic pulsations
Correction: Detection and classification of peaks in 5' cap RNA sequencing data
[No abstract available
Detection and classification of peaks in 5' cap RNA sequencing data
Background
The large-scale sequencing of 5' cap enriched cDNA promises to reveal the diversity of transcription initiation across entire genomes. The process of transcription is noisy, and there is often no single, exact start site. This creates the need for a fast and simple method of identifying transcription start peaks based on this type of data. Due to both biological and technical noise, many of the peaks seen are not real transcription initiation events. Classification of the observed peaks is an essential filtering step in the discovery of genuine initiation locations.
Results
We develop a two-stage approach consisting of a fast and simple algorithm based on a sliding window with Poisson null distribution for detecting the genomic locations of peaks, followed by a linear support vector machine classifier to distinguish between peaks which represent the initiation of transcription and peaks that do not. Comparison of classification performance to the best existing method based on whole genome segmentation showed comparable precision and improved recall. Internal features, which are intrinsic to the data and require no further experiments, had high precision and recall rates. Addition of pooled external data or matched RNA sequencing data resulted in gains of recall with equivalent precision.
Conclusions
The Poisson sliding window model is an effective and fast way of taking the peak neighbourhood into account, and finding statistically significant peaks over a range of transcript expression values. It is orders of magnitude faster than doing whole genome segmentation. The support vector classification scheme has better precision and recall than existing methods. Integrating additional datasets is shown to provide minor gains in recall, in comparison to using only the cap-sequencing data
Cross-Platform Omics Prediction procedure: a statistical machine learning framework for wider implementation of precision medicine.
In this modern era of precision medicine, molecular signatures identified from advanced omics technologies hold great promise to better guide clinical decisions. However, current approaches are often location-specific due to the inherent differences between platforms and across multiple centres, thus limiting the transferability of molecular signatures. We present Cross-Platform Omics Prediction (CPOP), a penalised regression model that can use omics data to predict patient outcomes in a platform-independent manner and across time and experiments. CPOP improves on the traditional prediction framework of using gene-based features by selecting ratio-based features with similar estimated effect sizes. These components gave CPOP the ability to have a stable performance across datasets of similar biology, minimising the effect of technical noise often generated by omics platforms. We present a comprehensive evaluation using melanoma transcriptomics data to demonstrate its potential to be used as a critical part of a clinical screening framework for precision medicine. Additional assessment of generalisation was demonstrated with ovarian cancer and inflammatory bowel disease studies
Acetylation of H2A.Z is a key epigenetic modification associated with gene deregulation and epigenetic remodeling in cancer
Histone H2A.Z (H2A.Z) is an evolutionarily conserved H2A variant implicated in the regulation of gene expression; however, its role in transcriptional deregulation in cancer remains poorly understood. Using genome-wide studies, we investigated the role of promoter-associated H2A.Z and acetylated H2A.Z (acH2A.Z) in gene deregulation and its relationship with DNA methylation and H3K27me3 in prostate cancer. Our results reconcile the conflicting reports of positive and negative roles for histone H2A.Z and gene expression states. We find that H2A.Z is enriched in a bimodal distribution at nucleosomes, surrounding the transcription start sites (TSSs) of both active and poised gene promoters. In addition, H2A.Z spreads across the entire promoter of inactive genes in a deacetylated state. In contrast, acH2A.Z is only localized at the TSSs of active genes. Gene deregulation in cancer is also associated with a reorganization of acH2A.Z and H2A.Z nucleosome occupancy across the promoter region and TSS of genes. Notably, in cancer cells we find that a gain of acH2A.Z at the TSS occurs with an overall decrease of H2A.Z levels, in concert with oncogene activation. Furthermore, deacetylation of H2A.Z at TSSs is increased with silencing of tumor suppressor genes. We also demonstrate that acH2A.Z anti-correlates with promoter H3K27me3 and DNA methylation. We show for the first time, that acetylation of H2A.Z is a key modification associated with gene activity in normal cells and epigenetic gene deregulation in tumorigenesis
Discovery pipeline for epigenetically deregulated miRNAs in cancer: integration of primary miRNA transcription
<p>Abstract</p> <p>Background</p> <p>Cancer is commonly associated with widespread disruption of DNA methylation, chromatin modification and miRNA expression. In this study, we established a robust discovery pipeline to identify epigenetically deregulated miRNAs in cancer.</p> <p>Results</p> <p>Using an integrative approach that combines primary transcription, genome-wide DNA methylation and H3K9Ac marks with microRNA (miRNA) expression, we identified miRNA genes that were epigenetically modified in cancer. We find miR-205, miR-21, and miR-196b to be epigenetically repressed, and miR-615 epigenetically activated in prostate cancer cells.</p> <p>Conclusions</p> <p>We show that detecting changes in primary miRNA transcription levels is a valuable method for detection of local epigenetic modifications that are associated with changes in mature miRNA expression.</p
Quantitative Performance Evaluator for Proteomics (QPEP): Web-based Application for Reproducible Evaluation of Proteomics Preprocessing Methods
Tandem mass spectrometry is one of the most popular techniques for quantitation of proteomes. There exists a large variety of options in each stage of data preprocessing that impact the bias and variance of the summarized protein-level values. Using a newly released data set satisfying a replicated Latin squares design, a diverse set of performance metrics has been developed and implemented in a web-based application, Quantitative Performance Evaluator for Proteomics (QPEP). QPEP has the flexibility to allow users to apply their own method to preprocess this data set and share the results, allowing direct and straightforward comparison of new methodologies. Application of these new metrics to three case studies highlights that (i) the summarization of peptides to proteins is robust to the choice of peptide summary used, (ii) the differences between iTRAQ labels are stronger than the differences between experimental runs, and (iii) the commercial software ProteinPilot performs equivalently well at between-sample normalization to more complicated methods developed by academics. Importantly, finding (ii) underscores the benefits of using the principles of randomization and blocking to avoid the experimental measurements being confounded by technical factors. Data are available via ProteomeXchange with identifier PXD003608
Wnt inhibitory factor 1 (WIF1) is a marker of osteoblastic differentiation stage and is not silenced by DNA methylation in osteosarcoma
Wnt pathway targeting is of high clinical interest for treating bone loss disorders such as osteoporosis. These therapies inhibit the action of negative regulators of osteoblastic Wnt signaling. The report that Wnt inhibitory factor 1 (WIF1) was epigenetically silenced via promoter DNA methylation in osteosarcoma (OS) raised potential concerns for such treatment approaches. Here we confirm that Wif1 expression is frequently reduced in OS. However, we demonstrate that silencing is not driven by DNA methylation. Treatment of mouse and human OS cells showed that Wif1 expression was robustly induced by HDAC inhibition but not by methylation inhibition. Consistent with HDAC dependent silencing, the Wif1 locus in OS was characterized by low acetylation levels and a bivalent H3K4/H3K27-trimethylation state. Wif1 expression marked late stages of normal osteoblast maturation and stratified OS tumors based on differentiation stage across species. Culture of OS cells under differentiation inductive conditions increased expression of Wif1. Together these results demonstrate that Wif1 is not targeted for silencing by DNA methylation in OS. Instead, the reduced expression of Wif1 in OS cells is in context with their stage in differentiation
Repitools: an R package for the analysis of enrichment-based epigenomic data.
Item does not contain fulltextSUMMARY: Epigenetics, the study of heritable somatic phenotypic changes not related to DNA sequence, has emerged as a critical component of the landscape of gene regulation. The epigenetic layers, such as DNA methylation, histone modifications and nuclear architecture are now being extensively studied in many cell types and disease settings. Few software tools exist to summarize and interpret these datasets. We have created a toolbox of procedures to interrogate and visualize epigenomic data (both array- and sequencing-based) and make available a software package for the cross-platform R language. AVAILABILITY: The package is freely available under LGPL from the R-Forge web site (http://repitools.r-forge.r-project.org/) CONTACT: [email protected]
