13,992 research outputs found
Post-transcriptional knowledge in pathway analysis increases the accuracy of phenotypes classification
Motivation: Prediction of phenotypes from high-dimensional data is a crucial
task in precision biology and medicine. Many technologies employ genomic
biomarkers to characterize phenotypes. However, such elements are not
sufficient to explain the underlying biology. To improve this, pathway analysis
techniques have been proposed. Nevertheless, such methods have shown lack of
accuracy in phenotypes classification. Results: Here we propose a novel
methodology called MITHrIL (Mirna enrIched paTHway Impact anaLysis) for the
analysis of signaling pathways, which has built on top of the work of Tarca et
al., 2009. MITHrIL extends pathways by adding missing regulatory elements, such
as microRNAs, and their interactions with genes. The method takes as input the
expression values of genes and/or microRNAs and returns a list of pathways
sorted according to their deregulation degree, together with the corresponding
statistical significance (p-values). Our analysis shows that MITHrIL
outperforms its competitors even in the worst case. In addition, our method is
able to correctly classify sets of tumor samples drawn from TCGA. Availability:
MITHrIL is freely available at the following URL:
http://alpha.dmi.unict.it/mithril
Mapping Dynamic Histone Acetylation Patterns to Gene Expression in Nanog-depleted Murine Embryonic Stem Cells
Embryonic stem cells (ESC) have the potential to self-renew indefinitely and
to differentiate into any of the three germ layers. The molecular mechanisms
for self-renewal, maintenance of pluripotency and lineage specification are
poorly understood, but recent results point to a key role for epigenetic
mechanisms. In this study, we focus on quantifying the impact of histone 3
acetylation (H3K9,14ac) on gene expression in murine embryonic stem cells. We
analyze genome-wide histone acetylation patterns and gene expression profiles
measured over the first five days of cell differentiation triggered by
silencing Nanog, a key transcription factor in ESC regulation. We explore the
temporal and spatial dynamics of histone acetylation data and its correlation
with gene expression using supervised and unsupervised statistical models. On a
genome-wide scale, changes in acetylation are significantly correlated to
changes in mRNA expression and, surprisingly, this coherence increases over
time. We quantify the predictive power of histone acetylation for gene
expression changes in a balanced cross-validation procedure. In an in-depth
study we focus on genes central to the regulatory network of Mouse ESC,
including those identified in a recent genome-wide RNAi screen and in the
PluriNet, a computationally derived stem cell signature. We find that compared
to the rest of the genome, ESC-specific genes show significantly more
acetylation signal and a much stronger decrease in acetylation over time, which
is often not reflected in an concordant expression change. These results shed
light on the complexity of the relationship between histone acetylation and
gene expression and are a step forward to dissect the multilayer regulatory
mechanisms that determine stem cell fate.Comment: accepted at PLoS Computational Biolog
A urinary Common Rejection Module (uCRM) score for non-invasive kidney transplant monitoring.
A Common Rejection Module (CRM) consisting of 11 genes expressed in allograft biopsies was previously reported to serve as a biomarker for acute rejection (AR), correlate with the extent of graft injury, and predict future allograft damage. We investigated the use of this gene panel on the urine cell pellet of kidney transplant patients. Urinary cell sediments collected from patients with biopsy-confirmed acute rejection, borderline AR (bAR), BK virus nephropathy (BKVN), and stable kidney grafts with normal protocol biopsies (STA) were analyzed for expression of these 11 genes using quantitative polymerase chain reaction (qPCR). We assessed these 11 CRM genes for their abundance, autocorrelation, and individual expression levels. Expression of 10/11 genes were elevated in AR when compared to STA. Psmb9 and Cxcl10could classify AR versus STA as accurately as the 11-gene model (sensitivity = 93.6%, specificity = 97.6%). A uCRM score, based on the geometric mean of the expression levels, could distinguish AR from STA with high accuracy (AUC = 0.9886) and correlated specifically with histologic measures of tubulitis and interstitial inflammation rather than tubular atrophy, glomerulosclerosis, intimal proliferation, tubular vacuolization or acute glomerulitis. This urine gene expression-based score may enable the non-invasive and quantitative monitoring of AR
Strategies for increasing the applicability of biological network inference
The manipulation of cellular state has many promising applications, including stem cell biology and regenerative medicine, biofuel production, and stress resistant crop development. The construction of interaction maps promises to enhance our ability to engineer cellular behavior. Within the last 15 years, many methods have been developed to infer the structure of the gene regulatory interaction map from gene abundance snapshots provided by high-throughput experimental data. However, relatively little research has focused on using gene regulatory network models for the prediction and manipulation of cellular behavior. This dissertation examines and applies strategies to utilize the predictive power of gene network models to guide experimentation and engineering efforts. First, we developed methods to improve gene network models by integrating interaction evidence sources, in order to utilize the full predictive power of the models. Next, we explored the power of networks models to guide experimental efforts through inference and analysis of a regulatory network in the pathogenic fungus Cryptococcus neoformans. Finally, we develop a novel, network-guided algorithm to select genetic interventions for engineering transcriptional state. We apply this method to select intervention strains for improving biofuel production in a mixed glucose-xylose environment. The contributions in this dissertation provide the first thorough examination, systematic application, and quantitative evaluation of the utilization of network models for guiding cellular engineering
An integrated machine learning and experimental approach to uncover ageing-associated processes in Fission Yeast
This work attempts to bring together knowledge of different pathways associated with cellular ageing and create connections between them using both machine learning and experimental methods. Initially, I describe the development of a novel proxy for chronological lifespan as part of the analysis pipeline of a high-throughput chronological lifespan assay in fission yeast. I then use this technique to go on to develop novel machine learning models that can predict lifespan, a complex phenotype, from simple traits, and identify ageing-associated phenotypes in fission yeast.
Complementary to this, I investigate a transcription factor of interest, Hsr1, for its involvement in cellular ageing and ageing-associated processes. I describe direct regulatory targets and how it forms a network with at least four other ageing-associated transcription factors which bridges the gaps between models of ageing, and suggest mechanisms for these interactions.
In this way, this work provides novel links between cellular ageing mechanisms and ageing-associated processes from both machine learning and experimental sources
MicroRNA-Integrated and Network-Embedded Gene Selection with Diffusion Distance
Gene network information has been used to improve gene selection in microarray-based studies by selecting marker genes based both on their expression and the coordinate expression of genes within their gene network under a given condition. Here we propose a new network-embedded gene selection model. In this model, we first address the limitations of microarray data. Microarray data, although widely used for gene selection, measures only mRNA abundance, which does not always reflect the ultimate gene phenotype, since it does not account for post-transcriptional effects. To overcome this important (critical in certain cases) but ignored-in-almost-all-existing-studies limitation, we design a new strategy to integrate together microarray data with the information of microRNA, the major post-transcriptional regulatory factor. We also handle the challenges led by gene collaboration mechanism. To incorporate the biological facts that genes without direct interactions may work closely due to signal transduction and that two genes may be functionally connected through multi paths, we adopt the concept of diffusion distance. This concept permits us to simulate biological signal propagation and therefore to estimate the collaboration probability for all gene pairs, directly or indirectly-connected, according to multi paths connecting them. We demonstrate, using type 2 diabetes (DM2) as an example, that the proposed strategies can enhance the identification of functional gene partners, which is the key issue in a network-embedded gene selection model. More importantly, we show that our gene selection model outperforms related ones. Genes selected by our model 1) have improved classification capability; 2) agree with biological evidence of DM2-association; and 3) are involved in many well-known DM2-associated pathways
- …