374 research outputs found

    Random walks on mutual microRNA-target gene interaction network improve the prediction of disease-associated microRNAs

    Get PDF
    Background: MicroRNAs (miRNAs) have been shown to play an important role in pathological initiation, progression and maintenance. Because identification in the laboratory of disease-related miRNAs is not straightforward, numerous network-based methods have been developed to predict novel miRNAs in silico. Homogeneous networks (in which every node is a miRNA) based on the targets shared between miRNAs have been widely used to predict their role in disease phenotypes. Although such homogeneous networks can predict potential disease-associated miRNAs, they do not consider the roles of the target genes of the miRNAs. Here, we introduce a novel method based on a heterogeneous network that not only considers miRNAs but also the corresponding target genes in the network model. Results: Instead of constructing homogeneous miRNA networks, we built heterogeneous miRNA networks consisting of both miRNAs and their target genes, using databases of known miRNA-target gene interactions. In addition, as recent studies demonstrated reciprocal regulatory relations between miRNAs and their target genes, we considered these heterogeneous miRNA networks to be undirected, assuming mutual miRNA-target interactions. Next, we introduced a novel method (RWRMTN) operating on these mutual heterogeneous miRNA networks to rank candidate disease-related miRNAs using a random walk with restart (RWR) based algorithm. Using both known disease-associated miRNAs and their target genes as seed nodes, the method can identify additional miRNAs involved in the disease phenotype. Experiments indicated that RWRMTN outperformed two existing state-of-the-art methods: RWRMDA, a network-based method that also uses a RWR on homogeneous (rather than heterogeneous) miRNA networks, and RLSMDA, a machine learning-based method. Interestingly, we could relate this performance gain to the emergence of "disease modules" in the heterogeneous miRNA networks used as input for the algorithm. Moreover, we could demonstrate that RWRMTN is stable, performing well when using both experimentally validated and predicted miRNA-target gene interaction data for network construction. Finally, using RWRMTN, we identified 76 novel miRNAs associated with 23 disease phenotypes which were present in a recent database of known disease-miRNA associations. Conclusions: Summarizing, using random walks on mutual miRNA-target networks improves the prediction of novel disease-associated miRNAs because of the existence of "disease modules" in these networks

    Inference and Analysis of Multilayered Mirna-Mediated Networks in Cancer

    Get PDF
    MicroRNAs (miRNAs) are small noncoding transcripts that can regulate gene expression, thereby controlling diverse biological processes. Aberrant disruptions of miRNA expression and their interactions with other biological agents (e.g., coding and noncoding transcripts) have been associated with several types of cancer. The goal of this dissertation is to use multidimensional genomic data to model two different gene regulation mechanisms by miRNAs in cancer. This dissertation results from two research projects. The first project investigates a miRNA-mediated gene regulation mechanism called competing endogenous RNA (ceRNA) interactions, which suggests that some transcripts can indirectly regulate one another\u27s activity through their interactions with a common set of miRNAs. Identification of context-specific ceRNA interactions is a challenging task. To address that, we proposed a computational method called Cancerin to identify genome-wide cancer-associated ceRNA interactions. Cancerin incorporates DNA methylation (DM), copy number alteration (CNA), and gene and miRNA expression datasets to construct cancer-specific ceRNA networks. Cancerin was applied to three cancer datasets from the Cancer Genome Atlas (TCGA) project. We found that the RNAs involved in ceRNA interactions were enriched with cancer-related genes and have high prognostic power. Moreover, the ceRNA modules in the inferred ceRNA networks were involved in cancer-associated biological processes. The second project investigates what biological functions are regulated by both miRNAs and transcription factors (TFs). While it has been known that miRNAs and TFs can coregulate common target genes having similar biological functions, it is challenging to associate specific biological functions to specific miRNAs and TFs. In this project, we proposed a computational method called CanMod to identify gene regulatory modules. Each module consists of miRNAs, TFs and their coregulated target genes. CanMod was applied on the breast cancer dataset from TCGA. Many hub regulators (i.e., miRNAs and TFs) found in the inferred modules were known cancer genes, and CanMod was able to find experimentally validated regulator-target interactions. In addition, the modules were associated with distinguishable and cancer-related biological processes. Given the biological findings obtained from Cancerin and CanMod, we believe that the two computational methods are valuable tools to explore novel miRNA involvement in cancer

    Quantification of miRNA-mRNA Interactions

    Get PDF
    miRNAs are small RNA molecules (′ 22nt) that interact with their corresponding target mRNAs inhibiting the translation of the mRNA into proteins and cleaving the target mRNA. This second effect diminishes the overall expression of the target mRNA. Several miRNA-mRNA relationship databases have been deployed, most of them based on sequence complementarities. However, the number of false positives in these databases is large and they do not overlap completely. Recently, it has been proposed to combine expression measurement from both miRNA and mRNA and sequence based predictions to achieve more accurate relationships. In our work, we use LASSO regression with non-positive constraints to integrate both sources of information. LASSO enforces the sparseness of the solution and the non-positive constraints restrict the search of miRNA targets to those with down-regulation effects on the mRNA expression. We named this method TaLasso (miRNA-Target LASSO)

    Whole transcriptome analysis reveals non-coding RNA's competing endogenous gene pairs as novel form of motifs in serous ovarian cancer

    Get PDF
    Publisher Copyright: © 2022The non-coding RNA (ncRNA) regulation appears to be associated to the diagnosis and targeted therapy of complex diseases. Motifs of non-coding RNAs and genes in the competing endogenous RNA (ceRNA) network would probably contribute to the accurate prediction of serous ovarian carcinoma (SOC). We conducted a microarray study profiling the whole transcriptomes of eight human SOCs and eight controls and constructed a ceRNA network including mRNAs, long ncRNAs, and circular RNAs (circRNAs). Novel form of motifs (mRNA-ncRNA-mRNA) were identified from the ceRNA network and defined as non-coding RNA's competing endogenous gene pairs (ceGPs), using a proposed method denoised individualized pair analysis of gene expression (deiPAGE). 18 cricRNA's ceGPs (cceGPs) were identified from multiple cohorts and were fused as an indicator (SOC index) for SOC discrimination, which carried a high predictive capacity in independent cohorts. SOC index was negatively correlated with the CD8+/CD4+ ratio in tumour-infiltration, reflecting the migration and growth of tumour cells in ovarian cancer progression. Moreover, most of the RNAs in SOC index were experimentally validated involved in ovarian cancer development. Our results elucidate the discriminative capability of SOC index and suggest that the novel competing endogenous motifs play important roles in expression regulation and could be potential target for investigating ovarian cancer mechanism or its therapy.Peer reviewe

    Lecture notes on ridge regression

    Full text link
    The linear regression model cannot be fitted to high-dimensional data, as the high-dimensionality brings about empirical non-identifiability. Penalized regression overcomes this non-identifiability by augmentation of the loss function by a penalty (i.e. a function of regression coefficients). The ridge penalty is the sum of squared regression coefficients, giving rise to ridge regression. Here many aspect of ridge regression are reviewed e.g. moments, mean squared error, its equivalence to constrained estimation, and its relation to Bayesian regression. Finally, its behaviour and use are illustrated in simulation and on omics data. Subsequently, ridge regression is generalized to allow for a more general penalty. The ridge penalization framework is then translated to logistic regression and its properties are shown to carry over. To contrast ridge penalized estimation, the final chapter introduces its lasso counterpart

    Dissecting the expression relationships between RNA-binding proteins and their cognate targets in eukaryotic post-transcriptional regulatory networks

    Get PDF
    RNA-binding proteins (RBPs) are pivotal in orchestrating several steps in the metabolism of RNA in eukaryotes thereby controlling an extensive network of RBP-RNA interactions. Here, we employed CLIP (cross-linking immunoprecipitation)-seq datasets for 60 human RBPs and RIP-ChIP (RNP immunoprecipitation-microarray) data for 69 yeast RBPs to construct a network of genome-wide RBP- target RNA interactions for each RBP. We show in humans that majority (~78%) of the RBPs are strongly associated with their target transcripts at transcript level while ~95% of the studied RBPs were also found to be strongly associated with expression levels of target transcripts when protein expression levels of RBPs were employed. At transcript level, RBP - RNA interaction data for the yeast genome, exhibited a strong association for 63% of the RBPs, confirming the association to be conserved across large phylogenetic distances. Analysis to uncover the features contributing to these associations revealed the number of target transcripts and length of the selected protein-coding transcript of an RBP at the transcript level while intensity of the CLIP signal, number of RNA-Binding domains, location of the binding site on the transcript, to be significant at the protein level. Our analysis will contribute to improved modelling and prediction of post-transcriptional network
    • …
    corecore