14,318 research outputs found

    Herb Target Prediction Based on Representation Learning of Symptom related Heterogeneous Network.

    Get PDF
    Traditional Chinese Medicine (TCM) has received increasing attention as a complementary approach or alternative to modern medicine. However, experimental methods for identifying novel targets of TCM herbs heavily relied on the current available herb-compound-target relationships. In this work, we present an Herb-Target Interaction Network (HTINet) approach, a novel network integration pipeline for herb-target prediction mainly relying on the symptom related associations. HTINet focuses on capturing the low-dimensional feature vectors for both herbs and proteins by network embedding, which incorporate the topological properties of nodes across multi-layered heterogeneous network, and then performs supervised learning based on these low-dimensional feature representations. HTINet obtains performance improvement over a well-established random walk based herb-target prediction method. Furthermore, we have manually validated several predicted herb-target interactions from independent literatures. These results indicate that HTINet can be used to integrate heterogeneous information to predict novel herb-target interactions

    Predictive response-relevant clustering of expression data provides insights into disease processes

    Get PDF
    This article describes and illustrates a novel method of microarray data analysis that couples model-based clustering and binary classification to form clusters of ;response-relevant' genes; that is, genes that are informative when discriminating between the different values of the response. Predictions are subsequently made using an appropriate statistical summary of each gene cluster, which we call the ;meta-covariate' representation of the cluster, in a probit regression model. We first illustrate this method by analysing a leukaemia expression dataset, before focusing closely on the meta-covariate analysis of a renal gene expression dataset in a rat model of salt-sensitive hypertension. We explore the biological insights provided by our analysis of these data. In particular, we identify a highly influential cluster of 13 genes-including three transcription factors (Arntl, Bhlhe41 and Npas2)-that is implicated as being protective against hypertension in response to increased dietary sodium. Functional and canonical pathway analysis of this cluster using Ingenuity Pathway Analysis implicated transcriptional activation and circadian rhythm signalling, respectively. Although we illustrate our method using only expression data, the method is applicable to any high-dimensional datasets

    Pathway relevance ranking for tumor samples through network-based data integration

    Get PDF
    The study of cancer, a highly heterogeneous disease with different causes and clinical outcomes, requires a multi-angle approach and the collection of large multi-omics datasets that, ideally, should be analyzed simultaneously. We present a new pathway relevance ranking method that is able to prioritize pathways according to the information contained in any combination of tumor related omics datasets. Key to the method is the conversion of all available data into a single comprehensive network representation containing not only genes but also individual patient samples. Additionally, all data are linked through a network of previously identified molecular interactions. We demonstrate the performance of the new method by applying it to breast and ovarian cancer datasets from The Cancer Genome Atlas. By integrating gene expression, copy number, mutation and methylation data, the method's potential to identify key pathways involved in breast cancer development shared by different molecular subtypes is illustrated. Interestingly, certain pathways were ranked equally important for different subtypes, even when the underlying (epi)-genetic disturbances were diverse. Next to prioritizing universally high-scoring pathways, the pathway ranking method was able to identify subtype-specific pathways. Often the score of a pathway could not be motivated by a single mutation, copy number or methylation alteration, but rather by a combination of genetic and epi-genetic disturbances, stressing the need for a network-based data integration approach. The analysis of ovarian tumors, as a function of survival-based subtypes, demonstrated the method's ability to correctly identify key pathways, irrespective of tumor subtype. A differential analysis of survival-based subtypes revealed several pathways with higher importance for the bad-outcome patient group than for the good-outcome patient group. Many of the pathways exhibiting higher importance for the bad-outcome patient group could be related to ovarian tumor proliferation and survival

    Expression of Regulatory Platelet MicroRNAs in Patients with Sickle Cell Disease

    Get PDF
    Background: Increased platelet activation in sickle cell disease (SCD) contributes to a state of hypercoagulability and confers a risk of thromboembolic complications. The role for post-transcriptional regulation of the platelet transcriptome by microRNAs (miRNAs) in SCD has not been previously explored. This is the first study to determine whether platelets from SCD exhibit an altered miRNA expression profile. Methods and Findings: We analyzed the expression of miRNAs isolated from platelets from a primary cohort (SCD = 19, controls = 10) and a validation cohort (SCD = 7, controls = 7) by hybridizing to the Agilent miRNA microarrays. A dramatic difference in miRNA expression profiles between patients and controls was noted in both cohorts separately. A total of 40 differentially expressed platelet miRNAs were identified as common in both cohorts (p-value 0.05, fold change>2) with 24 miRNAs downregulated. Interestingly, 14 of the 24 downregulated miRNAs were members of three families - miR-329, miR-376 and miR-154 - which localized to the epigenetically regulated, maternally imprinted chromosome 14q32 region. We validated the downregulated miRNAs, miR-376a and miR-409-3p, and an upregulated miR-1225-3p using qRT-PCR. Over-expression of the miR-1225-3p in the Meg01 cells was followed by mRNA expression profiling to identify mRNA targets. This resulted in significant transcriptional repression of 1605 transcripts. A combinatorial approach using Meg01 mRNA expression profiles following miR-1225-3p overexpression, a computational prediction analysis of miRNA target sequences and a previously published set of differentially expressed platelet transcripts from SCD patients, identified three novel platelet mRNA targets: PBXIP1, PLAGL2 and PHF20L1. Conclusions: We have identified significant differences in functionally active platelet miRNAs in patients with SCD as compared to controls. These data provide an important inventory of differentially expressed miRNAs in SCD patients and an experimental framework for future studies of miRNAs as regulators of biological pathways in platelets. Ā© 2013 Jain et al

    aFold ā€“ using polynomial uncertainty modelling for differential gene expression estimation from RNA sequencing data

    No full text
    Data normalization and identification of significant differential expression represent crucial steps in RNA-Seq analysis. Many available tools rely on assumptions that are often not met by real data, including the common assumption of symmetrical distribution of up- and down-regulated genes, the presence of only few differentially expressed genes and/or few outliers. Moreover, the cut-off for selecting significantly differentially expressed genes for further downstream analysis often depend on arbitrary choices
    • ā€¦
    corecore