704 research outputs found

    Tackling hypotheticals in helminth genomes

    Get PDF
    Advancements in genome sequencing have led to the rapid accumulation of uncharacterized ‘hypothetical proteins’ in the public databases. Here we provide a community perspective and some best-practice approaches for the accurate functional annotation of uncharacterized genomic sequences

    Prioritizing human cancer microRNAs based on genes’ functional consistency between microRNA and cancer

    Get PDF
    The identification of human cancer-related microRNAs (miRNAs) is important for cancer biology research. Although several identification methods have achieved remarkable success, they have overlooked the functional information associated with miRNAs. We present a computational framework that can be used to prioritize human cancer miRNAs by measuring the association between cancer and miRNAs based on the functional consistency score (FCS) of the miRNA target genes and the cancer-related genes. This approach proved successful in identifying the validated cancer miRNAs for 11 common human cancers with area under ROC curve (AUC) ranging from 71.15% to 96.36%. The FCS method had a significant advantage over miRNA differential expression analysis when identifying cancer-related miRNAs with a fine regulatory mechanism, such as miR-27a in colorectal cancer. Furthermore, a case study examining thyroid cancer showed that the FCS method can uncover novel cancer-related miRNAs such as miR-27a/b, which were showed significantly upregulated in thyroid cancer samples by qRT-PCR analysis. Our method can be used on a web-based server, CMP (cancer miRNA prioritization) and is freely accessible at http://bioinfo.hrbmu.edu.cn/CMP. This time- and cost-effective computational framework can be a valuable complement to experimental studies and can assist with future studies of miRNA involvement in the pathogenesis of cancers

    Predicting Gene Ontology Function of Human MicroRNAs by Integrating Multiple Networks

    Get PDF
    MicroRNAs (miRNAs) have been demonstrated to play significant biological roles in many human biological processes. Inferring the functions of miRNAs is an important strategy for understanding disease pathogenesis at the molecular level. In this paper, we propose an integrated model, PmiRGO, to infer the gene ontology (GO) functions of miRNAs by integrating multiple data sources, including the expression profiles of miRNAs, miRNA-target interactions, and protein-protein interactions (PPI). PmiRGO starts by building a global network consisting of three networks. Then, it employs DeepWalk to learn latent representations as network features of the global heterogeneous network. Finally, the SVM-based models are applied to label the GO terms of miRNAs. The experimental results show that PmiRGO has a significantly better performance than existing state-of-the-art methods in terms of Fmax. A case study further demonstrates the feasibility of PmiRGO to annotate the potential functions of miRNAs

    Altered microRNA and target gene expression related to Tetralogy of Fallot

    Get PDF
    MicroRNAs (miRNAs) play an important role in guiding development and maintaining function of the human heart. Dysregulation of miRNAs has been linked to various congenital heart diseases including Tetralogy of Fallot (TOF), which represents the most common cyanotic heart malformation in humans. Several studies have identified dysregulated miRNAs in right ventricular (RV) tissues of TOF patients. In this study, we profiled genome-wide the whole transcriptome and analyzed the relationship of miRNAs and mRNAs of RV tissues of a homogeneous group of 22 non-syndromic TOF patients. Observed profiles were compared to profiles obtained from right and left ventricular tissue of normal hearts. To reduce the commonly observed large list of predicted target genes of dysregulated miRNAs, we applied a stringent target prediction pipeline integrating probabilities for miRNA-mRNA interaction. The final list of disease-related miRNA-mRNA pairs comprises novel as well as known miRNAs including miR-1 and miR-133, which are essential to cardiac development and function by regulating KCNJ2, FBN2, SLC38A3 and TNNI1. Overall, our study provides additional insights into post-transcriptional gene regulation of malformed hearts of TOF patients

    Discovering lesser known molecular players and mechanistic patterns in Alzheimer's disease using an integrative disease modelling approach

    Get PDF
    Convergence of exponentially advancing technologies is driving medical research with life changing discoveries. On the contrary, repeated failures of high-profile drugs to battle Alzheimer's disease (AD) has made it one of the least successful therapeutic area. This failure pattern has provoked researchers to grapple with their beliefs about Alzheimer's aetiology. Thus, growing realisation that Amyloid-β and tau are not 'the' but rather 'one of the' factors necessitates the reassessment of pre-existing data to add new perspectives. To enable a holistic view of the disease, integrative modelling approaches are emerging as a powerful technique. Combining data at different scales and modes could considerably increase the predictive power of the integrative model by filling biological knowledge gaps. However, the reliability of the derived hypotheses largely depends on the completeness, quality, consistency, and context-specificity of the data. Thus, there is a need for agile methods and approaches that efficiently interrogate and utilise existing public data. This thesis presents the development of novel approaches and methods that address intrinsic issues of data integration and analysis in AD research. It aims to prioritise lesser-known AD candidates using highly curated and precise knowledge derived from integrated data. Here much of the emphasis is put on quality, reliability, and context-specificity. This thesis work showcases the benefit of integrating well-curated and disease-specific heterogeneous data in a semantic web-based framework for mining actionable knowledge. Furthermore, it introduces to the challenges encountered while harvesting information from literature and transcriptomic resources. State-of-the-art text-mining methodology is developed to extract miRNAs and its regulatory role in diseases and genes from the biomedical literature. To enable meta-analysis of biologically related transcriptomic data, a highly-curated metadata database has been developed, which explicates annotations specific to human and animal models. Finally, to corroborate common mechanistic patterns — embedded with novel candidates — across large-scale AD transcriptomic data, a new approach to generate gene regulatory networks has been developed. The work presented here has demonstrated its capability in identifying testable mechanistic hypotheses containing previously unknown or emerging knowledge from public data in two major publicly funded projects for Alzheimer's, Parkinson's and Epilepsy diseases

    MHIF-MSEA: a novel model of miRNA set enrichment analysis based on multi-source heterogeneous information fusion

    Get PDF
    Introduction: MicroRNAs (miRNAs) are a class of non-coding RNA molecules that play a crucial role in the regulation of diverse biological processes across various organisms. Despite not encoding proteins, miRNAs have been found to have significant implications in the onset and progression of complex human diseases.Methods: Conventional methods for miRNA functional enrichment analysis have certain limitations, and we proposed a novel method called MiRNA Set Enrichment Analysis based on Multi-source Heterogeneous Information Fusion (MHIF-MSEA). Three miRNA similarity networks (miRSN-DA, miRSN-GOA, and miRSN-PPI) were constructed in MHIF-MSEA. These networks were built based on miRNA-disease association, gene ontology (GO) annotation of target genes, and protein-protein interaction of target genes, respectively. These miRNA similarity networks were fused into a single similarity network with the averaging method. This fused network served as the input for the random walk with restart algorithm, which expanded the original miRNA list. Finally, MHIF-MSEA performed enrichment analysis on the expanded list.Results and Discussion: To determine the optimal network fusion approach, three case studies were introduced: colon cancer, breast cancer, and hepatocellular carcinoma. The experimental results revealed that the miRNA-miRNA association network constructed using miRSN-DA and miRSN-GOA exhibited superior performance as the input network. Furthermore, the MHIF-MSEA model performed enrichment analysis on differentially expressed miRNAs in breast cancer and hepatocellular carcinoma. The achieved p-values were 2.17e(-75) and 1.50e(-77), and the hit rates improved by 39.01% and 44.68% compared to traditional enrichment analysis methods, respectively. These results confirm that the MHIF-MSEA method enhances the identification of enriched miRNA sets by leveraging multiple sources of heterogeneous information, leading to improved insights into the functional implications of miRNAs in complex diseases

    Integrative methods for analyzing big data in precision medicine

    Get PDF
    We provide an overview of recent developments in big data analyses in the context of precision medicine and health informatics. With the advance in technologies capturing molecular and medical data, we entered the area of “Big Data” in biology and medicine. These data offer many opportunities to advance precision medicine. We outline key challenges in precision medicine and present recent advances in data integration-based methods to uncover personalized information from big data produced by various omics studies. We survey recent integrative methods for disease subtyping, biomarkers discovery, and drug repurposing, and list the tools that are available to domain scientists. Given the ever-growing nature of these big data, we highlight key issues that big data integration methods will face
    corecore