72,861 research outputs found

    Prediction of Metabolic Pathways Involvement in Prokaryotic UniProtKB Data by Association Rule Mining

    Full text link
    The widening gap between known proteins and their functions has encouraged the development of methods to automatically infer annotations. Automatic functional annotation of proteins is expected to meet the conflicting requirements of maximizing annotation coverage, while minimizing erroneous functional assignments. This trade-off imposes a great challenge in designing intelligent systems to tackle the problem of automatic protein annotation. In this work, we present a system that utilizes rule mining techniques to predict metabolic pathways in prokaryotes. The resulting knowledge represents predictive models that assign pathway involvement to UniProtKB entries. We carried out an evaluation study of our system performance using cross-validation technique. We found that it achieved very promising results in pathway identification with an F1-measure of 0.982 and an AUC of 0.987. Our prediction models were then successfully applied to 6.2 million UniProtKB/TrEMBL reference proteome entries of prokaryotes. As a result, 663,724 entries were covered, where 436,510 of them lacked any previous pathway annotations

    On the Road to Accurate Biomarkers for Cardiometabolic Diseases by Integrating Precision and Gender Medicine Approaches

    Get PDF
    The need to facilitate the complex management of cardiometabolic diseases (CMD) has led to the detection of many biomarkers, however, there are no clear explanations of their role in the prevention, diagnosis or prognosis of these diseases. Molecules associated with disease pathways represent valid disease surrogates and well-fitted CMD biomarkers. To address this challenge, data from multi-omics types (genomics, epigenomics, transcriptomics, proteomics, metabolomics, microbiomics, and nutrigenomics), from human and animal models, have become available. However, individual omics types only provide data on a small part of molecules involved in the complex CMD mechanisms, whereas, here, we propose that their integration leads to multidimensional data. Such data provide a better understanding of molecules related to CMD mechanisms and, consequently, increase the possibility of identifying well-fitted biomarkers. In addition, the application of gender medicine also helps to identify accurate biomarkers according to gender, facilitating a differential CMD management. Accordingly, the impact of gender differences in CMD pathophysiology has been widely demonstrated, where gender is referred to the complex interrelation and integration of sex (as a biological and functional marker of the human body) and psychological and cultural behavior (due to ethnical, social, and religious background). In this review, all these aspects are described and discussed, as well as potential limitations and future directions in this incipient field

    Epigenomes in Cardiovascular Disease.

    Get PDF
    If unifying principles could be revealed for how the same genome encodes different eukaryotic cells and for how genetic variability and environmental input are integrated to impact cardiovascular health, grand challenges in basic cell biology and translational medicine may succumb to experimental dissection. A rich body of work in model systems has implicated chromatin-modifying enzymes, DNA methylation, noncoding RNAs, and other transcriptome-shaping factors in adult health and in the development, progression, and mitigation of cardiovascular disease. Meanwhile, deployment of epigenomic tools, powered by next-generation sequencing technologies in cardiovascular models and human populations, has enabled description of epigenomic landscapes underpinning cellular function in the cardiovascular system. This essay aims to unpack the conceptual framework in which epigenomes are studied and to stimulate discussion on how principles of chromatin function may inform investigations of cardiovascular disease and the development of new therapies

    Pharmacogenetics : the science of predictive clinical pharmacology

    Get PDF
    The study of pharmacogenetics has expanded from what were initially casual family-based clinical drug response observations, to a fully-fledged science with direct therapeutic applications, all within a time-span of less than 60 years. A wide spectrum of polymorphisms, located within several genes, are now recognised to influence the pharmacokinetics and pharmacodynamics of the majority of drugs within our therapeutic armamentarium. This information forms the basis for the new development of pharmacogenetic genotyping tests, which can be used to predict the therapeutic and/or adverse effects of a specific drug in a particular patient. Pharmacogenetic-guided, patient targeted therapy has now become the developing fulcrum of personalized medicine, as it provides the best means to optimize benefit/risk ratio in pharmacological management.peer-reviewe

    EFICAz²: enzyme function inference by a combined approach enhanced by machine learning

    Get PDF
    ©2009 Arakaki et al; licensee BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The electronic version of this article is the complete one and can be found online at: http://www.biomedcentral.com/1471-2105/10/107doi:10.1186/1471-2105-10-107Background: We previously developed EFICAz, an enzyme function inference approach that combines predictions from non-completely overlapping component methods. Two of the four components in the original EFICAz are based on the detection of functionally discriminating residues (FDRs). FDRs distinguish between member of an enzyme family that are homofunctional (classified under the EC number of interest) or heterofunctional (annotated with another EC number or lacking enzymatic activity). Each of the two FDR-based components is associated to one of two specific kinds of enzyme families. EFICAz exhibits high precision performance, except when the maximal test to training sequence identity (MTTSI) is lower than 30%. To improve EFICAz's performance in this regime, we: i) increased the number of predictive components and ii) took advantage of consensual information from the different components to make the final EC number assignment. Results: We have developed two new EFICAz components, analogs to the two FDR-based components, where the discrimination between homo and heterofunctional members is based on the evaluation, via Support Vector Machine models, of all the aligned positions between the query sequence and the multiple sequence alignments associated to the enzyme families. Benchmark results indicate that: i) the new SVM-based components outperform their FDR-based counterparts, and ii) both SVM-based and FDR-based components generate unique predictions. We developed classification tree models to optimally combine the results from the six EFICAz components into a final EC number prediction. The new implementation of our approach, EFICAz², exhibits a highly improved prediction precision at MTTSI < 30% compared to the original EFICAz, with only a slight decrease in prediction recall. A comparative analysis of enzyme function annotation of the human proteome by EFICAz² and KEGG shows that: i) when both sources make EC number assignments for the same protein sequence, the assignments tend to be consistent and ii) EFICAz² generates considerably more unique assignments than KEGG. Conclusion: Performance benchmarks and the comparison with KEGG demonstrate that EFICAz² is a powerful and precise tool for enzyme function annotation, with multiple applications in genome analysis and metabolic pathway reconstruction. The EFICAz² web service is available at: http://cssb.biology.gatech.edu/skolnick/webservice/EFICAz2/index.htm
    • …
    corecore