115 research outputs found

    Overlapping Clusters and Support Vector Machines Based Interval Type-2 Fuzzy System for the Prediction of Peptide Binding Affinity

    Get PDF
    In the post-genome era, it is becoming more complex to process high dimensional, low-instance available, and nonlinear biological datasets. This paper aims to address these characteristics as they have adverse effects on the performance of predictive models in bioinformatics. In this paper, an interval type-2 Takagi Sugeno fuzzy predictive model is proposed in order to manage high-dimensionality and nonlinearity of such datasets which is the common feature in bioinformatics. A new clustering framework is proposed for this purpose to simplify antecedent operations for an interval type-2 fuzzy system. This new clustering framework is based on overlapping regions between the clusters. The cluster analysis of partitions and statistical information derived from them has identified the upper and lower membership functions forming the premise part. This is further enhanced by adapting the regression version of support vector machines in the consequent part. The proposed method is used in experiments to quantitatively predict affinities of peptide bindings to biomolecules. This case study imposes a challenge in post-genome studies and remains an open problem due to the complexity of the biological system, diversity of peptides, and curse of dimensionality of amino acid index representation characterizing the peptides. Utilizing four different peptide binding affinity datasets, the proposed method resulted in better generalization ability for all of them yielding an improved prediction accuracy of up to 58.2% on unseen peptides in comparison with the predictive methods presented in the literature. Source code of the algorithm is available at https://github.com/sekerbigdatalab

    Advanced Immunoinformatics Approaches for Precision Medicine

    Get PDF
    Genomic sequencing and other ’-omic’ technologies are slowly changing biomedical practice. As a result, patients now can be treated based on their molecular profile. Especially the immune system’s variability, in particular that of the human leukocyte antigen (HLA) gene cluster, makes such a paradigm indispensable when treating illnesses such as cancer, autoimmune diseases, or infectious diseases. It can be, however, costly and time-consuming to determine the HLA genotype with traditional means, as these methods do not utilize often pre-existing sequencing data. We therefore proposed an algorithmic approach that can use these data sources to infer the HLA genotype. HLA genotyping inference can be cast into a set covering problem under special biological constraints and can be solved efficiently via integer linear programming. Our proposed approach outperformed previously published methods and remains one of the most accurate methods to date. We then introduced two applications in which a HLA-based stratification is vital for the efficacy of the treatment and the reduction of its adverse effects. In the first example, we dealt with the optimal design of string-of-beads vaccines (SOB). We developed a mathematical model that maximizes the efficacy of such vaccines while minimizing their side effects based on a given HLA distribution. Comparisons of our optimally designed SOB with experimentally tested designs yielded promising results. In the second example, we considered the problem of anti-drug antibody (ADA) formation of biotherapeutics caused by HLA presented peptides. We combined a new statistical model for mutation effect prediction together with a quantitative measure of immunogenicity to formulate an optimization problem that finds alterations to reduce the risk of ADA formation. To efficiently solve this bi-objective problem, we developed a distributed solver that is up to 25-times faster than state-of-the art solvers. We used our approach to design the C2 domain of factor VIII, which is linked to ADA formation in hemophilia A. Our experimental evaluations of the proposed designs are encouraging and demonstrate the prospects of our approach. Bioinformatics is an integral part of modern biomedical research. The translation of advanced methods into clinical use is often complicated. To ease the translation, we developed a programming library for computational immunology and used it to implement a Galaxy-based web server for vaccine design and a KNIME extension for desktop PCs. These platforms allow researchers to develop their own immunoinformatics workflows utilizing the platform’s graphical programming capabilities
    • …
    corecore