18 research outputs found

    Support Vector Machine-based Fuzzy Systems for Quantitative Prediction of Peptide Binding Affinity

    Get PDF
    Reliable prediction of binding affinity of peptides is one of the most challenging but important complex modelling problems in the post-genome era due to the diversity and functionality of the peptides discovered. Generally, peptide binding prediction models are commonly used to find out whether a binding exists between a certain peptide(s) and a major histocompatibility complex (MHC) molecule(s). Recent research efforts have been focused on quantifying the binding predictions. The objective of this thesis is to develop reliable real-value predictive models through the use of fuzzy systems. A non-linear system is proposed with the aid of support vector-based regression to improve the fuzzy system and applied to the real value prediction of degree of peptide binding. This research study introduced two novel methods to improve structure and parameter identification of fuzzy systems. First, the support-vector based regression is used to identify initial parameter values of the consequent part of type-1 and interval type-2 fuzzy systems. Second, an overlapping clustering concept is used to derive interval valued parameters of the premise part of the type-2 fuzzy system. Publicly available peptide binding affinity data sets obtained from the literature are used in the experimental studies of this thesis. First, the proposed models are blind validated using the peptide binding affinity data sets obtained from a modelling competition. In that competition, almost an equal number of peptide sequences in the training and testing data sets (89, 76, 133 and 133 peptides for the training and 88, 76, 133 and 47 peptides for the testing) are provided to the participants. Each peptide in the data sets was represented by 643 bio-chemical descriptors assigned to each amino acid. Second, the proposed models are cross validated using mouse class I MHC alleles (H2-Db, H2-Kb and H2-Kk). H2-Db, H2-Kb, and H2-Kk consist of 65 nona-peptides, 62 octa-peptides, and 154 octa-peptides, respectively. Compared to the previously published results in the literature, the support vector-based type-1 and support vector-based interval type-2 fuzzy models yield an improvement in the prediction accuracy. The quantitative predictive performances have been improved as much as 33.6\% for the first group of data sets and 1.32\% for the second group of data sets. The proposed models not only improved the performance of the fuzzy system (which used support vector-based regression), but the support vector-based regression benefited from the fuzzy concept also. The results obtained here sets the platform for the presented models to be considered for other application domains in computational and/or systems biology. Apart from improving the prediction accuracy, this research study has also identified specific features which play a key role(s) in making reliable peptide binding affinity predictions. The amino acid features "Polarity", "Positive charge", "Hydrophobicity coefficient", and "Zimm-Bragg parameter" are considered as highly discriminating features in the peptide binding affinity data sets. This information can be valuable in the design of peptides with strong binding affinity to a MHC I molecule(s). This information may also be useful when designing drugs and vaccines

    Quantitative prediction of peptide binding affinity by using hybrid fuzzy support vector regression

    Get PDF
    Support vector machines have a wide use for the prediction problems in life sciences. It has been shown to offer more generalisation ability in input–output mapping. However, the performance of predictive models is often negatively influenced due to the complex, high-dimensional, and non-linear nature of the post-genome data. Soft computing methods can be used to model such non-linear systems. Fuzzy systems are one of the widely used methods of soft computing that model uncertainties. It is formed of interpretable rules aiding one to gain insight into applied model. This study is therefore concerned to provide more interpretable and efficient biological model with the development of a hybrid method that integrates the fuzzy system and support vector regression. In order to demonstrate the robustness of this new hybrid method, it is applied to the prediction of peptide binding affinity being one of the most challenging problems in the post-genomic era due to diversity in peptide families and complexity and high-dimensionality in the characteristic features of the peptides. Having used four different case studies, this hybrid predictive model has yielded the highest predictive power in all the four cases and achieved an improvement of as much as 34% compared to the results presented in the literature. Availability: Matlab scripts are available at https://github.com/sekerbigdatalab/tsksvr

    Quantitative prediction of peptide binding affinity by using hybrid fuzzy support vector regression

    Get PDF
    Support vector machines have a wide use for the prediction problems in life sciences. It has been shown to offer more generalisation ability in input–output mapping. However, the performance of predictive models is often negatively influenced due to the complex, high-dimensional, and non-linear nature of the post-genome data. Soft computing methods can be used to model such non-linear systems. Fuzzy systems are one of the widely used methods of soft computing that model uncertainties. It is formed of interpretable rules aiding one to gain insight into applied model. This study is therefore concerned to provide more interpretable and efficient biological model with the development of a hybrid method that integrates the fuzzy system and support vector regression. In order to demonstrate the robustness of this new hybrid method, it is applied to the prediction of peptide binding affinity being one of the most challenging problems in the post-genomic era due to diversity in peptide families and complexity and high-dimensionality in the characteristic features of the peptides. Having used four different case studies, this hybrid predictive model has yielded the highest predictive power in all the four cases and achieved an improvement of as much as 34% compared to the results presented in the literature. Availability: Matlab scripts are available at https://github.com/sekerbigdatalab/tsksvr

    Overlapping Clusters and Support Vector Machines Based Interval Type-2 Fuzzy System for the Prediction of Peptide Binding Affinity

    Get PDF
    In the post-genome era, it is becoming more complex to process high dimensional, low-instance available, and nonlinear biological datasets. This paper aims to address these characteristics as they have adverse effects on the performance of predictive models in bioinformatics. In this paper, an interval type-2 Takagi Sugeno fuzzy predictive model is proposed in order to manage high-dimensionality and nonlinearity of such datasets which is the common feature in bioinformatics. A new clustering framework is proposed for this purpose to simplify antecedent operations for an interval type-2 fuzzy system. This new clustering framework is based on overlapping regions between the clusters. The cluster analysis of partitions and statistical information derived from them has identified the upper and lower membership functions forming the premise part. This is further enhanced by adapting the regression version of support vector machines in the consequent part. The proposed method is used in experiments to quantitatively predict affinities of peptide bindings to biomolecules. This case study imposes a challenge in post-genome studies and remains an open problem due to the complexity of the biological system, diversity of peptides, and curse of dimensionality of amino acid index representation characterizing the peptides. Utilizing four different peptide binding affinity datasets, the proposed method resulted in better generalization ability for all of them yielding an improved prediction accuracy of up to 58.2% on unseen peptides in comparison with the predictive methods presented in the literature. Source code of the algorithm is available at https://github.com/sekerbigdatalab

    Texture based characterization of sub-skin features by specified laser speckle effects at λ=650nm region

    Get PDF
    The file attached to this record is the author's final peer reviewed version. The Publisher's final version can be found by following the DOI link.Objective: The textural structure of “skin age” related sub-skin components enables us to identify and analyse their unique characteristics, thus making substantial progress towards establishing an accurate skin age model. Methods: This is achieved by a two stage process. First by the application of textural analysis using laser speckle imaging, which is sensitive to textural effects within the λ=650 nm spectral band region. In the second stage a Bayesian inference method is used to select attributes from which a predictive model is built. Results: This technique enables us to contrast different skin age models, such as the laser-speckle effect against the more widely used normal light (LED) imaging method, whereby it is shown that our laser speckle based technique yields better results. Conclusion: The method introduced here is non-invasive, low-cost and capable of operating in real-time; having the potential to compete against high-cost instrumentation such as confocal microscopy or similar imaging devices used for skin age identification purposes

    Overlapping Clusters and Support Vector Machines based Interval Type-2 Fuzzy System for the Prediction of Peptide Binding Affinity

    Get PDF
    In the post-genome era, it is becoming more complex to process high-dimensional, low-instance available and nonlinear biological datasets. This study aims at addressing these characteristics as they have adverse effects on the performance of predictive models in bioinformatics. In this paper, an interval type-2 Takagi Sugeno fuzzy predictive model is proposed in order to manage high-dimensionality and nonlinearity of such datasets which is the common feature in bioinformatics. A new clustering framework is proposed for this purpose to simplify antecedent operations for an interval type-2 fuzzy system. This new clustering framework is based on overlapping regions between the clusters. The cluster analysis of partitions and statistical information derived from them have identified the upper and lower membership functions forming the premise part. This is further enhanced by adapting the regression version of support vector machines in the consequent part. The proposed method is used in experiments to quantitatively predict affinities of peptide bindings to biomolecules. This case study imposes a challenge in post-genome studies and remains an open problem due to the complexity of the biological system, diversity of peptides and curse of dimensionality of amino acid index representation characterising the peptides. Utilizing four different peptide binding affinity datasets, the proposed method resulted in better generalisation ability for all of them yielding an improved prediction accuracy of up to 58.2% on unseen peptides in comparison with the predictive methods presented in the literature

    The quantitative prediction of HLA-B*2705 peptide binding affinities using Support Vector Regression to gain insights into its role for the Spondyloarthropathies

    No full text
    Computational methods are increasingly utilised in many immunoinformatics problems such as the prediction of binding affinity of peptides. The peptides could provide valuable insight into the drug design and development such as vaccines. Moreover, they can be used to diagnose diseases. The presence of human class I MHC allele HLA-B*2705 is one of the strong hypothesis that would lead spondyloarthropathies. In this paper, Support Vector Regression is used in order to predict binding affinity of peptides with the aid of experimentally determined peptide-MHC binding affinities of 222 peptides to HLA-B*2705 to get more insight into this problematic disease. The results yield a high correlation coefficient as much as 0.65 and the SVR-based predictive models can be considered as a useful tool in order to predict the binding affinities for newly discovered peptides

    Binding affinity prediction of S. cerevisiae 14-3-3 and GYF peptide-recognition domains using support vector regression

    No full text
    Proteins interact with other proteins and bio-molecules to carry out biological processes in a cell. Computational models help understanding complex biochemical processes that happens throughout the life of a cell. Domain-mediated protein interaction to peptides one such complex problem in bioinformatics that requires computational predictive models to identify meaningful bindings. In this study, domain-peptide binding affinity prediction models are proposed based on support vector regression. Proposed models are applied to yeast bmh 14-3-3 and syh GYF peptide-recognition domains. The cross validated results of the domain-peptide binding affinity data sets show that predictive performance of the support vector based models are efficient

    TSKSVR: Support Vector-based TSK Fuzzy Modelling

    No full text
    If you use our method, please cite our article: V. Uslan and H. Seker, “Quantitative prediction of peptide binding affinity by using hybrid fuzzy support vector regression,” Applied Soft Computing, vol. 43, pp. 210–221, 2016

    Face Mask Recognition to Identify People Wearing Masks to Support Covid-19 Prevention Policies

    No full text
    Nowadays, Covid-19 pandemic has penetrated all the human beings daily life. One of the most common and strongest way to prevent spread is wearing a face mask in public places. People gained a new term to their vocabulary; “new normal”. The new normal contains wearing a face mask as well. Therefore, a face mask recognition system is a vital need for helping daily life processes. This paper acquaints a face mask recognition to identify masked and unmasked faces to support Covid-19 policies. The face mask recognition in this paper developed by deep learning algorithm using the CNN architecture VGG-16. Our results suggest that deep learning-based method achieved high accuracy (99%) in both the validation and testing datasets
    corecore