
6 research outputs found

DeepFrag-k: A Fragment-Based Deep Learning Approach for Protein Fold Recognition

Author: Elhefnawy Wessam; Li Min; Li Yaohang; Wang Jianxin
Publication venue: ODU Digital Commons
Publication date: 01/11/2020

Background: One of the most essential problems in structural bioinformatics is protein fold recognition. In this paper, we design a novel deep learning architecture, called DeepFrag-k, which identifies fold-discriminative features at the fragment level to improve the accuracy of protein fold recognition. DeepFrag-k is composed of two stages: the first stage employs a multi-modal Deep Belief Network (DBN) to predict the potential structural fragments given a sequence, represented as a fragment vector; the second stage uses a deep convolutional neural network (CNN) to classify the fragment vector into the corresponding fold. Results: Our results show that DeepFrag-k yields 92.98% accuracy in predicting the top-100 most popular fragments, which can be used to generate discriminative fragment feature vectors to improve protein fold recognition. Conclusions: There is a set of fragments that can serve as structural “keywords” distinguishing between major protein folds. The deep learning architecture in DeepFrag-k is able to accurately identify these fragments as structure features to improve protein fold recognition.

Old Dominion University

Predicting MoRFs in protein sequences using HMM profiles

Publication venue: BioMed Central

Springer - Publisher Connector

ProFold: Protein Fold Classification with Additional Structural Features and a Novel Ensemble Classifier

Publication venue: Hindawi Limited
Publication date: 01/01/2016

Crossref

A segmentation-based method to extract structural and evolutionary features for protein fold recognition

Author: Dehzangi A.; Lyons J.; Paliwal K.K.; Sattar A.; Sharma Alokanand
Publication date: 01/01/2014

Protein fold recognition (PFR) is considered an important step towards solving the protein structure prediction problem. Despite the efforts made so far, finding an accurate and fast computational approach to PFR remains a challenging problem for bioinformatics and computational biology. In this study, we propose a segmentation-based feature extraction technique to provide local evolutionary information embedded in the Position Specific Scoring Matrix (PSSM) and structural information embedded in the secondary structure of proteins predicted using SPINE-X. We also employ the concept of occurrence features to extract global discriminatory information from PSSM and SPINE-X. By applying a Support Vector Machine (SVM) to our extracted features, we enhance the protein fold prediction accuracy by 7.4% over the best results reported in the literature. We also report 73.8% prediction accuracy for a data set consisting of proteins with less than 25% sequence similarity and 80.7% prediction accuracy for a data set with proteins belonging to 110 folds with less than 40% sequence similarity. We also investigate the relation between the number of folds and the number of features being used and show that the number of features should be increased to get better protein fold prediction results when the number of folds is relatively large.
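
As a rough illustration of the segmentation idea in this abstract, the sketch below splits a PSSM into contiguous segments along the sequence and averages each column within each segment, giving a fixed-length feature vector regardless of protein length. The function name `segmented_pssm_features`, the use of mean pooling, and the default of four segments are assumptions made for illustration; the abstract does not state the exact feature definition.

```python
def segmented_pssm_features(pssm, n_segments=4):
    """Hypothetical sketch: pool a PSSM (list of per-residue score rows)
    into n_segments * n_columns features by per-segment column means."""
    n_rows = len(pssm)
    if n_segments < 1:
        raise ValueError("n_segments must be at least 1")
    if n_rows < n_segments:
        raise ValueError("sequence is shorter than the number of segments")
    n_cols = len(pssm[0])

    features = []
    for s in range(n_segments):
        # Integer boundaries keep segments contiguous and non-empty,
        # even when the length is not divisible by n_segments.
        start = s * n_rows // n_segments
        end = (s + 1) * n_rows // n_segments
        segment = pssm[start:end]
        for c in range(n_cols):
            features.append(sum(row[c] for row in segment) / len(segment))
    return features


# Toy example with 4 residues and 2 score columns (a real PSSM has 20).
toy_pssm = [[1, 2], [3, 4], [5, 6], [7, 8]]
print(segmented_pssm_features(toy_pssm, n_segments=2))
```

A vector of this kind could then be passed to any standard classifier, such as the SVM mentioned in the abstract.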

CiteSeerX

University of the South Pacific Electronic Research Repository

Highly Accurate Fragment Library for Protein Fold Recognition

Author: Elhefnawy Wessam
Publication venue: ODU Digital Commons
Publication date: 01/04/2019

Proteins play a crucial role in living organisms as they perform many vital tasks in every living cell. Knowledge of protein folding has a deep impact on understanding the heterogeneity and molecular functions of proteins. Such information leads to crucial advances in drug design and disease understanding. Fold recognition is a key step in the protein structure discovery process, especially when traditional computational methods fail to yield convincing structural homologies. In this work, we present a new protein fold recognition approach using machine learning and data mining methodologies. First, we identify a protein structural fragment library (Frag-K) composed of a set of backbone fragments ranging from 4 to 20 residues as the structural “keywords” that can effectively distinguish between major protein folds. We first apply randomized spectral clustering and random forest algorithms to construct representative and sensitive protein fragment libraries from a large-scale set of high-quality, non-homologous protein structures available in PDB. We analyze the impacts of clustering cut-offs on the performance of the fragment libraries. Then, the Frag-K fragments are employed as structural features to classify protein structures in major protein folds defined by SCOP (Structural Classification of Proteins). Our results show that a structural dictionary with ~400 4- to 20-residue Frag-K fragments is capable of classifying major SCOP folds with high accuracy. Then, based on Frag-K, we design a novel deep learning architecture, called DeepFrag-k, which identifies fold-discriminative features to improve the accuracy of protein fold recognition.
DeepFrag-k is composed of two stages: the first stage employs a multimodal Deep Belief Network (DBN) to predict the potential structural fragments given a sequence, represented as a fragment vector; the second stage uses a deep convolutional neural network (CNN) to classify the fragment vectors into the corresponding folds. Our results show that DeepFrag-k yields 92.98% accuracy in predicting the top-100 most popular fragments, which can be used to generate discriminative fragment feature vectors to improve protein fold recognition.
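
The structural "keywords" idea invites a bag-of-words analogy. The sketch below shows one simple way a fragment dictionary could be turned into a fixed-length feature vector, by counting how often each library fragment is assigned along a protein. This encoding, the function name `fragment_count_vector`, and the integer fragment IDs are assumptions made for illustration; the abstract does not say that Frag-K or DeepFrag-k builds its fragment vector this way.

```python
from collections import Counter


def fragment_count_vector(fragment_ids, library_size):
    """Hypothetical sketch: count occurrences of each library fragment.

    fragment_ids: fragment indices (0 .. library_size - 1) assigned
                  along one protein, in any order.
    library_size: number of fragments in the dictionary.
    """
    for fid in fragment_ids:
        if not 0 <= fid < library_size:
            raise ValueError(f"fragment id {fid} is outside the library")
    counts = Counter(fragment_ids)
    # One slot per library fragment; fragments never seen get a zero.
    return [counts.get(i, 0) for i in range(library_size)]


# Toy example with a 5-fragment library.
print(fragment_count_vector([0, 2, 2, 3], library_size=5))
```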

Old Dominion University

A Segmentation-Based Method to Extract Structural and Evolutionary Features for Protein Fold Recognition

Author: Abdollah Dehzangi; Abdul Sattar; Alok Sharma; James Lyons; Kuldip Paliwal
Publication venue: Institute of Electrical and Electronics Engineers (IEEE)

Crossref