59,765 research outputs found

    Searching for Protein Classification Features

    Get PDF
    A genetic algorithm is used to search for a set of classification features for a protein superfamily which is as unique as possible to the superfamily. These features may then be used for very fast classification of a query sequence into a protein superfamily. The features are based on windows onto modified consensus sequences of multiple aligned members of a training set for the protein superfamily. The efficacy of the method is demonstrated using receiver operating characteristic (ROC) values and the performance of resulting algorithm is compared with other database search algorithms

    Using RRC Algorithm Classify the Proteins and Visualize in Biological Databases

    Get PDF
    Visualize biological database for protein is very complicated without Classify the protein properties.Protein classification is one of the major application of machine learning algorithms in the field of bio-informatics.The searching classification model works in two steps.Firstly, the correlation based feature selection for protein classification will be taken and strongly correlated features will be considered for classification using MST based . In second step, using Robust Regression, the classification will be performed. Based on results of RRC algorithm, it is highly has classification ratio than traditional machine learning algorithms such as SVM, Naļæ½ve-bayes , Decision Trees

    Using RRC Algorithm Classify the Proteins and Visualize in Biological Databases

    Get PDF
    Visualize biological database for protein is very complicated without Classify the protein properties.Protein classification is one of the major application of machine learning algorithms in the field of bio-informatics.The searching classification model works in two steps.Firstly, the correlation based feature selection for protein classification will be taken and strongly correlated features will be considered for classification using MST based . In second step, using Robust Regression, the classification will be performed. Based on results of RRC algorithm, it is highly has classification ratio than traditional machine learning algorithms such as SVM, Naļæ½ve-bayes , Decision Trees

    Protein sequence analysis using the MPI Bioinformatics Toolkit

    Get PDF
    The MPI Bioinformatics Toolkit (https://toolkit.tuebingen.mpg.de) provides interactive access to a wide range of the bestā€performing bioinformatics tools and databases, including the stateā€ofā€theā€art protein sequence comparison methods HHblits and HHpred. The Toolkit currently includes 35 external and inā€house tools, covering functionalities such as sequence similarity searching, prediction of sequence features, and sequence classification. Due to this breadth of functionality, the tight interconnection of its constituent tools, and its ease of use, the Toolkit has become an important resource for biomedical research and for teaching protein sequence analysis to students in the life sciences. In this article, we provide detailed information on utilizing the three most widely accessed tools within the Toolkit: HHpred for the detection of homologs, HHpred in conjunction with MODELLER for structure prediction and homology modeling, and CLANS for the visualization of relationships in large sequence datasets. Basic Protocol 1: Sequence similarity searching using HHpred Alternate Protocol: Pairwise sequence comparison using HHpred Support Protocol: Building a custom multiple sequence alignment using PSIā€BLAST and forwarding it as input to HHpred Basic Protocol 2: Calculation of homology models using HHpred and MODELLER Basic Protocol 3: Cluster analysis using CLAN

    The RCSB Protein Data Bank: views of structural biology for basic and applied research and education.

    Get PDF
    The RCSB Protein Data Bank (RCSB PDB, http://www.rcsb.org) provides access to 3D structures of biological macromolecules and is one of the leading resources in biology and biomedicine worldwide. Our efforts over the past 2 years focused on enabling a deeper understanding of structural biology and providing new structural views of biology that support both basic and applied research and education. Herein, we describe recently introduced data annotations including integration with external biological resources, such as gene and drug databases, new visualization tools and improved support for the mobile web. We also describe access to data files, web services and open access software components to enable software developers to more effectively mine the PDB archive and related annotations. Our efforts are aimed at expanding the role of 3D structure in understanding biology and medicine

    A^2-Net: Molecular Structure Estimation from Cryo-EM Density Volumes

    Full text link
    Constructing of molecular structural models from Cryo-Electron Microscopy (Cryo-EM) density volumes is the critical last step of structure determination by Cryo-EM technologies. Methods have evolved from manual construction by structural biologists to perform 6D translation-rotation searching, which is extremely compute-intensive. In this paper, we propose a learning-based method and formulate this problem as a vision-inspired 3D detection and pose estimation task. We develop a deep learning framework for amino acid determination in a 3D Cryo-EM density volume. We also design a sequence-guided Monte Carlo Tree Search (MCTS) to thread over the candidate amino acids to form the molecular structure. This framework achieves 91% coverage on our newly proposed dataset and takes only a few minutes for a typical structure with a thousand amino acids. Our method is hundreds of times faster and several times more accurate than existing automated solutions without any human intervention.Comment: 8 pages, 5 figures, 4 table
    • ā€¦
    corecore