3,557 research outputs found

    The Application of Hybridized Genetic Algorithms to the Protein Folding Problem

    Get PDF
    The protein folding problem consists of attempting to determine the native conformation of a protein given its primary structure. This study examines various methods of hybridizing a genetic algorithm implementation in order to minimize an energy function and predict the conformation (structure) of Met-enkephalin. Genetic Algorithms are semi-optimal algorithms designed to explore and exploit a search space. The genetic algorithm uses selection, recombination, and mutation operators on populations of strings which represent possible solutions to the given problem. One step in solving the protein folding problem is the design of efficient energy minimization techniques. A conjugate gradient minimization technique is described and tested with different replacement frequencies. Baidwinian, Lamarckian, and probabilistic Lamarckian evolution are all tested. Another extension of simple genetic algorithms can be accomplished with niching. Niching works by de-emphasizing solutions based on their proximity to other solutions in the space. Several variations of niching are tested. Experiments are conducted to determine the benefits of each hybridization technique versus each other and versus the genetic algorithm by itself. The experiments are geared toward trying to find the lowest possible energy and hence the minimum conformation of Met-enkephalin. In the experiments, probabilistic Lamarckian strategies were successful in achieving energies below that of the published minimum in QUANTA

    A Multiobjective Approach Applied to the Protein Structure Prediction Problem

    Get PDF
    Interest in discovering a methodology for solving the Protein Structure Prediction problem extends into many fields of study including biochemistry, medicine, biology, and numerous engineering and science disciplines. Experimental approaches, such as, x-ray crystallographic studies or solution Nuclear Magnetic Resonance Spectroscopy, to mathematical modeling, such as minimum energy models are used to solve this problem. Recently, Evolutionary Algorithm studies at the Air Force Institute of Technology include the following: Simple Genetic Algorithm (GA), messy GA, fast messy GA, and Linkage Learning GA, as approaches for potential protein energy minimization. Prepackaged software like GENOCOP, GENESIS, and mGA are in use to facilitate experimentation of these techniques. In addition to this software, a parallelized version of the fmGA, the so-called parallel fast messy GA, is found to be good at finding semi-optimal answers in reasonable wall clock time. The aim of this work is to apply a Multiobjective approach to solving this problem using a modified fast messy GA. By dividing the CHARMm energy model into separate objectives, it should be possible to find structural configurations of a protein that yield lower energy values and ultimately more correct conformations

    Parallel evolution strategy for protein threading.

    Get PDF
    A protein-sequence folds into a specific shape in order to function in its aqueous state. If the primary sequence of a protein is given, what is its three dimensional structure? This is a long-standing problem in the field of molecular biology and it has large implication to drug design and cure. Among several proposed approaches, protein threading represents one of the most promising technique. The protein threading problem (PTP) is the problem of determining the three-dimensional structure of a given but arbitrary protein sequence from a set of known structures of other proteins. This problem is known to be NP-hard and current computational approaches to threading are time-consuming and data-intensive. In this thesis, we proposed an evolution strategy (ES) based approach for protein threading (EST). We also developed two parallel approaches for the PTP problem and both are parallelizations of our novel EST. The first method, we call SQST-PEST (Single Query Single Template Parallel EST) threads a single query against a single template. We use ES to find the best alignment between the query and the template, and ES is parallelized. The second method, we call SQMT-PEST (Single Query Multiple Templates Parallel EST) to allow for threading a single query against multiple templates within reasonable time. We obtained better results than current comparable approaches, as well as significant reduction in execution time.Dept. of Computer Science. Paper copy at Leddy Library: Theses & Major Papers - Basement, West Bldg. / Call Number: Thesis2005 .I85. Source: Masters Abstracts International, Volume: 44-03, page: 1403. Thesis (M.Sc.)--University of Windsor (Canada), 2005

    Feature Reduction in Clinical Data Classification using Augmented Genetic Algorithm

    Get PDF
    In clinical data, we have a large set of diagnostic feature and recorded details of patients for certain diseases. In a clinical environment a doctor reaches a treatment decision based on his theoretical knowledge, information attained from patients, and the clinical reports of the patient. It is very difficult to work with huge data in machine learning; hence to reduce the data, feature reduction is applied. Feature reduction has gained interest in many research areas which deals with machine learning and data mining, because it enhances the classifiers in terms of faster execution, cost-effectiveness, and accuracy. Using feature reduction we intend to find the relevant features of the data set. In this paper, we have analyzed Modified GA (MGA), PCA and combination of PCA and Modified Genetic algorithm for feature reduction. We have found that correctly classified rate of combination of PCA and Modified Genetic algorithm higher compared to other feature reduction method

    Development of genetic algorithm for optimisation of predicted membrane protein structures

    Get PDF
    Due to the inherent problems with their structural elucidation in the laboratory, the computational prediction of membrane protein structure is an essential step toward understanding the function of these leading targets for drug discovery. In this work, the development of a genetic algorithm technique is described that is able to generate predictive 3D structures of membrane proteins in an ab initio fashion that possess high stability and similarity to the native structure. This is accomplished through optimisation of the distances between TM regions and the end-on rotation of each TM helix. The starting point for the genetic algorithm is from the model of general TM region arrangement predicted using the TMRelate program. From these approximate starting coordinates, the TMBuilder program is used to generate the helical backbone 3D coordinates. The amino acid side chains are constructed using the MaxSprout algorithm. The genetic algorithm is designed to represent a TM protein structure by encoding each alpha carbon atom starting position, the starting atom of the initial residue of each helix, and operates by manipulating these starting positions. To evaluate each predicted structure, the SwissPDBViewer software (incorporating the GROMOS force field software) is employed to calculate the free potential energy. For the first time, a GA has been successfully applied to the problem of predicting membrane protein structure. Comparison between newly predicted structures (tests) and the native structure (control) indicate that the developed GA approach represents an efficient and fast method for refinement of predicted TM protein structures. Further enhancement of the performance of the GA allows the TMGA system to generate predictive structures with comparable energetic stability and reasonable structural similarity to the native structure

    Do theoretical physicists care about the protein-folding problem?

    Full text link
    The prediction of the biologically active native conformation of a protein is one of the fundamental challenges of structural biology. This problem remains yet unsolved mainly due to three factors: the partial knowledge of the effective free energy function that governs the folding process, the enormous size of the conformational space of a protein and, finally, the relatively small differences of energy between conformations, in particular, between the native one and the ones that make up the unfolded state. Herein, we recall the importance of taking into account, in a detailed manner, the many interactions involved in the protein folding problem (such as steric volume exclusion, Ramachandran forces, hydrogen bonds, weakly polar interactions, coulombic energy or hydrophobic attraction) and we propose a strategy to effectively construct a free energy function that, including the effects of the solvent, could be numerically tractable. It must be pointed out that, since the internal free energy function that is mainly described does not include the constraints of the native conformation, it could only help to reach the 'molten globule' state. We also discuss about the limits and the lacks from which suffer the simple models that we, physicists, love so much.Comment: 27 pages, 4 figures, LaTeX file, aipproc package. To be published in the book: "Meeting on Fundamental Physics 'Alberto Galindo'", Alvarez-Estrada R. F. et al. (Ed.), Madrid: Aula Documental, 200

    Bioinformatics

    Get PDF
    This book is divided into different research areas relevant in Bioinformatics such as biological networks, next generation sequencing, high performance computing, molecular modeling, structural bioinformatics, molecular modeling and intelligent data analysis. Each book section introduces the basic concepts and then explains its application to problems of great relevance, so both novice and expert readers can benefit from the information and research works presented here
    • …
    corecore