153 research outputs found

    OptCDR: a general computational method for the design of antibody complementarity determining regions for targeted epitope binding

    Get PDF
    Antibodies are an important class of proteins with many biomedical and biotechnical applications. Although there are a plethora of experimental techniques geared toward their efficient production, there is a paucity of computational methods for their de novo design. OptCDR is a general computational method to design the binding portions of antibodies to have high specificity and affinity against any targeted epitope of an antigen. First, combinations of canonical structures for the antibody complementarity determining regions (CDRs) that are most likely to be able to favorably bind the antigen are selected. This is followed by the simultaneous refinement of the CDR structures' backbones and optimal amino acid selection for each position. OptCDR is applied to three computational test cases: a peptide from the capsid of hepatitis C, the hapten fluorescein and the protein vascular endothelial growth factor. The results demonstrate that OptCDR can efficiently generate diverse antibody libraries of a pre-specified size with promising antigen affinity potential as exemplified by computationally derived binding metrics. Keywords: antibody design/computational protein design/ fluorescein/hepatitis C/vascular endothelial growth facto

    Elucidation of Directionality for Co-Expressed Genes: Predicting Intra-Operon Termination Sites

    Full text link
    We present a novel framework for inferring regulatory and sequence-level information from gene co-expression networks. The key idea of our methodology is the systematic integration of network inference and network topological analysis approaches for uncovering biological insights. We determine the gene co-expression network of Bacillus subtilis using Affymetrix GeneChip time series data and show how the inferred network topology can be linked to sequence-level information hard-wired in the organism's genome. We propose a systematic way for determining the correlation threshold at which two genes are assessed to be co-expressed by using the clustering coefficient and we expand the scope of the gene co-expression network by proposing the slope ratio metric as a means for incorporating directionality on the edges. We show through specific examples for B. subtilis that by incorporating expression level information in addition to the temporal expression patterns, we can uncover sequence-level biological insights. In particular, we are able to identify a number of cases where (i) the co-expressed genes are part of a single transcriptional unit or operon and (ii) the inferred directionality arises due to the presence of intra-operon transcription termination sites.Comment: 7 pages, 8 figures, accepted in Bioinformatic

    Unbiased Global Optimization of Lennard-Jones Clusters for N <= 201 by Conformational Space Annealing Method

    Full text link
    We apply the conformational space annealing (CSA) method to the Lennard-Jones clusters and find all known lowest energy configurations up to 201 atoms, without using extra information of the problem such as the structures of the known global energy minima. In addition, the robustness of the algorithm with respect to the randomness of initial conditions of the problem is demonstrated by ten successful independent runs up to 183 atoms. Our results indicate that the CSA method is a general and yet efficient global optimization algorithm applicable to many systems.Comment: revtex, 4 pages, 2 figures. Physical Review Letters, in pres

    Global Optimization by Basin-Hopping and the Lowest Energy Structures of Lennard-Jones Clusters Containing up to 110 Atoms

    Full text link
    We describe a global optimization technique using `basin-hopping' in which the potential energy surface is transformed into a collection of interpenetrating staircases. This method has been designed to exploit the features which recent work suggests must be present in an energy landscape for efficient relaxation to the global minimum. The transformation associates any point in configuration space with the local minimum obtained by a geometry optimization started from that point, effectively removing transition state regions from the problem. However, unlike other methods based upon hypersurface deformation, this transformation does not change the global minimum. The lowest known structures are located for all Lennard-Jones clusters up to 110 atoms, including a number that have never been found before in unbiased searches.Comment: 8 pages, 3 figures, revte

    Testing Nelder-Mead based repulsion algorithms for multiple roots of nonlinear systems via a two-level factorial design of experiments

    Get PDF
    This paper addresses the challenging task of computing multiple roots of a system of nonlinear equations. A repulsion algorithm that invokes the Nelder-Mead (N-M) local search method and uses a penalty-type merit function based on the error function, known as 'erf', is presented. In the N-M algorithm context, different strategies are proposed to enhance the quality of the solutions and improve the overall efficiency. The main goal of this paper is to use a two-level factorial design of experiments to analyze the statistical significance of the observed differences in selected performance criteria produced when testing different strategies in the N-M based repulsion algorithm. The main goal of this paper is to use a two-level factorial design of experiments to analyze the statistical significance of the observed differences in selected performance criteria produced when testing different strategies in the N-M based repulsion algorithm.Fundação para a Ciência e Tecnologia (FCT

    Exploiting the genetic diversity of maize using a combined metabolomic, enzyme activity profiling, and metabolic modelling approach to link leaf physiology to kernel yield

    Get PDF
    A combined metabolomic, biochemical, fluxomic, and metabolic modeling approach was developed using 19 genetically distant maize (Zea mays) lines from Europe and America. Considerable differences were detected between the lines when leaf metabolic profiles and activities of the main enzymes involved in primary metabolism were compared. During grain filling, the leaf metabolic composition appeared to be a reliable marker, allowing a classification matching the genetic diversity of the lines. During the same period, there was a significant correlation between the genetic distance of the lines and the activities of enzymes involved in carbon metabolism, notably glycolysis. Although large differences were observed in terms of leaf metabolic fluxes, these variations were not tightly linked to the genome structure of the lines. Both correlation studies and metabolic network analyses allowed the description of a maize ideotype with a high grain yield potential. Such an ideotype is characterized by low accumulation of soluble amino acids and carbohydrates in the leaves and high activity of enzymes involved in the C4 photosynthetic pathway and in the biosynthesis of amino acids derived from glutamate. Chlorogenates appear to be important markers that can be used to select for maize lines that produce larger kernels

    Reconstruction of Genome-Scale Active Metabolic Networks for 69 Human Cell Types and 16 Cancer Types Using INIT

    Get PDF
    Development of high throughput analytical methods has given physicians the potential access to extensive and patient-specific data sets, such as gene sequences, gene expression profiles or metabolite footprints. This opens for a new approach in health care, which is both personalized and based on system-level analysis. Genome-scale metabolic networks provide a mechanistic description of the relationships between different genes, which is valuable for the analysis and interpretation of large experimental data-sets. Here we describe the generation of genome-scale active metabolic networks for 69 different cell types and 16 cancer types using the INIT (Integrative Network Inference for Tissues) algorithm. The INIT algorithm uses cell type specific information about protein abundances contained in the Human Proteome Atlas as the main source of evidence. The generated models constitute the first step towards establishing a Human Metabolic Atlas, which will be a comprehensive description (accessible online) of the metabolism of different human cell types, and will allow for tissue-level and organism-level simulations in order to achieve a better understanding of complex diseases. A comparative analysis between the active metabolic networks of cancer types and healthy cell types allowed for identification of cancer-specific metabolic features that constitute generic potential drug targets for cancer treatment

    Zea mays iRS1563: A Comprehensive Genome-Scale Metabolic Reconstruction of Maize Metabolism

    Get PDF
    The scope and breadth of genome-scale metabolic reconstructions have continued to expand over the last decade. Herein, we introduce a genome-scale model for a plant with direct applications to food and bioenergy production (i.e., maize). Maize annotation is still underway, which introduces significant challenges in the association of metabolic functions to genes. The developed model is designed to meet rigorous standards on gene-protein-reaction (GPR) associations, elementally and charged balanced reactions and a biomass reaction abstracting the relative contribution of all biomass constituents. The metabolic network contains 1,563 genes and 1,825 metabolites involved in 1,985 reactions from primary and secondary maize metabolism. For approximately 42% of the reactions direct literature evidence for the participation of the reaction in maize was found. As many as 445 reactions and 369 metabolites are unique to the maize model compared to the AraGEM model for A. thaliana. 674 metabolites and 893 reactions are present in Zea mays iRS1563 that are not accounted for in maize C4GEM. All reactions are elementally and charged balanced and localized into six different compartments (i.e., cytoplasm, mitochondrion, plastid, peroxisome, vacuole and extracellular). GPR associations are also established based on the functional annotation information and homology prediction accounting for monofunctional, multifunctional and multimeric proteins, isozymes and protein complexes. We describe results from performing flux balance analysis under different physiological conditions, (i.e., photosynthesis, photorespiration and respiration) of a C4 plant and also explore model predictions against experimental observations for two naturally occurring mutants (i.e., bm1 and bm3). The developed model corresponds to the largest and more complete to-date effort at cataloguing metabolism for a plant species

    A Genome-Scale Metabolic Reconstruction of Mycoplasma genitalium, iPS189

    Get PDF
    With a genome size of ∼580 kb and approximately 480 protein coding regions, Mycoplasma genitalium is one of the smallest known self-replicating organisms and, additionally, has extremely fastidious nutrient requirements. The reduced genomic content of M. genitalium has led researchers to suggest that the molecular assembly contained in this organism may be a close approximation to the minimal set of genes required for bacterial growth. Here, we introduce a systematic approach for the construction and curation of a genome-scale in silico metabolic model for M. genitalium. Key challenges included estimation of biomass composition, handling of enzymes with broad specificities, and the lack of a defined medium. Computational tools were subsequently employed to identify and resolve connectivity gaps in the model as well as growth prediction inconsistencies with gene essentiality experimental data. The curated model, M. genitalium iPS189 (262 reactions, 274 metabolites), is 87% accurate in recapitulating in vivo gene essentiality results for M. genitalium. Approaches and tools described herein provide a roadmap for the automated construction of in silico metabolic models of other organisms
    • …
    corecore