155 research outputs found
OptCDR: a general computational method for the design of antibody complementarity determining regions for targeted epitope binding
Antibodies are an important class of proteins with many biomedical and biotechnical applications. Although there are a plethora of experimental techniques geared toward their efficient production, there is a paucity of computational methods for their de novo design. OptCDR is a general computational method to design the binding portions of antibodies to have high specificity and affinity against any targeted epitope of an antigen. First, combinations of canonical structures for the antibody complementarity determining regions (CDRs) that are most likely to be able to favorably bind the antigen are selected. This is followed by the simultaneous refinement of the CDR structures' backbones and optimal amino acid selection for each position. OptCDR is applied to three computational test cases: a peptide from the capsid of hepatitis C, the hapten fluorescein and the protein vascular endothelial growth factor. The results demonstrate that OptCDR can efficiently generate diverse antibody libraries of a pre-specified size with promising antigen affinity potential as exemplified by computationally derived binding metrics. Keywords: antibody design/computational protein design/ fluorescein/hepatitis C/vascular endothelial growth facto
Elucidation of Directionality for Co-Expressed Genes: Predicting Intra-Operon Termination Sites
We present a novel framework for inferring regulatory and sequence-level
information from gene co-expression networks. The key idea of our methodology
is the systematic integration of network inference and network topological
analysis approaches for uncovering biological insights. We determine the gene
co-expression network of Bacillus subtilis using Affymetrix GeneChip time
series data and show how the inferred network topology can be linked to
sequence-level information hard-wired in the organism's genome. We propose a
systematic way for determining the correlation threshold at which two genes are
assessed to be co-expressed by using the clustering coefficient and we expand
the scope of the gene co-expression network by proposing the slope ratio metric
as a means for incorporating directionality on the edges. We show through
specific examples for B. subtilis that by incorporating expression level
information in addition to the temporal expression patterns, we can uncover
sequence-level biological insights. In particular, we are able to identify a
number of cases where (i) the co-expressed genes are part of a single
transcriptional unit or operon and (ii) the inferred directionality arises due
to the presence of intra-operon transcription termination sites.Comment: 7 pages, 8 figures, accepted in Bioinformatic
Unbiased Global Optimization of Lennard-Jones Clusters for N <= 201 by Conformational Space Annealing Method
We apply the conformational space annealing (CSA) method to the Lennard-Jones
clusters and find all known lowest energy configurations up to 201 atoms,
without using extra information of the problem such as the structures of the
known global energy minima. In addition, the robustness of the algorithm with
respect to the randomness of initial conditions of the problem is demonstrated
by ten successful independent runs up to 183 atoms. Our results indicate that
the CSA method is a general and yet efficient global optimization algorithm
applicable to many systems.Comment: revtex, 4 pages, 2 figures. Physical Review Letters, in pres
Global Optimization by Basin-Hopping and the Lowest Energy Structures of Lennard-Jones Clusters Containing up to 110 Atoms
We describe a global optimization technique using `basin-hopping' in which
the potential energy surface is transformed into a collection of
interpenetrating staircases. This method has been designed to exploit the
features which recent work suggests must be present in an energy landscape for
efficient relaxation to the global minimum. The transformation associates any
point in configuration space with the local minimum obtained by a geometry
optimization started from that point, effectively removing transition state
regions from the problem. However, unlike other methods based upon hypersurface
deformation, this transformation does not change the global minimum. The lowest
known structures are located for all Lennard-Jones clusters up to 110 atoms,
including a number that have never been found before in unbiased searches.Comment: 8 pages, 3 figures, revte
Testing Nelder-Mead based repulsion algorithms for multiple roots of nonlinear systems via a two-level factorial design of experiments
This paper addresses the challenging task of computing multiple roots of a system of nonlinear equations. A repulsion algorithm that invokes the Nelder-Mead (N-M) local search method and uses a penalty-type merit function based on the error function, known as 'erf', is presented. In the N-M algorithm context, different strategies are proposed to enhance the quality of the solutions and improve the overall efficiency. The main goal of this paper is to use a two-level factorial design of experiments to analyze the statistical significance of the observed differences in selected performance criteria produced when testing different strategies in the N-M based repulsion algorithm. The main goal of this paper is to use a two-level factorial design of experiments to analyze the statistical significance of the observed differences in selected performance criteria produced when testing different strategies in the N-M based repulsion algorithm.Fundação para a Ciência e Tecnologia (FCT
Exploiting the genetic diversity of maize using a combined metabolomic, enzyme activity profiling, and metabolic modelling approach to link leaf physiology to kernel yield
A combined metabolomic, biochemical, fluxomic, and metabolic modeling approach was developed using 19 genetically distant maize (Zea mays) lines from Europe and America. Considerable differences were detected between the lines when leaf metabolic profiles and activities of the main enzymes involved in primary metabolism were compared. During grain filling, the leaf metabolic composition appeared to be a reliable marker, allowing a classification matching the genetic diversity of the lines. During the same period, there was a significant correlation between the genetic distance of the lines and the activities of enzymes involved in carbon metabolism, notably glycolysis. Although large differences were observed in terms of leaf metabolic fluxes, these variations were not tightly linked to the genome structure of the lines. Both correlation studies and metabolic network analyses allowed the description of a maize ideotype with a high grain yield potential. Such an ideotype is characterized by low accumulation of soluble amino acids and carbohydrates in the leaves and high activity of enzymes involved in the C4 photosynthetic pathway and in the biosynthesis of amino acids derived from glutamate. Chlorogenates appear to be important markers that can be used to select for maize lines that produce larger kernels
Reconstruction of Genome-Scale Active Metabolic Networks for 69 Human Cell Types and 16 Cancer Types Using INIT
Development of high throughput analytical methods has given physicians the potential access to extensive and patient-specific data sets, such as gene sequences, gene expression profiles or metabolite footprints. This opens for a new approach in health care, which is both personalized and based on system-level analysis. Genome-scale metabolic networks provide a mechanistic description of the relationships between different genes, which is valuable for the analysis and interpretation of large experimental data-sets. Here we describe the generation of genome-scale active metabolic networks for 69 different cell types and 16 cancer types using the INIT (Integrative Network Inference for Tissues) algorithm. The INIT algorithm uses cell type specific information about protein abundances contained in the Human Proteome Atlas as the main source of evidence. The generated models constitute the first step towards establishing a Human Metabolic Atlas, which will be a comprehensive description (accessible online) of the metabolism of different human cell types, and will allow for tissue-level and organism-level simulations in order to achieve a better understanding of complex diseases. A comparative analysis between the active metabolic networks of cancer types and healthy cell types allowed for identification of cancer-specific metabolic features that constitute generic potential drug targets for cancer treatment
Zea mays iRS1563: A Comprehensive Genome-Scale Metabolic Reconstruction of Maize Metabolism
The scope and breadth of genome-scale metabolic reconstructions have continued to expand over the last decade. Herein, we introduce a genome-scale model for a plant with direct applications to food and bioenergy production (i.e., maize). Maize annotation is still underway, which introduces significant challenges in the association of metabolic functions to genes. The developed model is designed to meet rigorous standards on gene-protein-reaction (GPR) associations, elementally and charged balanced reactions and a biomass reaction abstracting the relative contribution of all biomass constituents. The metabolic network contains 1,563 genes and 1,825 metabolites involved in 1,985 reactions from primary and secondary maize metabolism. For approximately 42% of the reactions direct literature evidence for the participation of the reaction in maize was found. As many as 445 reactions and 369 metabolites are unique to the maize model compared to the AraGEM model for A. thaliana. 674 metabolites and 893 reactions are present in Zea mays iRS1563 that are not accounted for in maize C4GEM. All reactions are elementally and charged balanced and localized into six different compartments (i.e., cytoplasm, mitochondrion, plastid, peroxisome, vacuole and extracellular). GPR associations are also established based on the functional annotation information and homology prediction accounting for monofunctional, multifunctional and multimeric proteins, isozymes and protein complexes. We describe results from performing flux balance analysis under different physiological conditions, (i.e., photosynthesis, photorespiration and respiration) of a C4 plant and also explore model predictions against experimental observations for two naturally occurring mutants (i.e., bm1 and bm3). The developed model corresponds to the largest and more complete to-date effort at cataloguing metabolism for a plant species
A Genome-Scale Metabolic Reconstruction of Mycoplasma genitalium, iPS189
With a genome size of ∼580 kb and approximately 480 protein coding regions, Mycoplasma genitalium is one of the smallest known self-replicating organisms and, additionally, has extremely fastidious nutrient requirements. The reduced genomic content of M. genitalium has led researchers to suggest that the molecular assembly contained in this organism may be a close approximation to the minimal set of genes required for bacterial growth. Here, we introduce a systematic approach for the construction and curation of a genome-scale in silico metabolic model for M. genitalium. Key challenges included estimation of biomass composition, handling of enzymes with broad specificities, and the lack of a defined medium. Computational tools were subsequently employed to identify and resolve connectivity gaps in the model as well as growth prediction inconsistencies with gene essentiality experimental data. The curated model, M. genitalium iPS189 (262 reactions, 274 metabolites), is 87% accurate in recapitulating in vivo gene essentiality results for M. genitalium. Approaches and tools described herein provide a roadmap for the automated construction of in silico metabolic models of other organisms
- …