365 research outputs found

    Protein design in a lattice model of hydrophobic and polar amino acids

    Full text link
    A general strategy is described for finding which amino acid sequences have native states in a desired conformation (inverse design). The approach is used to design sequences of 48 hydrophobic and polar aminoacids on three-dimensional lattice structures. Previous studies employing a sequence-space Monte-Carlo technique resulted in the successful design of one sequence in ten attempts. The present work also entails the exploration of conformations that compete significantly with the target structure for being its ground state. The design procedure is successful in all the ten cases.Comment: RevTeX, 12 pages, 1 figur

    From DNA sequence to application: possibilities and complications

    Get PDF
    The development of sophisticated genetic tools during the past 15 years have facilitated a tremendous increase of fundamental and application-oriented knowledge of lactic acid bacteria (LAB) and their bacteriophages. This knowledge relates both to the assignments of open reading frames (ORF’s) and the function of non-coding DNA sequences. Comparison of the complete nucleotide sequences of several LAB bacteriophages has revealed that their chromosomes have a fixed, modular structure, each module having a set of genes involved in a specific phase of the bacteriophage life cycle. LAB bacteriophage genes and DNA sequences have been used for the construction of temperature-inducible gene expression systems, gene-integration systems, and bacteriophage defence systems. The function of several LAB open reading frames and transcriptional units have been identified and characterized in detail. Many of these could find practical applications, such as induced lysis of LAB to enhance cheese ripening and re-routing of carbon fluxes for the production of a specific amino acid enantiomer. More knowledge has also become available concerning the function and structure of non-coding DNA positioned at or in the vicinity of promoters. In several cases the mRNA produced from this DNA contains a transcriptional terminator-antiterminator pair, in which the antiterminator can be stabilized either by uncharged tRNA or by interaction with a regulatory protein, thus preventing formation of the terminator so that mRNA elongation can proceed. Evidence has accumulated showing that also in LAB carbon catabolite repression in LAB is mediated by specific DNA elements in the vicinity of promoters governing the transcription of catabolic operons. Although some biological barriers have yet to be solved, the vast body of scientific information presently available allows the construction of tailor-made genetically modified LAB. Today, it appears that societal constraints rather than biological hurdles impede the use of genetically modified LAB.

    Targeted plasmid integration into the human genome by an engineered zinc-finger recombinase

    Get PDF
    The development of new methods for gene addition to mammalian genomes is necessary to overcome the limitations of conventional genetic engineering strategies. Although a variety of DNA-modifying enzymes have been used to directly catalyze the integration of plasmid DNA into mammalian genomes, there is still an unmet need for enzymes that target a single specific chromosomal site. We recently engineered zinc-finger recombinase (ZFR) fusion proteins that integrate plasmid DNA into a synthetic target site in the human genome with exceptional specificity. In this study, we present a two-step method for utilizing these enzymes in any cell type at randomly-distributed target site locations. The piggyBac transposase was used to insert recombinase target sites throughout the genomes of human and mouse cell lines. The ZFR efficiently and specifically integrated a transfected plasmid into these genomic target sites and into multiple transposons within a single cell. Plasmid integration was dependent on recombinase activity and the presence of recombinase target sites. This work demonstrates the potential for broad applicability of the ZFR technology in genome engineering, synthetic biology and gene therapy

    Alarming rates of virological failure and HIV-1 drug resistance amongst adolescents living with perinatal HIV in both urban and rural settings: evidence from the EDCTP READY-study in Cameroon

    Get PDF
    Objectives: Adolescents living with perinatal HIV infection (ALPHI) experience persistently high mortality rates, particularly in resource-limited settings. It is therefore clinically important for us to understand the therapeutic response, acquired HIV drug resistance (HIVDR) and associated factors among ALPHI, according to geographical location. Methods: A study was conducted among consenting ALPHI in two urban and two rural health facilities in the Centre Region of Cameroon. World Health Organization (WHO) clinical staging, self-reported adherence, HIVDR early warning indicators (EWIs), immunological status (CD4 count) and plasma viral load (VL) were assessed. For those experiencing virological failure (VF, VL ≥ 1000 copies/mL), HIVDR testing was performed and interpreted using the Stanford HIV Drug Resistance Database v.8.9-1. Results: Of the 270 participants, most were on nonnucleoside reverse transcriptase inhibitor (NNRTI)-based regimens (61.7% urban vs. 82.2% rural), and about one-third were poorly adherent (30.1% vs. 35.1%). Clinical failure rates (WHO-stage III/IV) in both settings were < 15%. In urban settings, the immunological failure (IF) rate (CD4  < 250 cells/μL) was 15.8%, statistically associated with late adolescence, female gender and poor adherence. The VF rate was 34.2%, statistically associated with poor adherence and NNRTI-based antiretroviral therapy. In the rural context, the IF rate was 26.9% and the VF rate was 52.7%, both statistically associated with advanced clinical stages. HIVDR rate was over 90% in both settings. EWIs were delayed drug pick-up, drug stock-outs and suboptimal viral suppression. Conclusions: Poor adherence, late adolescent age, female gender and advanced clinical staging worsen IF. The VF rate is high and consistent with the presence of HIVDR in both settings, driven by poor adherence, NNRTI-based regimen and advanced clinical staging

    Optimization of minimum set of protein–DNA interactions: a quasi exact solution with minimum over-fitting

    Get PDF
    Motivation: A major limitation in modeling protein interactions is the difficulty of assessing the over-fitting of the training set. Recently, an experimentally based approach that integrates crystallographic information of C2H2 zinc finger–DNA complexes with binding data from 11 mutants, 7 from EGR finger I, was used to define an improved interaction code (no optimization). Here, we present a novel mixed integer programming (MIP)-based method that transforms this type of data into an optimized code, demonstrating both the advantages of the mathematical formulation to minimize over- and under-fitting and the robustness of the underlying physical parameters mapped by the code

    Local Gene Regulation Details a Recognition Code within the LacI Transcriptional Factor Family

    Get PDF
    The specific binding of regulatory proteins to DNA sequences exhibits no clear patterns of association between amino acids (AAs) and nucleotides (NTs). This complexity of protein-DNA interactions raises the question of whether a simple set of wide-coverage recognition rules can ever be identified. Here, we analyzed this issue using the extensive LacI family of transcriptional factors (TFs). We searched for recognition patterns by introducing a new approach to phylogenetic footprinting, based on the pervasive presence of local regulation in prokaryotic transcriptional networks. We identified a set of specificity correlations –determined by two AAs of the TFs and two NTs in the binding sites– that is conserved throughout a dominant subgroup within the family regardless of the evolutionary distance, and that act as a relatively consistent recognition code. The proposed rules are confirmed with data of previous experimental studies and by events of convergent evolution in the phylogenetic tree. The presence of a code emphasizes the stable structural context of the LacI family, while defining a precise blueprint to reprogram TF specificity with many practical applications.Ministerio de Ciencia e Innovación, Spain (Formación de Profesorado Universitario fellowship)Ministerio de Ciencia e Innovación, Spain (grant BFU2008-03632/BMC)Madrid (Spain : Region) (grant CCG08-CSIC/SAL-3651

    Probing the Informational and Regulatory Plasticity of a Transcription Factor DNA–Binding Domain

    Get PDF
    Transcription factors have two functional constraints on their evolution: (1) their binding sites must have enough information to be distinguishable from all other sequences in the genome, and (2) they must bind these sites with an affinity that appropriately modulates the rate of transcription. Since both are determined by the biophysical properties of the DNA–binding domain, selection on one will ultimately affect the other. We were interested in understanding how plastic the informational and regulatory properties of a transcription factor are and how transcription factors evolve to balance these constraints. To study this, we developed an in vivo selection system in Escherichia coli to identify variants of the helix-turn-helix transcription factor MarA that bind different sets of binding sites with varying degrees of degeneracy. Unlike previous in vitro methods used to identify novel DNA binders and to probe the plasticity of the binding domain, our selections were done within the context of the initiation complex, selecting for both specific binding within the genome and for a physiologically significant strength of interaction to maintain function of the factor. Using MITOMI, quantitative PCR, and a binding site fitness assay, we characterized the binding, function, and fitness of some of these variants. We observed that a large range of binding preferences, information contents, and activities could be accessed with a few mutations, suggesting that transcriptional regulatory networks are highly adaptable and expandable

    High Diversity at PRDM9 in Chimpanzees and Bonobos

    Get PDF
    BACKGROUND: The PRDM9 locus in mammals has increasingly attracted research attention due to its role in mediating chromosomal recombination and possible involvement in hybrid sterility and hence speciation processes. The aim of this study was to characterize sequence variation at the PRDM9 locus in a sample of our closest living relatives, the chimpanzees and bonobos. METHODOLOGY/PRINCIPAL FINDINGS: PRDM9 contains a highly variable and repetitive zinc finger array. We amplified this domain using long-range PCR and determined the DNA sequences using conventional Sanger sequencing. From 17 chimpanzees representing three subspecies and five bonobos we obtained a total of 12 alleles differing at the nucleotide level. Based on a data set consisting of our data and recently published Pan PRDM9 sequences, we found that at the subspecies level, diversity levels did not differ among chimpanzee subspecies or between chimpanzee subspecies and bonobos. In contrast, the sample of chimpanzees harbors significantly more diversity at PRDM9 than samples of humans. Pan PRDM9 shows signs of rapid evolution including no alleles or ZnFs in common with humans as well as signals of positive selection in the residues responsible for DNA binding. CONCLUSIONS AND SIGNIFICANCE: The high number of alleles specific to the genus Pan, signs of positive selection in the DNA binding residues, and reported lack of conservation of recombination hotspots between chimpanzees and humans suggest that PRDM9 could be active in hotspot recruitment in the genus Pan. Chimpanzees and bonobos are considered separate species and do not have overlapping ranges in the wild, making the presence of shared alleles at the amino acid level between the chimpanzee and bonobo species interesting in view of the hypothesis that PRDM9 plays a universal role in interspecific hybrid sterility

    PDNAsite:identification of DNA-binding site from protein sequence by incorporating spatial and sequence context

    Get PDF
    Protein-DNA interactions are involved in many fundamental biological processes essential for cellular function. Most of the existing computational approaches employed only the sequence context of the target residue for its prediction. In the present study, for each target residue, we applied both the spatial context and the sequence context to construct the feature space. Subsequently, Latent Semantic Analysis (LSA) was applied to remove the redundancies in the feature space. Finally, a predictor (PDNAsite) was developed through the integration of the support vector machines (SVM) classifier and ensemble learning. Results on the PDNA-62 and the PDNA-224 datasets demonstrate that features extracted from spatial context provide more information than those from sequence context and the combination of them gives more performance gain. An analysis of the number of binding sites in the spatial context of the target site indicates that the interactions between binding sites next to each other are important for protein-DNA recognition and their binding ability. The comparison between our proposed PDNAsite method and the existing methods indicate that PDNAsite outperforms most of the existing methods and is a useful tool for DNA-binding site identification. A web-server of our predictor (http://hlt.hitsz.edu.cn:8080/PDNAsite/) is made available for free public accessible to the biological research community

    Computational Structural Analysis: Multiple Proteins Bound to DNA

    Get PDF
    BACKGROUND: With increasing numbers of crystal structures of proteinratioDNA and proteinratioproteinratioDNA complexes publically available, it is now possible to extract sufficient structural, physical-chemical and thermodynamic parameters to make general observations and predictions about their interactions. In particular, the properties of macromolecular assemblies of multiple proteins bound to DNA have not previously been investigated in detail. METHODOLOGY/PRINCIPAL FINDINGS: We have performed computational structural analyses on macromolecular assemblies of multiple proteins bound to DNA using a variety of different computational tools: PISA; PROMOTIF; X3DNA; ReadOut; DDNA and DCOMPLEX. Additionally, we have developed and employed an algorithm for approximate collision detection and overlapping volume estimation of two macromolecules. An implementation of this algorithm is available at http://promoterplot.fmi.ch/Collision1/. The results obtained are compared with structural, physical-chemical and thermodynamic parameters from proteinratioprotein and single proteinratioDNA complexes. Many of interface properties of multiple proteinratioDNA complexes were found to be very similar to those observed in binary proteinratioDNA and proteinratioprotein complexes. However, the conformational change of the DNA upon protein binding is significantly higher when multiple proteins bind to it than is observed when single proteins bind. The water mediated contacts are less important (found in less quantity) between the interfaces of components in ternary (proteinratioproteinratioDNA) complexes than in those of binary complexes (proteinratioprotein and proteinratioDNA).The thermodynamic stability of ternary complexes is also higher than in the binary interactions. Greater specificity and affinity of multiple proteins binding to DNA in comparison with binary protein-DNA interactions were observed. However, protein-protein binding affinities are stronger in complexes without the presence of DNA. CONCLUSIONS/SIGNIFICANCE: Our results indicate that the interface properties: interface area; number of interface residues/atoms and hydrogen bonds; and the distribution of interface residues, hydrogen bonds, van der Walls contacts and secondary structure motifs are independent of whether or not a protein is in a binary or ternary complex with DNA. However, changes in the shape of the DNA reduce the off-rate of the proteins which greatly enhances the stability and specificity of ternary complexes compared to binary ones
    corecore