284 research outputs found

    Optimization of minimum set of protein–DNA interactions: a quasi exact solution with minimum over-fitting

    Get PDF
    Motivation: A major limitation in modeling protein interactions is the difficulty of assessing the over-fitting of the training set. Recently, an experimentally based approach that integrates crystallographic information of C2H2 zinc finger–DNA complexes with binding data from 11 mutants, 7 from EGR finger I, was used to define an improved interaction code (no optimization). Here, we present a novel mixed integer programming (MIP)-based method that transforms this type of data into an optimized code, demonstrating both the advantages of the mathematical formulation to minimize over- and under-fitting and the robustness of the underlying physical parameters mapped by the code

    The Evolution of Epigenetic Regulators CTCF and BORIS/CTCFL in Amniotes

    Get PDF
    CTCF is an essential, ubiquitously expressed DNA-binding protein responsible for insulator function, nuclear architecture, and transcriptional control within vertebrates. The gene CTCF was proposed to have duplicated in early mammals, giving rise to a paralogue called “brother of regulator of imprinted sites” (BORIS or CTCFL) with DNA binding capabilities similar to CTCF, but testis-specific expression in humans and mice. CTCF and BORIS have opposite regulatory effects on human cancer-testis genes, the anti-apoptotic BAG1 gene, the insulin-like growth factor 2/H19 imprint control region (IGF2/H19 ICR), and show mutually exclusive expression in humans and mice, suggesting that they are antagonistic epigenetic regulators. We discovered orthologues of BORIS in at least two reptilian species and found traces of its sequence in the chicken genome, implying that the duplication giving rise to BORIS occurred much earlier than previously thought. We analysed the expression of CTCF and BORIS in a range of amniotes by conventional and quantitative PCR. BORIS, as well as CTCF, was found widely expressed in monotremes (platypus) and reptiles (bearded dragon), suggesting redundancy or cooperation between these genes in a common amniote ancestor. However, we discovered that BORIS expression was gonad-specific in marsupials (tammar wallaby) and eutherians (cattle), implying that a functional change occurred in BORIS during the early evolution of therian mammals. Since therians show imprinting of IGF2 but other vertebrate taxa do not, we speculate that CTCF and BORIS evolved specialised functions along with the evolution of imprinting at this and other loci, coinciding with the restriction of BORIS expression to the germline and potential antagonism with CTCF

    High Diversity at PRDM9 in Chimpanzees and Bonobos

    Get PDF
    BACKGROUND: The PRDM9 locus in mammals has increasingly attracted research attention due to its role in mediating chromosomal recombination and possible involvement in hybrid sterility and hence speciation processes. The aim of this study was to characterize sequence variation at the PRDM9 locus in a sample of our closest living relatives, the chimpanzees and bonobos. METHODOLOGY/PRINCIPAL FINDINGS: PRDM9 contains a highly variable and repetitive zinc finger array. We amplified this domain using long-range PCR and determined the DNA sequences using conventional Sanger sequencing. From 17 chimpanzees representing three subspecies and five bonobos we obtained a total of 12 alleles differing at the nucleotide level. Based on a data set consisting of our data and recently published Pan PRDM9 sequences, we found that at the subspecies level, diversity levels did not differ among chimpanzee subspecies or between chimpanzee subspecies and bonobos. In contrast, the sample of chimpanzees harbors significantly more diversity at PRDM9 than samples of humans. Pan PRDM9 shows signs of rapid evolution including no alleles or ZnFs in common with humans as well as signals of positive selection in the residues responsible for DNA binding. CONCLUSIONS AND SIGNIFICANCE: The high number of alleles specific to the genus Pan, signs of positive selection in the DNA binding residues, and reported lack of conservation of recombination hotspots between chimpanzees and humans suggest that PRDM9 could be active in hotspot recruitment in the genus Pan. Chimpanzees and bonobos are considered separate species and do not have overlapping ranges in the wild, making the presence of shared alleles at the amino acid level between the chimpanzee and bonobo species interesting in view of the hypothesis that PRDM9 plays a universal role in interspecific hybrid sterility

    Targeted genome engineering via zinc finger nucleases

    Get PDF
    With the development of next-generation sequencing technology, ever-expanding databases of genetic information from various organisms are available to researchers. However, our ability to study the biological meaning of genetic information and to apply our genetic knowledge to produce genetically modified crops and animals is limited, largely due to the lack of molecular tools to manipulate genomes. Recently, targeted cleavage of the genome using engineered DNA scissors called zinc finger nucleases (ZFNs) has successfully supported the precise manipulation of genetic information in various cells, animals, and plants. In this review, we will discuss the development and applications of ZFN technology for genome engineering and highlight recent reports on its use in plants

    Computational Treatment of Metalloproteins

    Full text link
    Metalloproteins present a considerable challenge for modeling, especially when the starting point is far from thermodynamic equilibrium. Examples include formidable problems such as metalloprotein folding and structure prediction upon metal addition, removal, or even just replacement; metalloenzyme design, where stabilization of a transition state of the catalyzed reaction in the specific binding pocket around the metal needs to be achieved; docking to metal-containing sites and design of metalloenzyme inhibitors. Even more conservative computations, such as elucidations of the mechanisms and energetics of the reaction catalyzed by natural metalloenzymes, are often nontrivial. The reason is the vast span of time and length scales over which these proteins operate, and thus the resultant difficulties in estimating their energies and free energies. It is required to perform extensive sampling, properly treat the electronic structure of the bound metal or metals, and seamlessly merge the required techniques to assess energies and entropies, or their changes, for the entire system. Additionally, the machinery needs to be computationally affordable. Although a great advancement has been made over the years, including some of the seminal works resulting in the 2013 Nobel Prize in chemistry, many aforementioned exciting applications remain far from reach. We review the methodology on the forefront of the field, including several promising methods developed in our lab that bring us closer to the desired modern goals. We further highlight their performance by a few examples of applications

    Computer-aided model-building strategies for protein design

    No full text
    corecore