228 research outputs found

    Reconstructing Ancient Forms of Life

    Progress in the past three months has occurred in two areas: reconstruction of ancestral proteins, and improved understanding of chemical features that are likely to be universal in genetic matter regardless of its genesis. Ancestral ribonucleases have been reconstructed, and an example has been developed that shows how physiological function can be assigned to in vitro behaviors observed in biological systems. Sequence data have been collected to permit the reconstruction of src homology 2 domains that underwent radiative divergence at the time of the radiative divergence of chordates. New studies have been completed that show how genetic matter (or its remnants) might be detected on Mars (or other non-terran locations). Last, the first in vitro selection experiments have been completed using a nucleoside library carrying positively charged functionality, illustrating the importance of non-standard nucleotides to those attempting to obtain evidence for an "RNA world" as an early episode of life on Earth.

    Engineering yeast alcohol dehydrogenase. Replacing Trp54 by Leu broadens substrate specificity

    Analysis of a crystal structure of alcohol dehydrogenase (Adh) from horse liver suggests that Trp54 in the homologous yeast alcohol dehydrogenase prevents the yeast enzyme from efficiently catalysing the oxidation of long-chain primary alcohols with branching at the 4 position (e.g. 4-methyl-1-pentanol, cinnamyl alcohol). This residue has been altered to Leu by site-directed mutagenesis. The alteration yields an enzyme that serves as an effective catalyst for both longer straight-chain primary alcohols and branched-chain alcohols.

    Phylogenomic approaches to common problems encountered in the analysis of low copy repeats: The sulfotransferase 1A gene family example

    BACKGROUND: Blocks of duplicated genomic DNA sequence longer than 1000 base pairs are known as low copy repeats (LCRs). Identified by their sequence similarity, LCRs are abundant in the human genome, and are interesting because they may represent recent adaptive events, or potential future adaptive opportunities within the human lineage. Sequence analysis tools are needed, however, to decide whether these interpretations are likely, whether a particular set of LCRs represents nearly neutral drift creating junk DNA, or whether the appearance of LCRs reflects assembly error. Here we investigate an LCR family containing the sulfotransferase (SULT) 1A genes involved in drug metabolism, cancer, hormone regulation, and neurotransmitter biology as a first step for defining the problems that those tools must manage. RESULTS: Sequence analysis here identified a fourth sulfotransferase gene, which may be transcriptionally active, located on human chromosome 16. Four regions of genomic sequence containing the four human SULT1A paralogs defined a new LCR family. The stem hominoid SULT1A progenitor locus was identified by comparative genomics involving complete human and rodent genomes, and a draft chimpanzee genome. SULT1A expansion in hominoid genomes was followed by positive selection acting on specific protein sites. This episode of adaptive evolution appears to be responsible for the dopamine sulfonation function of some SULT enzymes. Each of the conclusions that this bioinformatic analysis generated using data that has uncertain reliability (such as that from the chimpanzee genome sequencing project) has been confirmed experimentally or by a "finished" chromosome 16 assembly, both of which were published after the submission of this manuscript. CONCLUSION: SULT1A genes expanded from one to four copies in hominoids during intra-chromosomal LCR duplications, including (apparently) one after the divergence of chimpanzees and humans. 
    Thus, LCRs may provide a means for amplifying genes (and other genetic elements) that are adaptively useful. Being located on and among LCRs, however, could make the human SULT1A genes susceptible to further duplications or deletions resulting in 'genomic diseases' for some individuals. Pharmacogenomic studies of SULT1A single nucleotide polymorphisms, therefore, should also consider examining SULT1A copy number variability when searching for genotype-phenotype associations. The latest duplication is, however, only a substantiated hypothesis; an alternative explanation, disfavored by the majority of evidence, is that the duplication is an artifact of incorrect genome assembly.

    Integrating protein structures and precomputed genealogies in the Magnum database: Examples with cellular retinoid binding proteins

    BACKGROUND: When accurate models for the divergent evolution of protein sequences are integrated with complementary biological information, such as folded protein structures, analyses of the combined data often lead to new hypotheses about molecular physiology. This represents an excellent example of how bioinformatics can be used to guide experimental research. However, progress in this direction has been slowed by the lack of a publicly available resource suitable for general use. RESULTS: The precomputed Magnum database offers a solution to this problem for ca. 1,800 full-length protein families with at least one crystal structure. The Magnum deliverables include 1) multiple sequence alignments, 2) mapping of alignment sites to crystal structure sites, 3) phylogenetic trees, 4) inferred ancestral sequences at internal tree nodes, and 5) amino acid replacements along tree branches. Comprehensive evaluations revealed that the automated procedures used to construct Magnum produced accurate models of how proteins divergently evolve, or genealogies, and correctly integrated these with the structural data. To demonstrate Magnum's capabilities, we asked for amino acid replacements requiring three nucleotide substitutions, located at internal protein structure sites, and occurring on short phylogenetic tree branches. In the cellular retinoid binding protein family a site that potentially modulates ligand binding affinity was discovered. Recruitment of cellular retinol binding protein to function as a lens crystallin in the diurnal gecko afforded another opportunity to showcase the predictive value of a browsable database containing branch replacement patterns integrated with protein structures. CONCLUSION: We integrated two areas of protein science, evolution and structure, on a large scale and created a precomputed database, known as Magnum, which is the first freely available resource of its kind. 
    Magnum provides evolutionary and structural bioinformatics resources that are useful for identifying experimentally testable hypotheses about the molecular basis of protein behaviors and functions, as illustrated with the examples from the cellular retinoid binding proteins.

    Application of DETECTER, an evolutionary genomic tool to analyze genetic variation, to the cystic fibrosis gene family

    BACKGROUND: The medical community requires computational tools that distinguish missense genetic differences having phenotypic impact from the vast number of sense mutations that do not. Tools that do this will become increasingly important for those seeking to use human genome sequence data to predict disease, make prognoses, and customize therapy to individual patients. RESULTS: An approach, termed DETECTER, is proposed to identify sites in a protein sequence where amino acid replacements are likely to have a significant effect on phenotype, including causing genetic disease. This approach uses a model-dependent tool to estimate the normalized replacement rate at individual sites in a protein sequence, based on a history of those sites extracted from an evolutionary analysis of the corresponding protein family. This tool identifies sites that have higher-than-average, average, or lower-than-average rates of change in the lineage leading to the sequence in the population of interest. The rates are then combined with sequence data to determine the likelihoods that particular amino acids were present at individual sites in the evolutionary history of the gene family. These likelihoods are used to predict whether any specific amino acid replacements, if introduced at the site in a modern human population, would have a significant impact on fitness. The DETECTER tool is used to analyze the cystic fibrosis transmembrane conductance regulator (CFTR) gene family. CONCLUSION: In this system, DETECTER retrodicts amino acid replacements associated with cystic fibrosis with greater accuracy than alternative approaches. While this result validates the approach for this particular family of proteins only, it may be applicable to the analysis of polymorphisms generally, including SNPs in a human population.

    A hybrid of bovine pancreatic ribonuclease and human angiogenin: an external loop as a module controlling substrate specificity?

    A comparison of the sequences of three homologous ribonucleases (RNase A, angiogenin and bovine seminal RNase) identifies three surface loops that are highly variable between the three proteins. Two hypotheses were contrasted: (i) that this variation might be responsible for the different catalytic activities of the three proteins; and (ii) that this variation is simply an example of surface loops undergoing rapid neutral divergence in sequence. Three hybrids of angiogenin and bovine pancreatic ribonuclease (RNase) A were prepared where regions in these loops taken from angiogenin were inserted into RNase A. Two of the three hybrids had unremarkable catalytic properties. However, the RNase A mutant containing residues 63-74 of angiogenin had greatly diminished catalytic activity against uridylyl-(3′-5′)-adenosine (UpA), and slightly increased catalytic activity as an inhibitor of translation in vitro. Both catalytic behaviors are characteristic of angiogenin. This is one of the first examples of an engineered external loop in a protein. Further, these results complement those recently obtained from the converse experiment, where residues 59-70 of RNase were inserted into angiogenin [Harper and Vallee (1989) Biochemistry, 28, 1875-1884]. Thus, the external loop in residues 63-74 of RNase A appears to behave, at least in part, as an interchangeable 'module' that influences substrate specificity in an enzyme in a way that is isolated from the influences of other regions in the protein.

    Snapshots of an evolved DNA polymerase pre- and post-incorporation of an unnatural nucleotide

    The next challenge in synthetic biology is to be able to replicate synthetic nucleic acid sequences efficiently. The synthetic pair, 2-amino-8-(1-β-D-2'-deoxyribofuranosyl)imidazo[1,2-a]-1,3,5-triazin-[8H]-4-one (trivially designated P) with 6-amino-3-(2'-deoxyribofuranosyl)-5-nitro-1H-pyridin-2-one (trivially designated Z), is replicated by certain Family A polymerases, albeit with lower efficiency. Through directed evolution, we identified a variant KlenTaq polymerase (M444V, P527A, D551E, E832V) that incorporates dZTP opposite P more efficiently than the wild-type enzyme. Here, we report two crystal structures of this variant KlenTaq, a post-incorporation complex that includes a template-primer with P:Z trapped in the active site (binary complex) and a pre-incorporation complex with dZTP paired to template P in the active site (ternary complex). In forming the ternary complex, the fingers domain exhibits a larger closure angle than in natural complexes but engages the template-primer and incoming dNTP through similar interactions. In the binary complex, although many of the interactions found in the natural complexes are retained, there is increased relative motion of the thumb domain. Collectively, our analyses suggest that it is the post-incorporation complex for unnatural substrates that presents a challenge to the natural enzyme and that more efficient replication of P:Z pairs requires a more flexible polymerase.