545,127 research outputs found

    An Alternative Model of Amino Acid Replacement

    Full text link
    The observed correlations between pairs of homologous protein sequences are typically explained in terms of a Markovian dynamic of amino acid substitution. This model assumes that every location on the protein sequence has the same background distribution of amino acids, an assumption that is incompatible with the observed heterogeneity of protein amino acid profiles and with the success of profile multiple sequence alignment. We propose an alternative model of amino acid replacement during protein evolution based upon the assumption that the variation of the amino acid background distribution from one residue to the next is sufficient to explain the observed sequence correlations of homologs. The resulting dynamical model of independent replacements drawn from heterogeneous backgrounds is simple and consistent, and provides a unified homology match score for sequence-sequence, sequence-profile and profile-profile alignment.Comment: Minor improvements. Added figure and reference

    Structure, Organization, and Expression of the lct Gene for Lacticin 481, a Novel Lantibiotic Produced by Lactococcus lactis

    Get PDF
    The structural gene for the lactococcal lantibiotic lacticin 481 (lct) has been identified and cloned using a degenerated 20-mer DNA oligonucleotide based on the amino-terminal 7 amino acid residues of the purified protein. The transcription of the lct gene was analyzed, and its promoter was mapped. DNA sequence analysis of the lct gene revealed an open reading frame encoding a peptide of 51 amino acids. Comparison of its deduced amino acid sequence with the amino-terminal sequence and the amino acid composition of lacticin 481 indicates that the 61-residue peptide is prelacticin 481, containing a 27-residue carboxyl-terminal propeptide and a 24-residue amino-terminal leader peptide which lacks the properties of a typical signal sequence and which is significantly different from the leaders of other lantibiotics. The predicted amino acid sequence of prolacticin 481 contains 3 cysteines, 2 serines, and 2 threonines which were not detectable in amino acid analyses of mature lacticin 481. Based on these results and on characterization by two-dimensional NMR techniques, a structural model is proposed in which 2 cysteine residues are involved in lanthionine and one in β-methyllanthionine formation, and a 4th threonine residue is dehydrated. This model predicts a molecular mass for lacticin 481 of 2,901, which is in excellent agreement with that obtained from mass spectrometry.

    Amino acid metabolism conflicts with protein diversity

    Get PDF
    The twenty protein coding amino acids are found in proteomes with different relative abundances. The most abundant amino acid, leucine, is nearly an order of magnitude more prevalent than the least abundant amino acid, cysteine. Amino acid metabolic costs differ similarly, constraining their incorporation into proteins. On the other hand, sequence diversity is necessary for protein folding, function and evolution. Here we present a simple model for a cost-diversity trade-off postulating that natural proteomes minimize amino acid metabolic flux while maximizing sequence entropy. The model explains the relative abundances of amino acids across a diverse set of proteomes. We found that the data is remarkably well explained when the cost function accounts for amino acid chemical decay. More than one hundred proteomes reach comparable solutions to the trade-off by different combinations of cost and diversity. Quantifying the interplay between proteome size and entropy shows that proteomes can get optimally large and diverse

    Molecular cloning and sequence analysis of the cDNA encoding the human acrosin-trypsin inhibitor (HUSI-II)

    Get PDF
    A complete cDNA clone encoding the human acrosin-trypsin inhibitor HUSI-II has been isolated from a cDNA library of human testis and completely sequenced. The cDNA of 594 bp contained an open reading frame of 252 base pairs, The deduced amino acid sequence comprised the complete amino acid sequence of HUSI-II[1] and a putative signal peptide. Northern blotting analysis revealed that HUSI-II is synthesized in testis, epididymis and seminal vesicle, but not in the prostate gland

    A FAMILY OF CATION ATPASE-LIKE MOLECULES FROM PLASMODIUM-FALCIPARUM

    Get PDF
    Abstract. We report the nucleotide and derived amino acid sequence of the ATPase 1 gene from Plasmodium falciparum. The amino acid sequence shares homology with the family of "P-type cation transloeating ATPases in conserved regions important for nucleotide binding, conformational change, or phosphorylation. The gene, which is present on chromosome 5, has a product longer than any other reported for a P-type ATPase. Interstrain analysis from 12 parasite isolates by the polymerase chain reaction reveals that a 330-bp nucleotide sequence encoding three cytoplasmic regions conserved in cation ATPases (regions a-c) is of constant length. By contrast, another 360-bp sequence which is one of four regions we refer to as

    Sequence homology between RNAs encoding rat α-fetoprotein and rat serum albumin

    Get PDF
    We have determined the sequences of the recombinant DNA inserts of three bacterial plasmid cDNA clones containing most of the rat α-fetoprotein mRNA. The resultant nucleotide sequence of α-fetoprotein was exhaustively compared to the nucleotide sequence of the mRNA encoding rat serum albumin. These two mRNAs have extensive homology (50%) throughout and the same intron locations. The amino acid sequence of rat α-fetoprotein has been deduced from the nucleotide sequence, and its comparison to rat serum albumin's amino acid sequence reveals a 34% homology. The regularly spaced positions of the cysteines found in serum albumin are conserved in rat α-fetoprotein, indicating that these two proteins may have a similar secondary folding structure. These homologies indicate that α-fetoprotein and serum albumin were derived by duplication of a common ancestral gene and constitute a gene family

    Identification and Analysis of Novel Amino-Acid Sequence Repeats in Bacillus anthracis str. Ames Proteome Using Computational Tools

    Get PDF
    We have identified four repeats and ten domains that are novel in proteins encoded by the Bacillus anthracis str. Ames proteome using automated in silico methods. A “repeat” corresponds to a region comprising less than 55-amino-acid residues that occur more than once in the protein sequence and sometimes present in tandem. A “domain” corresponds to a conserved region with greater than 55-amino-acid residues and may be present as single or multiple copies in the protein sequence. These correspond to (1) 57-amino-acid-residue PxV domain, (2) 122-amino-acid-residue FxF domain, (3) 111-amino-acid-residue YEFF domain, (4) 109-amino-acid-residue IMxxH domain, (5) 103-amino-acid-residue VxxT domain, (6) 84-amino-acid-residue ExW domain, (7) 104-amino-acid-residue NTGFIG domain, (8) 36-amino-acid-residue NxGK repeat, (9) 95-amino-acid-residue VYV domain, (10) 75-amino-acid-residue KEWE domain, (11) 59-amino-acid-residue AFL domain, (12) 53-amino-acid-residue RIDVK repeat, (13) (a) 41-amino-acid-residue AGQF repeat and (b) 42-amino-acid-residue GSAL repeat. A repeat or domain type is characterized by specific conserved sequence motifs. We discuss the presence of these repeats and domains in proteins from other genomes and their probable secondary structure
    corecore