38 research outputs found

    Structural Variation and Site Variability in Proteins

    What does protein structure tell us about site variability? We seek to answer this question by analyzing the relationship between the variability at individual sites in alignments of viral sequences and the properties of those sites in the three-dimensional structures of the corresponding proteins.

    Calculating site-specific evolutionary rates at the amino-acid or codon level yields similar rate estimates

    Site-specific evolutionary rates can be estimated from codon sequences or from amino-acid sequences. For codon sequences, the most popular methods use some variation of the dN/dS ratio. For amino-acid sequences, one widely used method is called Rate4Site, and it assigns a relative conservation score to each site in an alignment. How site-wise dN/dS values relate to Rate4Site scores is not known. Here we elucidate the relationship between these two rate measurements. We simulate sequences with known dN/dS, using either dN/dS models or mutation–selection models for simulation. We then infer Rate4Site scores on the simulated alignments, and we compare those scores to either true or inferred dN/dS values on the same alignments. We find that Rate4Site scores generally correlate well with true dN/dS, and the correlation strengths increase in alignments with greater sequence divergence and more taxa. Moreover, Rate4Site scores correlate very well with inferred (as opposed to true) dN/dS values, even for small alignments with little divergence. Finally, we verify this relationship between Rate4Site and dN/dS in a variety of empirical datasets. We conclude that codon-level and amino-acid-level analysis frameworks are directly comparable and yield very similar inferences.
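The comparison described in this abstract comes down to correlating two per-site score vectors. Below is a minimal sketch of that step using a plain Pearson correlation. The abstract does not say which correlation statistic the authors used, so that choice is an assumption, and the per-site values are invented for illustration only.

```python
import math

def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two equal-length sequences with at least 2 values")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)

# Hypothetical per-site values (not taken from the paper).
site_dnds = [0.05, 0.10, 0.20, 0.40, 0.80]       # site-wise dN/dS
site_rate4site = [0.20, 0.35, 0.60, 1.10, 2.10]  # relative amino-acid rates

r = pearson(site_dnds, site_rate4site)
print(f"Pearson r between dN/dS and Rate4Site-style scores: {r:.3f}")
```

A rank-based statistic such as Spearman's rho would be a reasonable alternative when the two measures are expected to be monotonically but not linearly related.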

    Calculating site-specific evolutionary rates at the amino-acid or codon level yields similar rate estimates

    Site-specific evolutionary rates can be estimated from codon sequences or from amino-acid sequences. For codon sequences, the most popular methods use some variation of the dN/dS ratio. For amino-acid sequences, one widely used method is called Rate4Site, and it assigns a relative conservation score to each site in an alignment. How site-wise dN/dS values relate to Rate4Site scores is not known. Here we elucidate the relationship between these two rate measurements. We simulate sequences with known dN/dS, using either dN/dS models or mutation-selection models for simulation. We then infer Rate4Site scores on the simulated alignments, and we compare those scores to either true or inferred dN/dS values on the same alignments. We find that Rate4Site scores generally correlate well with true dN/dS, and the correlation strengths increase in alignments with greater sequence divergence and more taxa. Moreover, Rate4Site scores correlate very well with inferred (as opposed to true) dN/dS values, even for small alignments with little divergence. Finally, we verify this relationship between Rate4Site and dN/dS in a variety of empirical datasets. We conclude that codon-level and amino-acid-level analysis frameworks are directly comparable and yield very similar inferences.

    PeptideBuilder: A simple Python library to generate model peptides

    We present a simple Python library to construct models of polypeptides from scratch. The intended use case is the generation of peptide models with pre-specified backbone angles. For example, using our library, one can generate a model of a set of amino acids in a specific conformation using just a few lines of Python code. We do not provide any tools for energy minimization or rotamer packing, since powerful tools are available for these purposes. Instead, we provide a simple Python interface that enables one to add residues to a peptide chain in any desired conformation. Bond angles and bond lengths can be manipulated if so desired, and reasonable values are used by default.
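To illustrate what building a chain from pre-specified bond lengths, bond angles, and backbone dihedrals involves, here is a self-contained sketch that places one atom from three already-placed atoms and a set of internal coordinates. This is not PeptideBuilder's API. The function names, the starting coordinates, and the numeric values are assumptions chosen for illustration.

```python
import math

def sub(u, v): return (u[0] - v[0], u[1] - v[1], u[2] - v[2])
def dot(u, v): return u[0] * v[0] + u[1] * v[1] + u[2] * v[2]
def norm(u): return math.sqrt(dot(u, u))
def unit(u): return tuple(x / norm(u) for x in u)
def cross(u, v):
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])

def place_atom(a, b, c, length, angle_deg, dihedral_deg):
    """Return d such that |c-d| = length, angle(b, c, d) = angle_deg,
    and dihedral(a, b, c, d) = dihedral_deg (sign as in `dihedral` below)."""
    theta = math.radians(angle_deg)
    phi = math.radians(dihedral_deg)
    bc = unit(sub(c, b))
    n = unit(cross(sub(b, a), bc))   # normal to the a-b-c plane
    m = cross(n, bc)                 # completes the local frame (bc, m, n)
    local = (-length * math.cos(theta),
             length * math.sin(theta) * math.cos(phi),
             length * math.sin(theta) * math.sin(phi))
    return tuple(c[i] + local[0] * bc[i] + local[1] * m[i] + local[2] * n[i]
                 for i in range(3))

def bond_angle(a, b, c):
    """Angle a-b-c in degrees."""
    cos_t = dot(unit(sub(a, b)), unit(sub(c, b)))
    return math.degrees(math.acos(max(-1.0, min(1.0, cos_t))))

def dihedral(a, b, c, d):
    """Signed dihedral a-b-c-d in degrees."""
    b1, b2, b3 = sub(b, a), sub(c, b), sub(d, c)
    n1, n2 = cross(b1, b2), cross(b2, b3)
    return math.degrees(math.atan2(norm(b2) * dot(b1, n2), dot(n1, n2)))

# Hypothetical coordinates for three backbone atoms (N, CA, C), in angstroms.
n_atom, ca_atom, c_atom = (0.0, 1.0, 0.0), (0.0, 0.0, 0.0), (1.5, 0.0, 0.0)

# Place the next residue's N using illustrative values: a C-N bond of
# roughly 1.33 A, a CA-C-N angle of roughly 116 degrees, and psi = -60 degrees.
next_n = place_atom(n_atom, ca_atom, c_atom, 1.33, 116.0, -60.0)
print(next_n)
```

Repeating this placement atom by atom, with a length, angle, and dihedral for each new atom, extends a chain in a chosen conformation, which is the general idea the abstract describes.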

    Measuring evolutionary rates of proteins in a structural context [version 1; referees: 3 approved]

    We describe how to measure site-specific rates of evolution in protein-coding genes and how to correlate these rates with structural features of the expressed protein, such as relative solvent accessibility, secondary structure, or weighted contact number. We present two alternative approaches to rate calculations, one based on relative amino-acid rates and the other based on site-specific codon rates measured as dN/dS. In addition to describing the specific analysis protocols we recommend, we also provide a code repository containing scripts to facilitate these kinds of analyses.
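As a concrete example of one structural feature named above, the weighted contact number (WCN) of a residue is commonly defined as the sum of inverse squared distances to all other residues, WCN_i = sum over j != i of 1 / r_ij^2. The sketch below computes it from one representative point per residue. Which atom represents each residue (for example the alpha carbon or the side-chain center) is a choice this sketch leaves open, and the coordinates are made up; it is not the script from the authors' repository.

```python
def weighted_contact_numbers(coords):
    """WCN for each point: sum of 1 / r_ij^2 over all other points j."""
    wcn = []
    for i, p in enumerate(coords):
        total = 0.0
        for j, q in enumerate(coords):
            if i != j:
                # Squared Euclidean distance between points i and j.
                r2 = sum((a - b) ** 2 for a, b in zip(p, q))
                total += 1.0 / r2
        wcn.append(total)
    return wcn

# Hypothetical residue coordinates: three points on a line, 1 unit apart.
toy_coords = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (2.0, 0.0, 0.0)]
toy_wcn = weighted_contact_numbers(toy_coords)
print(toy_wcn)  # the middle point has the most close neighbors
```

Densely packed sites get high WCN values, so the per-site WCN vector can be correlated with per-site rates in the same way as relative solvent accessibility.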

    Replication Data for "Moderate amounts of epistasis are not evolutionarily stable in small populations" by Sydykova et al.

    Simulation data of small populations evolving in epistatic fitness landscapes. The data were generated with and subsequently analyzed by the code archived at: https://doi.org/10.5281/zenodo.355880

    Measuring evolutionary rates of proteins in a structural context [version 2; referees: 4 approved]

    We describe how to measure site-specific rates of evolution in protein-coding genes and how to correlate these rates with structural features of the expressed protein, such as relative solvent accessibility, secondary structure, or weighted contact number. We present two alternative approaches to rate calculations: one based on relative amino-acid rates, and the other based on site-specific codon rates measured as dN/dS. We additionally provide a code repository containing scripts to facilitate the specific analysis protocols we recommend.