30 research outputs found

    HIV-Specific Probabilistic Models of Protein Evolution

    Get PDF
    Comparative sequence analyses, including such fundamental bioinformatics techniques as similarity searching, sequence alignment and phylogenetic inference, have become a mainstay for researchers studying type 1 Human Immunodeficiency Virus (HIV-1) genome structure and evolution. Implicit in comparative analyses is an underlying model of evolution, and the chosen model can significantly affect the results. In general, evolutionary models describe the probabilities of replacing one amino acid character with another over a period of time. Most widely used evolutionary models for protein sequences have been derived from curated alignments of hundreds of proteins, usually based on mammalian genomes. It is unclear to what extent these empirical models are generalizable to a very different organism, such as HIV-1–the most extensively sequenced organism in existence. We developed a maximum likelihood model fitting procedure to a collection of HIV-1 alignments sampled from different viral genes, and inferred two empirical substitution models, suitable for describing between-and within-host evolution. Our procedure pools the information from multiple sequence alignments, and provided software implementation can be run efficiently in parallel on a computer cluster. We describe how the inferred substitution models can be used to generate scoring matrices suitable for alignment and similarity searches. Our models had a consistently superior fit relative to the best existing models and to parameter-rich data-driven models when benchmarked on independent HIV-1 alignments, demonstrating evolutionary biases in amino-acid substitution that are unique to HIV, and that are not captured by the existing models. The scoring matrices derived from the models showed a marked difference from common amino-acid scoring matrices. The use of an appropriate evolutionary model recovered a known viral transmission history, whereas a poorly chosen model introduced phylogenetic error. We argue that our model derivation procedure is immediately applicable to other organisms with extensive sequence data available, such as Hepatitis C and Influenza A viruses

    Genetic Signature of Rapid IHHNV (Infectious Hypodermal and Hematopoietic Necrosis Virus) Expansion in Wild Penaeus Shrimp Populations

    Get PDF
    Infectious hypodermal and hematopoietic necrosis virus (IHHNV) is a widely distributed single-stranded DNA parvovirus that has been responsible for major losses in wild and farmed penaeid shrimp populations on the northwestern Pacific coast of Mexico since the early 1990's. IHHNV has been considered a slow-evolving, stable virus because shrimp populations in this region have recovered to pre-epizootic levels, and limited nucleotide variation has been found in a small number of IHHNV isolates studied from this region. To gain insight into IHHNV evolutionary and population dynamics, we analyzed IHHNV capsid protein gene sequences from 89 Penaeus shrimp, along with 14 previously published sequences. Using Bayesian coalescent approaches, we calculated a mean rate of nucleotide substitution for IHHNV that was unexpectedly high (1.39×10−4 substitutions/site/year) and comparable to that reported for RNA viruses. We found more genetic diversity than previously reported for IHHNV isolates and highly significant subdivision among the viral populations in Mexican waters. Past changes in effective number of infections that we infer from Bayesian skyline plots closely correspond to IHHNV epizootiological historical records. Given the high evolutionary rate and the observed regional isolation of IHHNV in shrimp populations in the Gulf of California, we suggest regular monitoring of wild and farmed shrimp and restriction of shrimp movement as preventative measures for future viral outbreaks

    Evidence for positive selection in the gene fruitless in Anastrepha fruit flies

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Many genes involved in the sex determining cascade have indicated signals of positive selection and rapid evolution across different species. Even though <it>fruitless </it>is an important gene involved mostly in several aspects of male courtship behavior, the few studies so far have explained its high rates of evolution by relaxed selective constraints. This would indicate that a large portion of this gene has evolved neutrally, contrary to what has been observed for other genes in the sex cascade.</p> <p>Results</p> <p>Here we test whether the <it>fruitless </it>gene has evolved neutrally or under positive selection in species of <it>Anastrepha </it>(Tephritidae: Diptera) using two different approaches, a long-term evolutionary analysis and a populational genetic data analysis. The first analysis was performed by using sequences of three species of <it>Anastrepha </it>and sequences from several species of <it>Drosophila </it>using the ratio of nonsynonymous to synonymous rates of evolution in PAML, which revealed that the <it>fru </it>region here studied has evolved by positive selection. Using Bayes Empirical Bayes we estimated that 16 sites located in the connecting region of the <it>fruitless </it>gene were evolving under positive selection. We also investigated for signs of this positive selection using populational data from 50 specimens from three species of <it>Anastrepha </it>from different localities in Brazil. The use of standard tests of selection and a new test that compares patterns of differential survival between synonymous and nonsynonymous in evolutionary time also provide evidence of positive selection across species and of a selective sweep for one of the species investigated.</p> <p>Conclusions</p> <p>Our data indicate that the high diversification of <it>fru </it>connecting region in <it>Anastrepha </it>flies is due at least in part to positive selection, not merely as a consequence of relaxed selective constraint. These conclusions are based not only on the comparison of distantly related taxa that show long-term divergence time, but also on recently diverged lineages and suggest that episodes of adaptive evolution in <it>fru </it>may be related to sexual selection and/or conflict related to its involvement in male courtship behavior.</p

    Computational analysis of the evolutionarily conserved Missing In Metastasis/Metastasis Suppressor 1 gene predicts novel interactions, regulatory regions and transcriptional control

    Get PDF
    Missing in Metastasis (MIM), or Metastasis Suppressor 1 (MTSS1), is a highly conserved protein, which links the plasma membrane to the actin cytoskeleton. MIM has been implicated in various cancers, however, its modes of action remain largely enigmatic. Here, we performed an extensive in silico characterisation of MIM to gain better understanding of its function. We detected previously unappreciated functional motifs including adaptor protein (AP) complex interaction site and a C-helix, pointing to a role in endocytosis and regulation of actin dynamics, respectively. We also identified new functional regions, characterised with phosphorylation sites or distinct hydrophilic properties. Strong negative selection during evolution, yielding high conservation of MIM, has been combined with positive selection at key sites. Interestingly, our analysis of intra-molecular co-evolution revealed potential regulatory hotspots that coincided with reduced potentially\ua0pathogenic polymorphisms. We explored databases for the mutations and expression levels of MIM in cancer. Experimentally, we focused on chronic lymphocytic leukaemia (CLL), where MIM showed high overall expression, however, downregulation on poor prognosis samples. Finally, we propose strong conservation of MTSS1 also on the transcriptional level and predict novel transcriptional regulators. Our data highlight important targets for future studies on the role of MIM in different tissues and cancers

    Accelerated Evolution of Mitochondrial but Not Nuclear Genomes of Hymenoptera: New Evidence from Crabronid Wasps

    Get PDF
    Mitochondrial genes in animals are especially useful as molecular markers for the reconstruction of phylogenies among closely related taxa, due to the generally high substitution rates. Several insect orders, notably Hymenoptera and Phthiraptera, show exceptionally high rates of mitochondrial molecular evolution, which has been attributed to the parasitic lifestyle of current or ancestral members of these taxa. Parasitism has been hypothesized to entail frequent population bottlenecks that increase rates of molecular evolution by reducing the efficiency of purifying selection. This effect should result in elevated substitution rates of both nuclear and mitochondrial genes, but to date no extensive comparative study has tested this hypothesis in insects. Here we report the mitochondrial genome of a crabronid wasp, the European beewolf (Philanthus triangulum, Hymenoptera, Crabronidae), and we use it to compare evolutionary rates among the four largest holometabolous insect orders (Coleoptera, Diptera, Hymenoptera, Lepidoptera) based on phylogenies reconstructed with whole mitochondrial genomes as well as four single-copy nuclear genes (18S rRNA, arginine kinase, wingless, phosphoenolpyruvate carboxykinase). The mt-genome of P. triangulum is 16,029 bp in size with a mean A+T content of 83.6%, and it encodes the 37 genes typically found in arthropod mt genomes (13 protein-coding, 22 tRNA, and two rRNA genes). Five translocations of tRNA genes were discovered relative to the putative ancestral genome arrangement in insects, and the unusual start codon TTG was predicted for cox2. Phylogenetic analyses revealed significantly longer branches leading to the apocritan Hymenoptera as well as the Orussoidea, to a lesser extent the Cephoidea, and, possibly, the Tenthredinoidea than any of the other holometabolous insect orders for all mitochondrial but none of the four nuclear genes tested. Thus, our results suggest that the ancestral parasitic lifestyle of Apocrita is unlikely to be the major cause for the elevated substitution rates observed in hymenopteran mitochondrial genomes
    corecore