4 research outputs found
Orthopoxvirus Genome Evolution: The Role of Gene Loss
Poxviruses are highly successful pathogens, known to infect a variety of hosts. The family Poxviridae includes Variola virus, the causative agent of smallpox, which has been eradicated as a public health threat but could potentially reemerge as a bioterrorist threat. The risk scenario includes other animal poxviruses and genetically engineered manipulations of poxviruses. Studies of orthologous gene sets have established the evolutionary relationships of members within the Poxviridae family. It is not clear, however, how variations between family members arose in the past, an important issue in understanding how these viruses may vary and possibly produce future threats. Using a newly developed poxvirus-specific tool, we predicted accurate gene sets for viruses with completely sequenced genomes in the genus Orthopoxvirus. Employing sensitive sequence comparison techniques together with comparison of syntenic gene maps, we established the relationships between all viral gene sets. These techniques allowed us to unambiguously identify the gene loss/gain events that have occurred over the course of orthopoxvirus evolution. It is clear that for all existing Orthopoxvirus species, no individual species has acquired protein-coding genes unique to that species. All existing species contain genes that are all present in members of the species Cowpox virus and that cowpox virus strains contain every gene present in any other orthopoxvirus strain. These results support a theory of reductive evolution in which the reduction in size of the core gene set of a putative ancestral virus played a critical role in speciation and confining any newly emerging virus species to a particular environmental (host or tissue) niche
VADR: validation and annotation of virus sequence submissions to GenBank
Background
GenBank contains over 3 million viral sequences. The National Center for Biotechnology Information (NCBI) previously made available a tool for validating and annotating influenza virus sequences that is used to check submissions to GenBank. Before this project, there was no analogous tool in use for non-influenza viral sequence submissions.
Results
We developed a system called VADR (Viral Annotation DefineR) that validates and annotates viral sequences in GenBank submissions. The annotation system is based on the analysis of the input nucleotide sequence using models built from curated RefSeqs. Hidden Markov models are used to classify sequences by determining the RefSeq they are most similar to, and feature annotation from the RefSeq is mapped based on a nucleotide alignment of the full sequence to a covariance model. Predicted proteins encoded by the sequence are validated with nucleotide-to-protein alignments using BLAST. The system identifies 43 types of “alerts” that (unlike the previous BLAST-based system) provide deterministic and rigorous feedback to researchers who submit sequences with unexpected characteristics. VADR has been integrated into GenBank’s submission processing pipeline allowing for viral submissions passing all tests to be accepted and annotated automatically, without the need for any human (GenBank indexer) intervention. Unlike the previous submission-checking system, VADR is freely available (https://github.com/nawrockie/vadr) for local installation and use. VADR has been used for Norovirus submissions since May 2018 and for Dengue virus submissions since January 2019. Since March 2020, VADR has also been used to check SARS-CoV-2 sequence submissions. Other viruses with high numbers of submissions will be added incrementally.
Conclusion
VADR improves the speed with which non-flu virus submissions to GenBank can be checked and improves the content and quality of the GenBank annotations. The availability and portability of the software allow researchers to run the GenBank checks prior to submitting their viral sequences, and thereby gain confidence that their submissions will be accepted immediately without the need to correspond with GenBank staff. Reciprocally, the adoption of VADR frees GenBank staff to spend more time on services other than checking routine viral sequence submissions
The Phylogeny and Pathogenesis of Sacbrood Virus (SBV) Infection in European Honey Bees, Apis mellifera
RNA viruses that contain single-stranded RNA genomes of positive sense make up the largest group of pathogens infecting honey bees. Sacbrood virus (SBV) is one of the most widely distributed honey bee viruses and infects the larvae of honey bees, resulting in failure to pupate and death. Among all of the viruses infecting honey bees, SBV has the greatest number of complete genomes isolated from both European honey bees Apis mellifera and Asian honey bees A. cerana worldwide. To enhance our understanding of the evolution and pathogenicity of SBV, in this study, we present the first report of whole genome sequences of two U.S. strains of SBV. The complete genome sequences of the two U.S. SBV strains were deposited in GenBank under accession numbers: MG545286.1 and MG545287.1. Both SBV strains show the typical genomic features of the Iflaviridae family. The phylogenetic analysis of the single polyprotein coding region of the U.S. strains, and other GenBank SBV submissions revealed that SBV strains split into two distinct lineages, possibly reflecting host affiliation. The phylogenetic analysis based on the 5′UTR revealed a monophyletic clade with the deep parts of the tree occupied by SBV strains from both A. cerane and A. mellifera, and the tips of branches of the tree occupied by SBV strains from A. mellifera. The study of the cold stress on the pathogenesis of the SBV infection showed that cold stress could have profound effects on sacbrood disease severity manifested by increased mortality of infected larvae. This result suggests that the high prevalence of sacbrood disease in early spring may be due to the fluctuating temperatures during the season. This study will contribute to a better understanding of the evolution and pathogenesis of SBV infection in honey bees, and have important epidemiological relevance