Discovery of the First Insect Nidovirus, a Missing Evolutionary Link in the Emergence of the Largest RNA Virus Genomes

By Phan Thi Nga, Maria del Carmen Parquet, Chris Lauber, Manmohan Parida, Takeshi Nabeshima, Fuxun Yu, Nguyen Thanh Thuy, Shingo Inoue, Takashi Ito, Kenta Okamoto, Akitoyo Ichinose, Eric J. Snijder, Kouichi Morita and Alexander E. Gorbalenya


Nidoviruses with large genomes (26.3–31.7 kb; ‘large nidoviruses’), including Coronaviridae and Roniviridae, are the most complex positive-sense single-stranded RNA (ssRNA+) viruses. Based on genome size, they are far separated from all other ssRNA+ viruses (below 19.6 kb), including the distantly related Arteriviridae (12.7–15.7 kb; ‘small nidoviruses’). Exceptionally for ssRNA+ viruses, large nidoviruses encode a 3′-5′ exoribonuclease (ExoN) that has been implicated in controlling RNA replication fidelity. Its acquisition may have given rise to the ancestor of large nidoviruses, a hypothesis for which we here provide evolutionary support using comparative genomics involving the newly discovered first insect-borne nidovirus. This Nam Dinh virus (NDiV), named after a Vietnamese province, was isolated from mosquitoes and is yet to be linked to any pathology. The genome of this enveloped 60–80 nm virus is 20,192 nt long and has a nidovirus-like polycistronic organization, including two large, partially overlapping open reading frames (ORFs) 1a and 1b followed by several smaller 3′-proximal ORFs. Peptide sequencing assigned three virion proteins to ORFs 2a, 2b, and 3, which are expressed from two 3′-coterminal subgenomic RNAs. The NDiV ORF1a/ORF1b frameshifting signal and various replicative proteins were tentatively mapped to canonical positions in the nidovirus genome. They include six nidovirus-wide conserved replicase domains, as well as the ExoN and 2′-O-methyltransferase that are specific to large nidoviruses. NDiV ORF1b also encodes a putative N7-methyltransferase, identified in a subset of large nidoviruses, but not the uridylate-specific endonuclease that, in deviation from the current paradigm, is present exclusively in the currently known vertebrate nidoviruses. Rooted phylogenetic inference by Bayesian and Maximum Likelihood methods indicates that NDiV clusters with roniviruses and that its branch diverged from large nidoviruses early after they split from small nidoviruses. Together these characteristics identify NDiV as the prototype of a new nidovirus family and a missing link in the transition from small to large nidoviruses.
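The genome-size argument in the abstract can be checked with simple arithmetic. The following is a minimal sketch, not part of the authors' analysis: the function name `classify_genome_size` and the category labels are hypothetical, and the only data used are the size ranges quoted in the abstract.

```python
# Genome-size ranges in kilobases, exactly as quoted in the abstract.
SMALL_NIDOVIRUS_KB = (12.7, 15.7)   # Arteriviridae ("small nidoviruses")
LARGE_NIDOVIRUS_KB = (26.3, 31.7)   # Coronaviridae, Roniviridae ("large nidoviruses")
OTHER_SSRNA_MAX_KB = 19.6           # upper bound quoted for all other ssRNA+ viruses
NDIV_GENOME_NT = 20_192             # NDiV genome length in nucleotides


def classify_genome_size(length_nt: int) -> str:
    """Place a genome length (in nt) relative to the ranges quoted in the abstract.

    The labels returned here are illustrative only (hypothetical names).
    """
    kb = length_nt / 1000
    if SMALL_NIDOVIRUS_KB[0] <= kb <= SMALL_NIDOVIRUS_KB[1]:
        return "small nidovirus range"
    if LARGE_NIDOVIRUS_KB[0] <= kb <= LARGE_NIDOVIRUS_KB[1]:
        return "large nidovirus range"
    if OTHER_SSRNA_MAX_KB < kb < LARGE_NIDOVIRUS_KB[0]:
        return "intermediate (in the size gap)"
    return "outside the nidovirus ranges"


if __name__ == "__main__":
    print(f"NDiV ({NDIV_GENOME_NT / 1000:.3f} kb): "
          f"{classify_genome_size(NDIV_GENOME_NT)}")
```

Under these assumptions, the 20,192 nt NDiV genome lies above the 19.6 kb bound quoted for other ssRNA+ viruses and below the 26.3 kb lower bound for large nidoviruses, which is consistent with its description as an intermediate between the two size classes.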

Topics: Research Article
Publisher: Public Library of Science
Provided by: PubMed Central

