32 research outputs found

    Fast and Accurate mapping of Next Generation Sequencing Data

    Get PDF
    Ph.DDOCTOR OF PHILOSOPH

    Deletion of the s2m RNA Structure in the Avian Coronavirus Infectious Bronchitis Virus and Human Astrovirus Results in Sequence Insertions

    Get PDF
    Coronaviruses infect a wide variety of host species, resulting in a range of diseases in both humans and animals. The coronavirus genome consists of a large positive-sense single-stranded molecule of RNA containing many RNA structures. One structure, denoted s2m and consisting of 41 nucleotides, is located within the 3′ untranslated region (3′ UTR) and is shared between some coronavirus species, including infectious bronchitis virus (IBV), severe acute respiratory syndrome coronavirus (SARS-CoV), and SARS-CoV-2, as well as other pathogens, including human astrovirus. Using a reverse genetic system to generate recombinant viruses, we investigated the requirement of the s2m structure in the replication of IBV, a globally distributed economically important Gammacoronavirus that infects poultry causing respiratory disease. Deletion of three nucleotides predicted to destabilize the canonical structure of the s2m or the deletion of the nucleotides corresponding to s2m impacted viral replication in vitro. In vitro passaging of the recombinant IBV with the s2m sequence deleted resulted in a 36-nucleotide insertion in place of the deletion, which was identified to be composed of a duplication of flanking sequences. A similar result was observed following serial passage of human astrovirus with a deleted s2m sequence. RNA modeling indicated that deletion of the nucleotides corresponding to the s2m impacted other RNA structures present in the IBV 3′ UTR. Our results indicated for both IBV and human astrovirus a preference for nucleotide occupation in the genome location corresponding to the s2m, which is independent of the specific s2m sequence.publishedVersio

    Multiple novel non-canonically transcribed sub-genomic mRNAs produced by avian coronavirus infectious bronchitis virus

    Get PDF
    Funding: This work was supported by Biotechnology and Biological Sciences Research Council (BBSRC) grants BB/L003988/1 and 1645891, and strategic funding to The Pirbright Institute, BBS/E/I/00007035, BBS/E/I/00007034, BBS/E/I/00007037 and BBS/E/I/00007039.Coronavirus sub-genomic mRNA (sgmRNA) synthesis occurs via a process of discontinuous transcription involving complementary transcription regulatory sequences (TRSs), one (TRS-L) encompassing the leader sequence of the 5' untranslated region (UTR), and the other upstream of each structural and accessory gene (TRS-B). Several coronaviruses have an ORF located between the N gene and the 3'-UTR, an area previously thought to be non-coding in the Gammacoronavirus infectious bronchitis virus (IBV) due to a lack of a canonical TRS-B. Here, we identify a non-canonical TRS-B allowing for a novel sgmRNA relating to this ORF to be produced in several strains of IBV: Beaudette, CR88, H120, D1466, Italy-02 and QX. Interestingly, the potential protein produced by this ORF is prematurely truncated in the Beaudette strain. A single nucleotide deletion was made in the Beaudette strain allowing for the generation of a recombinant IBV (rIBV) that had the potential to express a full-length protein. Assessment of this rIBV in vitro demonstrated that restoration of the full-length potential protein had no effect on viral replication. Further assessment of the Beaudette-derived RNA identified a second non-canonically transcribed sgmRNA located within gene 2. Deep sequencing analysis of allantoic fluid from Beaudette-infected embryonated eggs confirmed the presence of both the newly identified non-canonically transcribed sgmRNAs and highlighted the potential for further yet unidentified sgmRNAs. This HiSeq data, alongside the confirmation of non-canonically transcribed sgmRNAs, indicates the potential of the coronavirus genome to encode a larger repertoire of genes than has currently been identified.Publisher PDFPeer reviewe

    Sequencing and Analysis of Lumpy Skin Disease Virus Whole Genomes Reveals a New Viral Subgroup in West and Central Africa

    Get PDF
    Lumpy skin disease virus (LSDV) is a member of the capripoxvirus (CPPV) genus of the Poxviridae family. LSDV is a rapidly emerging, high-consequence pathogen of cattle, recently spreading from Africa and the Middle East into Europe and Asia. We have sequenced the whole genome of historical LSDV isolates from the Pirbright Institute virus archive, and field isolates from recent disease outbreaks in Sri Lanka, Mongolia, Nigeria and Ethiopia. These genome sequences were compared to published genomes and classified into different subgroups. Two subgroups contained vaccine or vaccine-like samples ("Neethling-like" clade 1.1 and "Kenya-like" subgroup, clade 1.2.2). One subgroup was associated with outbreaks of LSD in the Middle East/Europe (clade 1.2.1) and a previously unreported subgroup originated from cases of LSD in west and central Africa (clade 1.2.3). Isolates were also identified that contained a mix of genes from both wildtype and vaccine samples (vaccine-like recombinants, grouped in clade 2). Whole genome sequencing and analysis of LSDV strains isolated from different regions of Africa, Europe and Asia have provided new knowledge of the drivers of LSDV emergence, and will inform future disease control strategies.</p

    Comprehensive Intrinsic Disorder Analysis of 6108 Viral Proteomes: From the Extent of Intrinsic Disorder Penetrance to Functional Annotation of Disordered Viral Proteins

    No full text
    International audienceMuch of our understanding of proteins and proteomes comes from the traditional protein structure-function paradigm. However, in the last two decades, both computational and experimental studies have provided evidence that a large fraction of functional proteomes across different domains of life consists of intrinsically disordered proteins thus triggering a quest to unravel and decipher protein intrinsic disorder. Unlike structured/ordered proteins, intrinsically disordered proteins/regions (IDPs/IDRs) do not possess a well-defined structure under physiological conditions, and exist as highly dynamic conformational ensembles. In spite of this peculiarity, these proteins have crucial roles in cell signaling and regulation. To date, studies on the abundance and function of IDPs/IDRs in viruses are rather limited. In order to fill this gap, we carried out an extensive and thorough bioinformatics analysis of 283,000 proteins from 6,108 reference viral proteomes. We analyzed protein intrinsic disorder from multiple perspectives, such as abundance of IDPs/IDRs across diverse virus types, their functional annotations, and subcellular localization in taxonomically divergent hosts. We show that the content of IDPs/IDRs in viral proteomes varies broadly as a function of virus genome types an
    corecore