25 research outputs found

    Myotis fimbriatus Virome, a Window to Virus Diversity and Evolution in the Genus Myotis

    No full text
    Significant efforts have been made to characterize viral diversity in bats from China. Many of these studies were prospective and focused mainly on Rhinolophus bats that could be related to zoonotic events. However, other species of bats that are part of ecosystems identified as virus diversity hotspots have not been studied in-depth. We analyzed the virome of a group of Myotis fimbriatus bats collected from the Yunnan Province during 2020. The virome of M. fimbriatus revealed the presence of families of pathogenic viruses such as Coronavirus, Astrovirus, Mastadenovirus, and Picornavirus, among others. The viral sequences identified in M. fimbriatus were characterized by significant divergence from other known viral sequences of bat origin. Complex phylogenetic landscapes implying a tendency of co-specificity and relationships with viruses from other mammals characterize these groups. The most prevalent and abundant virus in M. fimbriatus individuals was an alphacoronavirus. The genome of this virus shows evidence of recombination and is likely the product of ancestral host-switch. The close phylogenetic and ecological relationship of some species of the Myotis genus in China may have played an important role in the emergence of this alphacoronavirus

    Intra-host diversity of SARS-Cov-2 should not be neglected : case of the State of Victoria, Australia

    No full text
    International audienceSince the identification of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) as the etiological agent of the current COVID-19 pandemic, a rapid and massive effort has been made to obtain the genomic sequences of this virus to monitor (in near real time) the phylodynamic and diversity of this new pathogen. However, less attention has been given to the assessment of intra-host diversity. RNA viruses such as SARS-CoV-2 inhabit the host as a population of variants called quasispecies. We studied the quasispecies diversity in four of the main SARS-CoV-2 genes (ORF1a, ORF1b, S and N genes), using a dataset consisting of 210 next-generation sequencing (NGS) samples collected between January and early April of 2020 in the State of Victoria, Australia. We found evidence of quasispecies diversity in 68% of the samples, 76% of which was nonsynonymous variants with a higher density in the spike (S) glycoprotein and ORF1a genes. About one-third of the nonsynonymous intra-host variants were shared among the samples, suggesting host-to-host transmission. Quasispecies diversity changed over time. Phylogenetic analysis showed that some of the intra-host single-nucleotide variants (iSNVs) were restricted to specific lineages, highlighting their potential importance in the epidemiology of this virus. A greater effort must be made to determine the magnitude of the genetic bottleneck during transmission and the epidemiological and/or evolutionary factors that may play a role in the changes in the diversity of quasispecies over time

    Improving transcriptome de novo assembly by using a reference genome of a related species: Translational genomics from oil palm to coconut

    No full text
    The palms are a family of tropical origin and one of the main constituents of the ecosystems of these regions around the world. The two main species of palm represent different challenges: coconut (Cocos nucifera L.) is a source of multiple goods and services in tropical communities, while oil palm (Elaeis guineensis Jacq) is the main protagonist of the oil market. In this study, we present a workflow that exploits the comparative genomics between a target species (coconut) and a reference species (oil palm) to improve the transcriptomic data, providing a proteome useful to answer functional or evolutionary questions. This workflow reduces redundancy and fragmentation, two inherent problems of transcriptomic data, while preserving the functional representation of the target species. Our approach was validated in Arabidopsis thaliana using Arabidopsis lyrata and Capsella rubella as references species. This analysis showed the high sensitivity and specificity of our strategy, relatively independent of the reference proteome. The workflow increased the length of proteins products in A. thaliana by 13%, allowing, often, to recover 100% of the protein sequence length. In addition redundancy was reduced by a factor greater than 3. In coconut, the approach generated 29,366 proteins, 1,246 of these proteins deriving from new contigs obtained with the BRANCH software. The coconut proteome presented a functional profile similar to that observed in rice and an important number of metabolic pathways related to secondary metabolism. The new sequences found with BRANCH software were enriched in functions related to biotic stress. Our strategy can be used as a complementary step to de novo transcriptome assembly to get a representative proteome of a target species. The results of the current analysis are available on the website PalmComparomics (http://palm-comparomics.southgreen.fr/)

    Improving transcriptome de novo assembly by using a reference genome of a related species: Translational genomics from oil palm to coconut.

    Get PDF
    The palms are a family of tropical origin and one of the main constituents of the ecosystems of these regions around the world. The two main species of palm represent different challenges: coconut (Cocos nucifera L.) is a source of multiple goods and services in tropical communities, while oil palm (Elaeis guineensis Jacq) is the main protagonist of the oil market. In this study, we present a workflow that exploits the comparative genomics between a target species (coconut) and a reference species (oil palm) to improve the transcriptomic data, providing a proteome useful to answer functional or evolutionary questions. This workflow reduces redundancy and fragmentation, two inherent problems of transcriptomic data, while preserving the functional representation of the target species. Our approach was validated in Arabidopsis thaliana using Arabidopsis lyrata and Capsella rubella as references species. This analysis showed the high sensitivity and specificity of our strategy, relatively independent of the reference proteome. The workflow increased the length of proteins products in A. thaliana by 13%, allowing, often, to recover 100% of the protein sequence length. In addition redundancy was reduced by a factor greater than 3. In coconut, the approach generated 29,366 proteins, 1,246 of these proteins deriving from new contigs obtained with the BRANCH software. The coconut proteome presented a functional profile similar to that observed in rice and an important number of metabolic pathways related to secondary metabolism. The new sequences found with BRANCH software were enriched in functions related to biotic stress. Our strategy can be used as a complementary step to de novo transcriptome assembly to get a representative proteome of a target species. The results of the current analysis are available on the website PalmComparomics (http://palm-comparomics.southgreen.fr/)

    Analysis of sensitivity and specificity in <i>Arabidopsis thaliana</i>.

    No full text
    <p>Curves of sensitivity (triangles) and specificity (stars) in <i>Arabidopsis thaliana</i> for different filtering parameters (step 4) using <i>Arabidopsis lyrata</i> (in red) and <i>Capsella rubella</i> (in blue) as reference species. The filtering parameters consist of three digits. The first represents the identity, the second is the coverage for target polypeptides covering more than 40% of the protein of expected group (reference), and the last is the coverage of target polypeptides covering 40% or less of the protein of the expected group.</p

    Increase on the length of <i>A</i>. <i>thaliana</i> PPs.

    No full text
    <p>Length distribution of polypeptides in step 4 (in orange) and 5 (in green) and distribution of <i>Arabidopsis thaliana</i> TAIR10 proteome (in blue). Protein lengths are indicated as discrete length ranges, from 0–200 to >2000 aa.</p

    Scaffolding of coconut polypeptides homologous to oil palm ECERIFERUM 26-like protein.

    No full text
    <p>(A) Snapshot of oil palm protein, coconut polypeptides and coconut PP in ‘Palmcomparomics’ Jbrowse. Polypeptides within red squares were used for scaffolding of coconut PP. (B) Functional domain identified in coconut sequences and oil palm protein. The sequence O64470.1 of <i>Arabidopsis thaliana</i> is representative of the condensation superfamily. (C) Alignment between oil palm protein, coconut polypeptides and coconut PP around the overlap region.</p

    Methodology and results obtained on coconut.

    No full text
    <p>(A) Principal steps of the methodology to recover a target proteome, using transcriptomic data and a reference proteome from a related better studied organism. (B). Number of sequences generated on coconut by our methodology starting from four coconut transcriptomes. Sequences in red represent the number of new sequences found by BRANCH. Sequences in orange represent <i>de novo</i> transcriptomes assembled by Trinity. Protein products (PP) generated at step 5 (in dark red) combine information from both categories of sequences.</p
    corecore